• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 87
  • 9
  • 9
  • 5
  • 4
  • 4
  • 2
  • 1
  • 1
  • Tagged with
  • 149
  • 149
  • 39
  • 37
  • 36
  • 22
  • 20
  • 18
  • 18
  • 17
  • 17
  • 17
  • 17
  • 15
  • 15
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
61

On Analysis of Sufficient Dimension Reduction Models

An, Panduan 04 June 2019 (has links)
No description available.
62

Advances on Dimension Reduction for Univariate and Multivariate Time Series

Mahappu Kankanamge, Tharindu Priyan De Alwis 01 August 2022 (has links) (PDF)
Advances in modern technologies have led to an abundance of high-dimensional time series data in many fields, including finance, economics, health, engineering, and meteorology, among others. This causes the “curse of dimensionality” problem in both univariate and multivariate time series data. The main objective of time series analysis is to make inferences about the conditional distributions. There are some methods in the literature to estimate the conditional mean and conditional variance functions in time series. However, most of those are inefficient, computationally intensive, or suffer from the overparameterization. We propose some dimension reduction techniques to address the curse of dimensionality in high-dimensional time series dataFor high-dimensional matrix-valued time series data, there are a limited number of methods in the literature that can preserve the matrix structure and reduce the number of parameters significantly (Samadi, 2014, Chen et al., 2021). However, those models cannot distinguish between relevant and irrelevant information and yet suffer from the overparameterization. We propose a novel dimension reduction technique for matrix-variate time series data called the "envelope matrix autoregressive model" (EMAR), which offers substantial dimension reduction and links the mean function and the covariance matrix of the model by using the minimal reducing subspace of the covariance matrix. The proposed model can identify and remove irrelevant information and can achieve substantial efficiency gains by significantly reducing the total number of parameters. We derive the asymptotic properties of the proposed maximum likelihood estimators of the EMAR model. Extensive simulation studies and a real data analysis are conducted to corroborate our theoretical results and to illustrate the finite sample performance of the proposed EMAR model.For univariate time series, we propose sufficient dimension reduction (SDR) methods based on some integral transformation approaches that can preserve sufficient information about the response. In particular, we use the Fourier and Convolution transformation methods (FM and CM) to perform sufficient dimension reduction in univariate time series and estimate the time series central subspace (TS-CS), the time series mean subspace (TS-CMS), and the time series variance subspace (TS-CVS). Using FM and CM procedures and with some distributional assumptions, we derive candidate matrices that can fully recover the TS-CS, TS-CMS, and TS-CVS, and propose an explicit estimate of the candidate matrices. The asymptotic properties of the proposed estimators are established under both normality and non-normality assumptions. Moreover, we develop some data-drive methods to estimate the dimension of the time series central subspaces as well as the lag order. Our simulation results and real data analyses reveal that the proposed methods are not only significantly more efficient and accurate but also offer substantial computational efficiency compared to the existing methods in the literature. Moreover, we develop an R package entitled “sdrt” to easily perform our program code in FM and CM procedures to estimate suffices dimension reduction subspaces in univariate time series.
63

W2R: an ensemble Anomaly detection model inspired by language models for web application firewalls security

Wang, Zelong, AnilKumar, Athira January 2023 (has links)
Nowadays, web application attacks have increased tremendously due to the large number of users and applications. Thus, industries are paying more attention to using Web application Firewalls and improving their security which acts as a shield between the app and the internet by filtering and monitoring the HTTP traffic. Most works focus on either traditional feature extraction or deep methods that require no feature extraction method. We noticed that a combination of an unsupervised language model and a classic dimension reduction method is less explored for this problem. Inspired by this gap, we propose a new unsupervised anomaly detection model with better results than the existing state-of-the-art model for anomaly detection in WAF security. This paper focuses on this structure to explore WAF security: 1) feature extraction from HTTP traffic packets by using NLP (natural language processing) methods such as word2vec and Bert, and 2) Dimension reduction by PCA and Autoencoder, 3) Using different types of anomaly detection techniques including OCSVM, isolation forest, LOF and combination of these algorithms to explore how these methods affect results.  We used the datasets CSIC 2010 and ECML/PKDD 2007 in this paper, and the model has better results.
64

Unsupervised Dimension Reduction Techniques for Lung Cancer Diagnosis Based on Radiomics

Kireta, Janet, Zahed, Mostafa, Dr. 25 April 2023 (has links)
One of the most pressing global health concerns is the impact of cancer, which remains a leading cause of death worldwide. The timeliness of detection and diagnosis is critical to maximizing the chances of successful treatment. Radiomics is an emerging medical imaging analysis proposed, which refers to the high-throughput extraction of a large number of image features. Radiomics generally refers to the use of CT, PET, MRI or Ultrasound imaging as input data, extracting expressive features from massive image-based data, and then using machine learning or statistical models for quantitative analysis and prediction of disease. Feature reduction is very critical in Radiomics as a large number of quantitative features can have redundant characteristics not necessarily important in the analysis process. Due to the immense features obtained from radiological images, the main objective of our research is the application of machine learning techniques to reduce the number of dimensions, thereby rendering the data more manageable. Radiomics involves several steps including: Imaging, segmentation, feature extraction, and analysis. Extracted features can be categorized in the description of tumor gray histograms, shape, texture features, and the tumor location and surrounding tissue. For this research, a large-scale CT dataset for Lung cancer diagnosis (Lung- PET-CT-Dx) which was collected by scholars from Medical University in Harbin in China is used to illustrate the dimension reduction techniques, which is a main part of radiomics process, via R, SAS and Python. The proposed reduction and analysis techniques in our research will entail; Principal Component Analysis, Clustering analysis (Hierarchical Clustering and K-means), and Manifold-based algorithms (Isometric Feature Mapping (ISOMAP).
65

Topics in Multivariate Time Series Analysis: Statistical Control, Dimension Reduction Visualization and Their Business Applications

Huang, Xuan 01 May 2010 (has links)
Most business processes are, by nature, multivariate and autocorrelated. High-dimensionality is rooted in processes where more than one variable is considered simultaneously to provide a more comprehensive picture. Time series models are preferable to an independently identically distributed (I.I.D.) model because they capture the fact that many processes have a memory of their past. Examples of multivariate autocorrelation can be found in processes in the business fields such as Operations Management, Finance and Marketing. The topic of statistical control is most relevant to Quality Management. While both multivariate I.I.D. processes and univariate autocorrelated processes have received much attention in the Statistical Process Control (SPC) literature, little work has been done to simultaneously address high-dimensionality and autocorrelation. In this dissertation, this gap is filled by extending the univariate special cause chart and common cause chart to multivariate situations. In addition, two-chart control schemes are extended to nonstationary processes. Further, a class of Markov Chain models is proposed to provide accurate Average Run Length (ARL) computation when the process is autocorrelated. The second part of this dissertation aims to devise a dimension reduction method for autocorrelated processes. High-dimensionality often obscures the true underlying components of a process. In traditional multivariate literature, Principal Components Analysis (PCA) is the standard tool for dimension reduction. For autocorrelated processes, however, PCA fails to take into account the autocorrelation information. Thus, it is doubtful that PCA is the best choice. A two-step dimension reduction procedure is devised for multivariate time series. Comparisons based on both simulated examples and case studies show that the two-step procedure is more efficient in retrieving true underlying factors. Visualization of multivariate time series assists our understanding of the process. In the last part of this dissertation a simple three-dimensional graph is proposed to assist visualizing the results of PCA. It is intended to complement existing graphical methods for multivariate time series data. The idea is to visualize multivariate data as a surface that in turn can be decomposed with PCA. The developed surface plots are intended for statistical process analysis but may also help visualize economics data and, in particular, co-integration.
66

Predictive Modeling of Spatio-Temporal Datasets in High Dimensions

Chen, Linchao 27 May 2015 (has links)
No description available.
67

Some Statistical Aspects of Association Studies in Genetics and Tests of the Hardy-Weinberg Equilibrium

He, Ran 08 October 2007 (has links)
No description available.
68

Two Essays on Single-index Models

Wu, Zhou 24 September 2008 (has links)
No description available.
69

Regularization for High-dimensional Time Series Models

Sun, Yan 20 September 2011 (has links)
No description available.
70

Topics on Sufficient Dimension Reduction

Nguyen, Son 19 July 2016 (has links)
No description available.

Page generated in 0.1278 seconds