281 |
A Comparison Of Subspace Based Face Recognition Methods
Gonder, Ozkan 01 September 2005
This thesis studies three approaches to face recognition: PCA (Eigenface), Kernel Eigenface and Fisher LDA. Principal component analysis extracts the most important information contained in the face to construct a computational model that best describes it. In the Eigenface approach, variation between face images is described by a set of characteristic face images, the eigenvectors (Eigenfaces) of the covariance matrix of the distribution spanned by a training set of face images. Every face image is then represented as a linear combination of these eigenvectors. Recognition is implemented by projecting a new image into the face subspace spanned by the Eigenfaces and then classifying the face by comparing its position in face space with the positions of known individuals. In the Kernel Eigenface method, a non-linear mapping of the input space is applied before PCA in order to handle non-linearly embedded properties of images (e.g. background differences, illumination changes and facial expressions). In Fisher LDA, LDA is applied after PCA to increase the discrimination between classes.
These methods are implemented on three databases: the Yale face database, the AT&T (formerly Olivetti Research Laboratory) face database, and the METU Vision Lab face database. Experimental results are compared with respect to the effects of changes in illumination, pose and expression.
Kernel Eigenface and Fisher LDA perform slightly better than the Eigenfaces method under changes in illumination. Expression differences did not affect the performance of the Eigenfaces method.
The test results show that the Eigenfaces approach is an adequate method for face recognition systems because of its simplicity, speed and learning capability. It can therefore easily be used in real-time systems.
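The Eigenface pipeline described in this abstract (PCA on training faces, projection into face space, nearest-neighbour comparison) can be sketched roughly as follows. This is only an illustration on synthetic data, not the thesis implementation: the function names, the 8x8 image size, the noise level and the Euclidean nearest-neighbour rule are all assumptions.

```python
import numpy as np

def fit_eigenfaces(train, n_components):
    """train: (n_images, n_pixels) array of flattened face images."""
    mean_face = train.mean(axis=0)
    centered = train - mean_face
    # Rows of vt are eigenvectors of the covariance matrix (the Eigenfaces),
    # ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return mean_face, vt[:n_components]

def project(images, mean_face, eigenfaces):
    """Coordinates of each image in the face subspace."""
    return (images - mean_face) @ eigenfaces.T

def classify(probe, mean_face, eigenfaces, gallery_weights, gallery_labels):
    """Assign the label of the closest known face in face space."""
    w = project(probe[None, :], mean_face, eigenfaces)[0]
    dists = np.linalg.norm(gallery_weights - w, axis=1)
    return gallery_labels[int(np.argmin(dists))]

# Synthetic stand-in for a face database: 3 hypothetical individuals,
# 5 noisy 8x8 "images" each (not data from the thesis).
rng = np.random.default_rng(0)
templates = rng.normal(size=(3, 64))
labels = np.repeat(np.arange(3), 5)
train = templates[labels] + 0.1 * rng.normal(size=(15, 64))

mean_face, eigenfaces = fit_eigenfaces(train, n_components=2)
gallery = project(train, mean_face, eigenfaces)

probe = templates[1] + 0.1 * rng.normal(size=64)  # new image of individual 1
predicted = classify(probe, mean_face, eigenfaces, gallery, labels)
```

Keeping only a few Eigenfaces makes each comparison a distance between short weight vectors, which is consistent with the speed argument made in the abstract.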
|
282 |
Design Of An Electromagnetic Classifier For Spherical Targets
Ayar, Mehmet 01 May 2005
This thesis applies an electromagnetic feature extraction technique to design electromagnetic target classifiers for conductors, dielectrics and dielectric-coated conductors using their natural-resonance-related late-time scattered responses. Classifier databases contain scattered data at only a few aspects for each candidate target. The targets are dielectric spheres of varying sizes and refractive indices, perfectly conducting spheres of varying sizes, and dielectric-coated conducting spheres of varying refractive indices and coating thicknesses. The applied classifier design technique is suitable for real-time target classification because of the computational efficiency of its feature extraction and decision-making approaches. The Wigner-Ville Distribution (WD) is employed in this study in addition to the Principal Components Analysis (PCA) technique to extract target features mainly from late-time target responses. WD is applied to the back-scattered responses at different aspects. To decrease aspect dependency, feature vectors are extracted from selected late-time portions of the WD outputs that include natural-resonance-related information. PCA is also used to fuse the feature vectors and/or late-time target responses extracted from reference aspects of a given target into a single characteristic feature vector for each target, further reducing aspect dependency.
|
283 |
An Extended Study On The Alu Insertion Polymorphisms In Anatolian Human Population
Sekeryapan, Ceran 01 September 2005
In the present study, to estimate the Central Asian contribution to Anatolia, nine Alu insertion polymorphisms (ACE, PV92, FXIIIB, APO, A25, B65, TPA25, D1, HS4.32) were examined in 100 individuals from Anatolia. The Alu insertion frequencies for these loci were calculated as 0.410, 0.220, 0.579, 0.963, 0.067, 0.667, 0.390, 0.427 and 0.637, respectively, and the loci were found to be in Hardy-Weinberg equilibrium (p < 0.05). The observed insertion frequencies of each locus were compared with those of previous observations (Dinç, 2003; Comas et al., 2004), and the results of the present study were found not to differ from those obtained by Comas et al. (2004). The two data sets were therefore pooled (N = 143) and used to examine genetic relationships between populations from Eurasia and Africa.
Pairwise Fst statistics indicated higher genetic similarity between Anatolia and all of the Balkan and some of the Caucasian populations. A Neighbor Joining (NJ) tree based on Reynolds' genetic distances and Principal Component Analysis (PCA) both grouped the Anatolian populations with the Balkan and some of the Caucasian populations and showed clear differentiation of Asian populations from the Anatolian population.
The relative contribution of Central Asian genes to the current Anatolian gene pool was quantified using Admix analysis, using populations of the Balkans (Greek, Romanian, Albanian and Hungarian) and Central Asia (Uighurs, Uzbeks, Tajiks, Kazakhs, Kyrgyz, Dungans) for comparison. Estimates suggest a contribution of roughly 28% from Asia to Anatolia, in concordance with the previous estimate (Benedetto et al., 2001).
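As an illustration of the two per-locus quantities reported in this abstract, the insertion allele frequency and a Hardy-Weinberg equilibrium check, a minimal sketch follows. The genotype counts are hypothetical and are not taken from the thesis; the chi-square goodness-of-fit test is one common way to test the equilibrium and is assumed here, since the abstract does not name the test that was used.

```python
def hardy_weinberg_chi2(n_ins_ins, n_ins_del, n_del_del):
    """Return (insertion allele frequency, chi-square statistic) for one locus.

    Arguments are genotype counts: insertion homozygotes, heterozygotes,
    and homozygotes lacking the insertion.
    """
    n = n_ins_ins + n_ins_del + n_del_del
    p = (2 * n_ins_ins + n_ins_del) / (2 * n)  # insertion allele frequency
    q = 1 - p
    # Genotype counts expected under Hardy-Weinberg proportions.
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_ins_ins, n_ins_del, n_del_del)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return p, chi2

# Hypothetical genotype counts for one locus in 100 individuals.
freq, chi2 = hardy_weinberg_chi2(16, 48, 36)
print(f"insertion frequency = {freq:.3f}, chi-square = {chi2:.3f}")
```

With one degree of freedom, a chi-square statistic below 3.84 indicates no significant departure from equilibrium at the 5% level.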
|
284 |
A simplified approach in FAVAR estimation
Lien Oskarsson, Mathias; Lin, Christopher January 2018
In the field of empirical macroeconomics, factor-augmented vector autoregressive (FAVAR) models have become a popular tool for explaining how economic variables interact over time. FAVAR is based on a data-reduction step in which factors are estimated and then employed in a vector autoregressive model. This paper studies alternative methods of factor estimation. More precisely, we compare the commonly used principal component method with the simpler common correlated effects estimation. The results show low divergence between the two factor estimation methods, indicating that the two approaches are interchangeable.
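The comparison described in this abstract can be illustrated on simulated data. The sketch below estimates a single factor in two ways: as the first principal component of a standardized panel, and as the cross-sectional average of the series, which is the kind of proxy that common correlated effects estimation relies on. The panel dimensions, loadings and noise level are assumptions chosen for illustration, and the vector autoregression step is omitted.

```python
import numpy as np

rng = np.random.default_rng(1)
T, N = 200, 50                                  # periods, series (assumed sizes)
factor = rng.normal(size=T)                     # one simulated latent factor
loadings = rng.uniform(0.5, 1.5, size=N)
panel = np.outer(factor, loadings) + rng.normal(size=(T, N))

# Principal component estimate: first left singular vector of the
# standardized panel, scaled by its singular value.
z = (panel - panel.mean(axis=0)) / panel.std(axis=0)
u, s, _ = np.linalg.svd(z, full_matrices=False)
pc_factor = u[:, 0] * s[0]

# Cross-sectional average of the series in each period.
cce_factor = panel.mean(axis=1)

# The sign of a principal component is arbitrary, so compare absolute correlation.
agreement = abs(np.corrcoef(pc_factor, cce_factor)[0, 1])
print(f"|correlation| between the two factor estimates: {agreement:.3f}")
```

In this simple one-factor setting both estimates track the same underlying series closely; richer panels with several factors would need more care.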
|
285 |
Use of multivariate statistical methods for control of chemical batch processes
Lopez Montero, Eduardo January 2016
In order to meet tight product quality specifications for chemical batch processes, it is vital to monitor and control product quality throughout the batch duration. However, the frequent lack of in situ sensors for continuous monitoring of batch product quality complicates the control problem and calls for novel control approaches. This thesis focuses on the study and application of multivariate statistical methods to control product quality in chemical batch processes. These multivariate statistical methods can be used to identify data-driven prediction models that can be integrated within a model predictive control (MPC) framework. The ideal MPC control strategy achieves end-product quality specifications by performing trajectory tracking during the batch operating time. However, due to the lack of in-situ sensors, measurements of product quality are usually obtained by laboratory assays and are, therefore, inherently intermittent. This thesis proposes a new approach to realise trajectory tracking control of batch product quality in those situations where only intermittent measurements are available. The scope of this methodology consists of: 1) the identification of a partial least squares (PLS) model that works as an estimator of product quality, 2) the transformation of the PLS model into a recursive formulation utilising a moving window technique, and 3) the incorporation of the recursive PLS model as a predictor into a standard MPC framework for tracking the desired trajectory of batch product quality. The structure of the recursive PLS model allows a straightforward incorporation of process constraints in the optimisation process. Additionally, a method to incorporate a nonlinear inner relation within the proposed PLS recursive model is introduced. This nonlinear inner relation is a combination of feedforward artificial neural networks (ANNs) and linear regression. 
Nonlinear models based on this method can predict product quality of highly nonlinear batch processes and can, therefore, be used within an MPC framework to control such processes. The use of linear regression in addition to ANNs within the PLS model reduces the risk of overfitting and also reduces the computational effort of the optimisation carried out by the controller. The benefits of the proposed modelling and control methods are demonstrated using a number of simulated batch processes.
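To make the idea of a PLS quality estimator with a moving window more concrete, here is a rough sketch. It fits a single-response PLS model with the standard NIPALS algorithm and simply refits it on the most recent samples before each prediction. This is a simplification assumed for illustration: the thesis describes a recursive formulation, its details are not given in the abstract, and the data, window length and function names below are hypothetical.

```python
import numpy as np

def pls1_fit(X, y, n_components):
    """Fit a single-response PLS model with the NIPALS algorithm."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xr, yr = X - x_mean, y - y_mean
    W, P, q = [], [], []
    for _ in range(n_components):
        w = Xr.T @ yr
        w /= np.linalg.norm(w)          # weight vector
        t = Xr @ w                      # scores
        tt = t @ t
        p = Xr.T @ t / tt               # X loadings
        c = yr @ t / tt                 # y loading
        Xr = Xr - np.outer(t, p)        # deflate X
        yr = yr - c * t                 # deflate y
        W.append(w)
        P.append(p)
        q.append(c)
    W, P, q = np.array(W).T, np.array(P).T, np.array(q)
    coef = W @ np.linalg.solve(P.T @ W, q)  # regression vector in X space
    return x_mean, y_mean, coef

def pls1_predict(model, x_new):
    x_mean, y_mean, coef = model
    return (x_new - x_mean) @ coef + y_mean

def moving_window_predictions(X, y, window, n_components):
    """Predict each sample from a model fitted on the preceding `window` samples."""
    preds = []
    for k in range(window, len(y)):
        model = pls1_fit(X[k - window:k], y[k - window:k], n_components)
        preds.append(pls1_predict(model, X[k]))
    return np.array(preds)

# Hypothetical noise-free process data: quality is a linear function of 3 inputs.
rng = np.random.default_rng(2)
X = rng.normal(size=(60, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 3.0
preds = moving_window_predictions(X, y, window=20, n_components=3)
max_error = float(np.max(np.abs(preds - y[20:])))
```

Refitting from scratch keeps the sketch short; a recursive update would avoid recomputing the whole model at every step, which matters when the predictor sits inside an MPC optimisation loop.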
|
286 |
Mensuração de risco para empresas do ramo frigorífico [Risk measurement for meat processing companies]
Shirassu, Fabio Koiti 06 February 2015
This paper analyzes a risk measurement and management methodology for meat processing companies. Earnings at Risk (EaR) is used with a top-down approach that expresses income variations as a function of market and idiosyncratic explanatory variables. After eliminating multicollinearity between those variables by means of Principal Component Analysis (PCA), we analyze how the new EaR behaves against the more usual approach built on a multiple linear regression model. Dummy variables are included in the estimation of future results for meat processing companies, representing the occurrence of diseases affecting cattle and the withdrawal of economic embargoes by importing countries during the period of analysis. The results show that the dummy variables do not contribute to determining EaR, and that it cannot be concluded that the EaR model using PCA performs better with fewer variables while keeping the same original variance and statistical significance.
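The role PCA plays in this abstract, removing multicollinearity among explanatory variables before regression, can be sketched on simulated data. The variables, coefficients and the choice of two retained components are assumptions for illustration only, and the EaR calculation itself is not reproduced.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 120
market = rng.normal(size=n)
x1 = market + 0.05 * rng.normal(size=n)   # two nearly collinear market drivers
x2 = market + 0.05 * rng.normal(size=n)
x3 = rng.normal(size=n)                   # an unrelated idiosyncratic driver
X = np.column_stack([x1, x2, x3])
earnings = 2.0 * market - 1.0 * x3 + 0.1 * rng.normal(size=n)

# PCA on the standardized explanatory variables.
Z = (X - X.mean(axis=0)) / X.std(axis=0)
eigvals, eigvecs = np.linalg.eigh(np.cov(Z, rowvar=False))
order = np.argsort(eigvals)[::-1]         # sort by decreasing variance
eigvals, eigvecs = eigvals[order], eigvecs[:, order]
explained = eigvals / eigvals.sum()

k = 2                                     # components kept (assumed)
scores = Z @ eigvecs[:, :k]

# Regress earnings on the uncorrelated component scores.
design = np.column_stack([np.ones(n), scores])
beta, *_ = np.linalg.lstsq(design, earnings, rcond=None)
fitted = design @ beta
r2 = 1 - np.sum((earnings - fitted) ** 2) / np.sum((earnings - earnings.mean()) ** 2)

raw_corr = np.corrcoef(x1, x2)[0, 1]                     # close to 1
score_corr = np.corrcoef(scores, rowvar=False)[0, 1]     # close to 0
```

The component scores are uncorrelated by construction, so the regression coefficients are no longer unstable in the way they are when nearly collinear regressors enter together.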
|
287 |
Abordagens multivariadas para a seleção de variáveis com vistas à caracterização de medicamentos / Multivariate approaches to variable selection in order to characterize medicines
Yamashita, Gabrielli Harumi January 2015
The investigation of the authenticity of medicines has relied on profile analysis by infrared spectroscopy (ATR-FTIR). However, such analysis typically yields a large number of correlated and noisy variables (wavelengths), which calls for techniques that select the most relevant and informative variables and thereby make predictive and exploratory models more robust. This thesis tests approaches to variable selection aimed at clustering and classifying drug samples. First, it derives three variable importance indices from Principal Component Analysis (PCA) parameters; these indices guide an iterative process of variable elimination aimed at more consistent clustering, as measured by the Silhouette Index. Next, the Genetic Algorithm (GA) is combined with the k nearest neighbor (kNN) classification technique to select the subset of variables yielding the highest average accuracy for classifying samples into two groups, authentic or counterfeit. Finally, the ATR-FTIR data are split into intervals to select the most relevant spectroscopic regions for sample classification via kNN; GA is then applied to refine the intervals previously retained. The proposed variable selection methods led to more accurate clustering and classification based on a reduced subset of variables.
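The Silhouette Index used in this abstract to judge clustering quality can be computed as in the sketch below, which also shows how removing a noisy variable can raise it. The tiny data set is made up for illustration and the Euclidean distance is an assumption; the PCA-based importance indices that drive the elimination in the thesis are not reproduced here.

```python
import numpy as np

def silhouette_index(X, labels):
    """Mean silhouette over all samples, using Euclidean distances."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    scores = []
    for i in range(len(X)):
        same = labels == labels[i]
        same[i] = False                   # exclude the sample itself
        if not same.any():                # singleton cluster scores 0
            scores.append(0.0)
            continue
        a = D[i, same].mean()             # mean distance within own cluster
        b = min(D[i, labels == k].mean()  # mean distance to nearest other cluster
                for k in np.unique(labels) if k != labels[i])
        scores.append((b - a) / max(a, b))
    return float(np.mean(scores))

# Made-up samples: variable 0 separates the two groups, variable 1 is noise.
informative = np.array([0.0, 0.1, 0.2, 10.0, 10.1, 10.2])
noisy = np.array([5.0, -5.0, 0.0, -5.0, 5.0, 0.0])
labels = np.array([0, 0, 0, 1, 1, 1])

s_all = silhouette_index(np.column_stack([informative, noisy]), labels)
s_reduced = silhouette_index(informative[:, None], labels)
print(f"all variables: {s_all:.3f}, noisy variable removed: {s_reduced:.3f}")
```

An elimination loop in the spirit of the abstract would drop one variable at a time and keep the subset with the highest index.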
|
288 |
IMAGE-BASED MODELING AND PREDICTION OF NON-STATIONARY GROUND MOTIONS
DAK HAZIRBABA, YILDIZ 01 May 2015
Nonlinear dynamic analysis is a required step in the seismic performance evaluation of many structures. Performing such an analysis requires input ground motions, which are often obtained through simulations because of the lack of sufficient records representing a given scenario. As seismic ground motions are characterized by time-varying amplitude and frequency content, and the response of nonlinear structures is sensitive to temporal variations in the seismic energy input, ground motion non-stationarities should be taken into account in simulations. This paper describes a nonparametric approach for modeling and prediction of non-stationary ground motions. Using Relevance Vector Machines, a regression model is developed that takes as input a set of seismic predictors and produces as output the expected evolutionary power spectral density, conditioned on the predictors. A demonstrative example is presented in which recorded and predicted ground motions are compared in the time, frequency, and time-frequency domains. The results indicate a reasonable match between the recorded and predicted quantities.
|
289 |
Distinct Feature Learning and Nonlinear Variation Pattern Discovery Using Regularized Autoencoders
January 2016
Feature learning and the discovery of nonlinear variation patterns in high-dimensional data are important tasks in many problem domains, such as imaging, streaming data from sensors, and manufacturing. This dissertation presents several methods for learning and visualizing nonlinear variation in high-dimensional data. First, an automated method for discovering nonlinear variation patterns using deep learning autoencoders is proposed. The approach provides a functional mapping from a low-dimensional representation to the original spatially-dense data that is both interpretable and efficient with respect to preserving information. Experimental results indicate that deep learning autoencoders outperform manifold learning and principal component analysis in reproducing the original data from the learned variation sources.
A key issue in using autoencoders for nonlinear variation pattern discovery is to encourage the learning of solutions where each feature represents a unique variation source, which we define as distinct features. This problem of learning distinct features is also referred to as disentangling factors of variation in the representation learning literature. The remainder of this dissertation highlights and provides solutions for this important problem.
An alternating autoencoder training method is presented and a new measure motivated by orthogonal loadings in linear models is proposed to quantify feature distinctness in the nonlinear models. Simulated point cloud data and handwritten digit images illustrate that standard training methods for autoencoders consistently mix the true variation sources in the learned low-dimensional representation, whereas the alternating method produces solutions with more distinct patterns.
Finally, a new regularization method for learning distinct nonlinear features using autoencoders is proposed. Motivated in part by the properties of linear solutions, a series of learning constraints is implemented via regularization penalties during stochastic gradient descent training. These include the orthogonality of tangent vectors to the manifold, the correlation between learned features, and the distributions of the learned features. This regularized learning approach yields low-dimensional representations that can be better interpreted and used to identify the true sources of variation impacting a high-dimensional feature space. Experimental results demonstrate the effectiveness of this method for nonlinear variation pattern discovery on both simulated and real data sets.
Doctoral Dissertation, Industrial Engineering, 2016
|