Global ETD Search

491	Modèles de covariance pour l'analyse et la classification de signaux électroencéphalogrammes / Covariance models for electroencephalogramm signals analysis and classification Spinnato, Juliette 06 July 2015 (has links) Cette thèse s’inscrit dans le contexte de l’analyse et de la classification de signaux électroencéphalogrammes (EEG) par des méthodes d’analyse discriminante. Ces signaux multi-capteurs qui sont, par nature, très fortement corrélés spatialement et temporellement sont considérés dans le plan temps-fréquence. En particulier, nous nous intéressons à des signaux de type potentiels évoqués qui sont bien représentés dans l’espace des ondelettes. Par la suite, nous considérons donc les signaux représentés par des coefficients multi-échelles et qui ont une structure matricielle électrodes × coefficients. Les signaux EEG sont considérés comme un mélange entre l’activité d’intérêt que l’on souhaite extraire et l’activité spontanée (ou "bruit de fond"), qui est largement prépondérante. La problématique principale est ici de distinguer des signaux issus de différentes conditions expérimentales (classes). Dans le cas binaire, nous nous focalisons sur l’approche probabiliste de l’analyse discriminante et des modèles de mélange gaussien sont considérés, décrivant dans chaque classe les signaux en termes de composantes fixes (moyenne) et aléatoires. Cette dernière, caractérisée par sa matrice de covariance, permet de modéliser différentes sources de variabilité. Essentielle à la mise en oeuvre de l’analyse discriminante, l’estimation de cette matrice (et de son inverse) peut être dégradée dans le cas de grandes dimensions et/ou de faibles échantillons d’apprentissage, cadre applicatif de cette thèse. Nous nous intéressons aux alternatives qui se basent sur la définition de modèle(s) de covariance(s) particulier(s) et qui permettent de réduire le nombre de paramètres à estimer. / The present thesis finds itself within the framework of analyzing and classifying electroencephalogram signals (EEG) using discriminant analysis. Those multi-sensor signals which are, by nature, highly correlated spatially and temporally are considered, in this work, in the timefrequency domain. In particular, we focus on low-frequency evoked-related potential-type signals (ERPs) that are well described in the wavelet domain. Thereafter, we will consider signals represented by multi-scale coefficients and that have a matrix structure electrodes × coefficients. Moreover, EEG signals are seen as a mixture between the signal of interest that we want to extract and spontaneous activity (also called "background noise") which is overriding. The main problematic is here to distinguish signals from different experimental conditions (class). In the binary case, we focus on the probabilistic approach of the discriminant analysis and Gaussian mixtures are used, describing in each class the signals in terms of fixed (mean) and random components. The latter, characterized by its covariance matrix, allow to model different variability sources. The estimation of this matrix (and of its inverse) is essential for the implementation of the discriminant analysis and can be deteriorated by high-dimensional data and/or by small learning samples, which is the application framework of this thesis. We are interested in alternatives that are based on specific covariance model(s) and that allow to decrease the number of parameters to estimate. Analyse discriminante Données matricielles Matrice de covariance séparable Modèle de mélange gaussien Modèle linéaire mixte Décomposition en valeurs singulières Transformation en ondelettes discrète Signaux électroencéphalogrammes Discriminant analysis Matrix-Based data Separable covariance matrix Gaussian mixtures Linear mixed model Singular value decomposition Discrete wavelet transform Electroencephalogramm signals
492	Estimation de synchrones de consommation électrique par sondage et prise en compte d'information auxiliaire / Estimate the mean electricity consumption curve by survey and take auxiliary information into account Lardin, Pauline 26 November 2012 (has links) Dans cette thèse, nous nous intéressons à l'estimation de la synchrone de consommation électrique (courbe moyenne). Etant donné que les variables étudiées sont fonctionnelles et que les capacités de stockage sont limitées et les coûts de transmission élevés, nous nous sommes intéressés à des méthodes d'estimation par sondage, alternatives intéressantes aux techniques de compression du signal. Nous étendons au cadre fonctionnel des méthodes d'estimation qui prennent en compte l'information auxiliaire disponible afin d'améliorer la précision de l'estimateur de Horvitz-Thompson de la courbe moyenne de consommation électrique. La première méthode fait intervenir l'information auxiliaire au niveau de l'estimation, la courbe moyenne est estimée à l'aide d'un estimateur basé sur un modèle de régression fonctionnelle. La deuxième l'utilise au niveau du plan de sondage, nous utilisons un plan à probabilités inégales à forte entropie puis l'estimateur de Horvitz-Thompson fonctionnel. Une estimation de la fonction de covariance est donnée par l'extension au cadre fonctionnel de l'approximation de la covariance donnée par Hájek. Nous justifions de manière rigoureuse leur utilisation par une étude asymptotique. Pour chacune de ces méthodes, nous donnons, sous de faibles hypothèses sur les probabilités d'inclusion et sur la régularité des trajectoires, les propriétés de convergence de l'estimateur de la courbe moyenne ainsi que de sa fonction de covariance. Nous établissons également un théorème central limite fonctionnel. Afin de contrôler la qualité de nos estimateurs, nous comparons deux méthodes de construction de bande de confiance sur un jeu de données de courbes de charge réelles. La première repose sur la simulation de processus gaussiens. Une justification asymptotique de cette méthode sera donnée pour chacun des estimateurs proposés. La deuxième utilise des techniques de bootstrap qui ont été adaptées afin de tenir compte du caractère fonctionnel des données / In this thesis, we are interested in estimating the mean electricity consumption curve. Since the study variable is functional and storage capacities are limited or transmission cost are high survey sampling techniques are interesting alternatives to signal compression techniques. We extend, in this functional framework, estimation methods that take into account available auxiliary information and that can improve the accuracy of the Horvitz-Thompson estimator of the mean trajectory. The first approach uses the auxiliary information at the estimation stage, the mean curve is estimated using model-assisted estimators with functional linear regression models. The second method involves the auxiliary information at the sampling stage, considering πps (unequal probability) sampling designs and the functional Horvitz-Thompson estimator. Under conditions on the entropy of the sampling design the covariance function of the Horvitz-Thompson estimator can be estimated with the Hájek approximation extended to the functional framework. For each method, we show, under weak hypotheses on the sampling design and the regularity of the trajectories, some asymptotic properties of the estimator of the mean curve and of its covariance function. We also establish a functional central limit theorem.Next, we compare two methods that can be used to build confidence bands. The first one is based on simulations of Gaussian processes and is assessed rigorously. The second one uses bootstrap techniques in a finite population framework which have been adapted to take into account the functional nature of the data Approximation de Hájek Bande de confiance Bootstrap Données fonctionnelles Estimateur de Horvitz-Thompson Estimateur model-assisted Fonction de covariance Modèle linéaire fonctionnel Théorème central limite fonctionnel Sondage Hajek variance approximation Confidence band Bootstrap Functional data Horvitz-Thompson estimator Model-assisted estimator Covariance function Functional linear model Functional central limit theorem Survey sampling 519
493	Diagnostic d'une Turbine Eolienne à Distance à l'aide du Réseau de Capteurs sans Fil / Diagnosis of a wind turbine using wireless sensor networks Gliga, Lavinius ioan 19 November 2019 (has links) Les Éoliennes à Entraînement Direct (ÉED) sont équipées de Générateurs Syn- chrones à Aimants Permanents (GSAP). Leurs trois plus courantes défaillances sont la dé- magnétisation, l’excentricité (statique, dynamique et mixte) et le court-circuit inter-tour. L’analyse de la signature du courant de la machine est souvent utilisée pour rechercher des problèmes du générateur, car ces altérations introduisent des harmoniques supplémen- taires dans les courants générés. La Transformée de Fourier Rapide (TFR) est utilisée pour calculer le spectre des courants. Cependant, la TFR permet de calculer l’ensemble du spec- tre, tandis que le nombre de défauts possible et le nombre d’harmoniques introduites sont faibles. L’algorithme de Goertzel, mis en oeuvre sous forme de filtre (le filtre de Goertzel), est présenté comme une alternative plus efficace au TFR. Le spectre des courants change avec la vitesse du vent, ce qui rend la détection plus difficile. Le Filtre de Kalman Étendu (FKÉ) est proposé comme solution. Le spectre de résidus, calcule entre les courants estimés et les courants générés, est constant, quelle que soit la vitesse du vent. Cependant, l’effet des défauts est visible dans leur spectre. Lors de l’utilisation de l’FKÉ, un défi consiste à estime la matrice de covariance pour le bruit du processus. Une nouvelle méthode était développée pour ça, qui n’utilise aucune de maîtrise du filtre. Les ÉED sont placés soit dans des zones éloignées, soit dans des villes. Pour la surveillance des ÉED, des dizaines ou des centaines de kilomètres de câbles sont nécessaires. Les Réseaux de Capteurs sans Fil (RCF) sont bien adaptés pour être utilisés dans l’infrastructure de communication des ÉED. RCF ont des coûts initiaux et d’entretien plus faibles et leurs installations sont rapides. De plus, ils peuvent compléter les réseaux câblés. Différentes technologies sans fil sont comparées : les technologies à grande surface, ainsi que les technologies à courte portée qui supportent des débits de données élevés. / Direct Drive Wind Turbines (DDWTs) are equipped with Permanent Magnet Syn- chronous Generators (PMSGs). Their three most common failures are demagnetization, ec- centricity (static, dynamic and mixed) and inter-turn short circuit. Machine Current Signa- ture Analysis is often used to look for generator problems, as these impairments introduce additional harmonics into the generated currents. The Fast Fourier Transform (FFT) is utilized to compute the spectrum of the currents. However, the FFT calculates the whole spectrum, while the number of possible faults and the number of introduced harmonics is low. The Goertzel algorithm, implemented as a filter (the Goertzel filter), is presented as a more efficient alternative to the FFT. The spectrum of the currents changes with the wind speed, and thus the detection is made more difficult. The Extended Kalman Filter (EKF) is proposed as a solution. The spectrum of the residuals, computed between the estimated and the generated current, is constant, regardless of the wind speed. However, the effect of the faults is visible in the spectrum. When using the EKF, one challenge is to find out the covariance matrix of the process noise. A new method was developed in this regard, which does not use any of the matrices of the filter. DDWTs are either placed in remote areas or in cities. For the monitoring of a DDWT, tens or hundreds of kilometers of cables are necessary. Wireless Sensor Networks (WSNs) are suited to be used in the communication infrastructure of DDWTs. WSNs have lower initial and maintenance costs, and they are quickly installed. Moreover, they can complement wired networks. Different wireless technologies are com- pared - both wide area ones, as well as short range technologies which support high data rates. Eoliennes à entraînement direct Filtre de Kalman étendu Transformée de Fourier Filtre de Goertzel Réseaux de capteurs sans fil Direct drive wind turbines Permanent magnet synchronous generator Extended Kalman filter Process covariance estimation Fast Fourier transform Goertzel filter Wireless sensor networks 629.8
494	Surface-atmosphere energy exchanges and their effects on surface climate and boundary layer dynamics in the forest-tundra ecotone in northwestern Canada Graveline, Vincent 04 1900 (has links) La région boréale arctique (RBA) couvre une vaste étendue qui lui confère un rôle important dans le système climatique mondial, par ses échanges d'énergie et de matière avec l'atmosphère. La température de l'air dans la région boréale arctique a augmenté à des taux disproportionnés par rapport à la moyenne mondiale, entraînant des changements dans la composition et la structure de la végétation. La RBA comprend l'écotone de la forêt boréale et de la toundra (EFT), qui s'étend sur plus de 10,000 km à travers l'hémisphère nord. La structure et la composition de la végétation varient considérablement à travers l’EFT. Du sud au nord, les arbres deviennent plus courts, plus dispersés et finalement absents. Ce gradient entraîne des variations dans la balance énergétique de surface. Ainsi, des changements dans la composition et la structure de la végétation dans l’EFT pourraient influencer le climat régional futur de ces régions. Ces changements régionaux pourraient se répercuter sur le climat mondial en interagissant avec le cycle du carbone par des changements dans les régimes de perturbations et la profondeur de la couche limite atmosphérique. L'objectif de cette étude était de développer un état des lieux de la variation latitudinale des interactions entre la surface et l’atmosphère et du climat régional à travers l’EFT dans le nord-ouest du Canada. Nous avons utilisé des mesures de covariance des turbulences provenant d’une forêt subarctique en marge de l’EFT et d’une toundra minérale caractérisant l’EFT du nord-ouest du Canada afin de quantifier les différences journalière et saisonnières des échanges d'énergie. Quatre paramètres de surface (albédo, conductance aérodynamique, conductance de surface et facteur de découplage) ont été examinés dans le but d’expliquer les différences dans la balance énergétique de surface. Des observations par radiosonde basées sur des campagnes de terrain et une expérience de modélisation de la couche limite atmosphérique ont été réalisées afin de discuter des conséquences potentielles des changements de végétation sur la dynamique de la couche limite atmosphérique (hauteur, température, humidité) et ses implications pour le climat régional. La forêt subarctique a démontré une meilleure capacité à transférer la chaleur vers l’atmosphère et une plus grande résistance à l'évapotranspiration, se traduisant par des conditions atmosphériques plus chaudes et sèches, spécialement au printemps. En été et automne, une conductance de surface plus élevée sur le site de la toundra s’est traduite par à une plus grande proportion de l'énergie utilisée pour humidifier l'atmosphère, résultant en une couche atmosphérique moins épaisse et un refroidissement régional du climat. La caractérisation des interactions entre la surface et l’atmosphère à travers l’EFT contribuera à améliorer les prédictions des effets des changements de végétation en cours sur le climat régional dans la région boréale arctique. / Considering its vast extent, the Arctic-boreal region (ABR) plays an important role in the global climate system through its exchange of energy and matter with the atmosphere. Air temperature across the ABR has been increasing at a higher rate compared to the global average and has led to changes in vegetation composition and structure across the ABR. The ABR includes the forest-tundra ecotone (FTE), spanning more than 10,000 km across the northern hemisphere. As the world’s longest transition zone, the FTE separates the boreal and Arctic biomes over a width of only a few tens to hundreds of kilometers. Vegetation composition and structure varies considerably across the FTE as trees become, from south to north, shorter and more stunted, sparser, and eventually, absent. The associated latitudinal gradient in surface properties results in corresponding latitudinal variations in the energy balance. Thus, changes in the latitudinal variation in surface properties and energy exchanges within the atmospheric boundary layer (ABL) may affect future regional climate across the FTE. The goal of this study was to develop a baseline understanding of the latitudinal variation in surface-atmosphere interactions and atmospheric boundary layer dynamics across the FTE in northwestern Canada. We used paired eddy covariance measurements of surface energy fluxes and supporting environmental measurements at a subarctic woodland (‘woodland’) and a mineral upland tundra site (‘tundra’) to quantify differences in daily and seasonal differences in woodland and tundra properties and energy exchanges. Four bulk surface parameters (albedo, aerodynamic conductance, surface conductance, and decoupling factor) were examined to explain drivers of those differences. Campaign-based radiosonde observations and numerical experiments using an ABL model were used to examine the impacts of a sparse tree cover on ABL dynamics (height, temperature, humidity) and their implications for surface climate compared to treeless tundra. The sparse tree cover at the woodland site showed an enhanced ability to transfer heat into the atmosphere and a higher resistance to evapotranspiration compared to tundra, leading to warmer and drier conditions especially in late winter and spring. In summer and fall, higher bulk surface conductance at the tundra site led to more energy being used to moisten the atmosphere, resulting in a shallower ABL and regional cooling of the atmosphere. Refined characterization of land surface-atmosphere interactions across the FTE will help to project the effect of ongoing vegetation changes on regional climate in the circumpolar Arctic-boreal region. Surface energy balance atmospheric boundary layer surface properties forest-tundra ecotone eddy covariance mixed-layer slab model vegetation shifts Balance énergétique de surface Couche limite atmosphérique Propriétés de surface Écotone forêt-toundra Covariance des turbulences
495	The identification and application of common principal components Pepler, Pieter Theo 12 1900 (has links) Thesis (PhD)--Stellenbosch University, 2014. / ENGLISH ABSTRACT: When estimating the covariance matrices of two or more populations, the covariance matrices are often assumed to be either equal or completely unrelated. The common principal components (CPC) model provides an alternative which is situated between these two extreme assumptions: The assumption is made that the population covariance matrices share the same set of eigenvectors, but have di erent sets of eigenvalues. An important question in the application of the CPC model is to determine whether it is appropriate for the data under consideration. Flury (1988) proposed two methods, based on likelihood estimation, to address this question. However, the assumption of multivariate normality is untenable for many real data sets, making the application of these parametric methods questionable. A number of non-parametric methods, based on bootstrap replications of eigenvectors, is proposed to select an appropriate common eigenvector model for two population covariance matrices. Using simulation experiments, it is shown that the proposed selection methods outperform the existing parametric selection methods. If appropriate, the CPC model can provide covariance matrix estimators that are less biased than when assuming equality of the covariance matrices, and of which the elements have smaller standard errors than the elements of the ordinary unbiased covariance matrix estimators. A regularised covariance matrix estimator under the CPC model is proposed, and Monte Carlo simulation results show that it provides more accurate estimates of the population covariance matrices than the competing covariance matrix estimators. Covariance matrix estimation forms an integral part of many multivariate statistical methods. Applications of the CPC model in discriminant analysis, biplots and regression analysis are investigated. It is shown that, in cases where the CPC model is appropriate, CPC discriminant analysis provides signi cantly smaller misclassi cation error rates than both ordinary quadratic discriminant analysis and linear discriminant analysis. A framework for the comparison of di erent types of biplots for data with distinct groups is developed, and CPC biplots constructed from common eigenvectors are compared to other types of principal component biplots using this framework. A subset of data from the Vermont Oxford Network (VON), of infants admitted to participating neonatal intensive care units in South Africa and Namibia during 2009, is analysed using the CPC model. It is shown that the proposed non-parametric methodology o ers an improvement over the known parametric methods in the analysis of this data set which originated from a non-normally distributed multivariate population. CPC regression is compared to principal component regression and partial least squares regression in the tting of models to predict neonatal mortality and length of stay for infants in the VON data set. The tted regression models, using readily available day-of-admission data, can be used by medical sta and hospital administrators to counsel parents and improve the allocation of medical care resources. Predicted values from these models can also be used in benchmarking exercises to assess the performance of neonatal intensive care units in the Southern African context, as part of larger quality improvement programmes. / AFRIKAANSE OPSOMMING: Wanneer die kovariansiematrikse van twee of meer populasies beraam word, word dikwels aanvaar dat die kovariansiematrikse of gelyk, of heeltemal onverwant is. Die gemeenskaplike hoofkomponente (GHK) model verskaf 'n alternatief wat tussen hierdie twee ekstreme aannames gele e is: Die aanname word gemaak dat die populasie kovariansiematrikse dieselfde versameling eievektore deel, maar verskillende versamelings eiewaardes het. 'n Belangrike vraag in die toepassing van die GHK model is om te bepaal of dit geskik is vir die data wat beskou word. Flury (1988) het twee metodes, gebaseer op aanneemlikheidsberaming, voorgestel om hierdie vraag aan te spreek. Die aanname van meerveranderlike normaliteit is egter ongeldig vir baie werklike datastelle, wat die toepassing van hierdie metodes bevraagteken. 'n Aantal nie-parametriese metodes, gebaseer op skoenlus-herhalings van eievektore, word voorgestel om 'n geskikte gemeenskaplike eievektor model te kies vir twee populasie kovariansiematrikse. Met die gebruik van simulasie eksperimente word aangetoon dat die voorgestelde seleksiemetodes beter vaar as die bestaande parametriese seleksiemetodes. Indien toepaslik, kan die GHK model kovariansiematriks beramers verskaf wat minder sydig is as wanneer aanvaar word dat die kovariansiematrikse gelyk is, en waarvan die elemente kleiner standaardfoute het as die elemente van die gewone onsydige kovariansiematriks beramers. 'n Geregulariseerde kovariansiematriks beramer onder die GHK model word voorgestel, en Monte Carlo simulasie resultate toon dat dit meer akkurate beramings van die populasie kovariansiematrikse verskaf as ander mededingende kovariansiematriks beramers. Kovariansiematriks beraming vorm 'n integrale deel van baie meerveranderlike statistiese metodes. Toepassings van die GHK model in diskriminantanalise, bi-stippings en regressie-analise word ondersoek. Daar word aangetoon dat, in gevalle waar die GHK model toepaslik is, GHK diskriminantanalise betekenisvol kleiner misklassi kasie foutkoerse lewer as beide gewone kwadratiese diskriminantanalise en line^ere diskriminantanalise. 'n Raamwerk vir die vergelyking van verskillende tipes bi-stippings vir data met verskeie groepe word ontwikkel, en word gebruik om GHK bi-stippings gekonstrueer vanaf gemeenskaplike eievektore met ander tipe hoofkomponent bi-stippings te vergelyk. 'n Deelversameling van data vanaf die Vermont Oxford Network (VON), van babas opgeneem in deelnemende neonatale intensiewe sorg eenhede in Suid-Afrika en Namibi e gedurende 2009, word met behulp van die GHK model ontleed. Daar word getoon dat die voorgestelde nie-parametriese metodiek 'n verbetering op die bekende parametriese metodes bied in die ontleding van hierdie datastel wat afkomstig is uit 'n nie-normaal verdeelde meerveranderlike populasie. GHK regressie word vergelyk met hoofkomponent regressie en parsi ele kleinste kwadrate regressie in die passing van modelle om neonatale mortaliteit en lengte van verblyf te voorspel vir babas in die VON datastel. Die gepasde regressiemodelle, wat maklik bekombare dag-van-toelating data gebruik, kan deur mediese personeel en hospitaaladministrateurs gebruik word om ouers te adviseer en die toewysing van mediese sorg hulpbronne te verbeter. Voorspelde waardes vanaf hierdie modelle kan ook gebruik word in normwaarde oefeninge om die prestasie van neonatale intensiewe sorg eenhede in die Suider-Afrikaanse konteks, as deel van groter gehalteverbeteringprogramme, te evalueer. UCTD Analysis of covariance Discriminant analysis Monte Carlo method Multivariate analysis Principal components analysis
496	Liberté infinitésimale et modèles matriciels déformés Fevrier, Maxime 03 December 2010 (has links) (PDF) Le travail effectué dans cette thèse concerne les domaines de la théorie des matrices aléatoires et des probabilités libres, dont on connaît les riches connexions depuis le début des années 90. Les résultats s'organisent principalement en deux parties : la première porte sur la liberté infinitésimale, la seconde sur les matrices aléatoires déformées. Plus précisément, on jette les bases d'une théorie combinatoire de la liberté infinitésimale, au premier ordre d'abord, telle que récemment introduite par Belinschi et Shlyakhtenko, puis aux ordres supérieurs. On en donne un cadre simple et général, et on introduit des fonctionnelles de cumulants non-croisés, caractérisant la liberté infinitésimale. L'accent est mis sur la combinatoire et les idées d'essence différentielle qui sous-tendent cette notion. La seconde partie poursuit l'étude des déformations de modèles matriciels, qui a été ces dernières années un champ de recherche très actif. Les résultats présentés sont originaux en ce qu'ils concernent des perturbations déterministes Hermitiennes de rang non nécessairement fini de matrices de Wigner et de Wishart. En outre, un apport de ce travail est la mise en lumière du lien entre la convergence des valeurs propres de ces modèles et les probabilités libres, plus particulièrement le phénomène de subordination pour la convolution libre. Ce lien donne une illustration de la puissance des idées des probabilités libres dans les problèmes de matrices aléatoires. [MATH] Mathematics Probabilités Matrices aléatoires Probabilités libres Probabilités libres de type B Liberté infinitésimale Cumulants non-croisés infinitésimaux Système dual de dérivation Subordination Modèle matriciel déformé Plus grande valeur propre Valeur propre extrêmale Matrice de Wigner Matrice de covariance empirique
497	混合連續與間斷資料之馬式距離的穩健估計 / Robust estimation of the Mahalanobis distance for multivariate data mixed with continuous and discrete variables 任嘉珩, Jen , Chia Heng Unknown Date (has links) 本研究採用Lee 和Poon 所提出的隱藏常態變數模型來估計混合連續與間斷型變數之參數估計，並估計其馬式距離。此外，並利用穩健估計來估計混合型資料參數及其馬式距離，可在有離群值時解決最大蓋似估計的不穩定。 / Poon and Lee (1987) applied normal latent variable model to deal with the parameters estimation for the data mixed with continuous and discrete variables and Bedrick et al. (2000) used this idea to evaluate the Mahalanobis distance. In this thesis, we extend a similar idea to robustly estimate Multivariate Data Mixed with Continuous and Discrete Variables with the same model. Furthermore, we evaluate the Mahalanobis distance which can determine similarity of variables. The proposed method can overcome the unreliability of MLE while there exist outliers in the data. 混合型資料隱藏常態變數模型穩健估計馬式距離 mixed data normal latnet variable model robust estimation Mahalanobis distacne minimum covariance determinant
498	應用共變異矩陣描述子及半監督式學習於行人偵測 / Semi-supervised learning for pedestrian detection with covariance matrix feature 黃靈威, Huang, Ling Wei Unknown Date (has links) 行人偵測為物件偵測領域中一個極具挑戰性的議題。其主要問題在於人體姿勢以及衣著服飾的多變性，加之以光源照射狀況迥異，大幅增加了辨識的困難度。吾人在本論文中提出利用共變異矩陣描述子及結合單純貝氏分類器與級聯支持向量機的線上學習辨識器，以增進行人辨識之正確率與重現率。實驗結果顯示，本論文所提出之線上學習策略在某些辨識狀況較差之資料集中能有效提升正確率與重現率達百分之十四。此外，即便於相同之初始訓練條件下，在USC Pedestrian Detection Test Set、 INRIA Person dataset 及 Penn-Fudan Database for Pedestrian Detection and Segmentation三個資料集中，本研究之正確率與重現率亦較HOG搭配AdaBoost之行人辨識方式為優。 / Pedestrian detection is an important yet challenging problem in object classification due to flexible body pose, loose clothing and ever-changing illumination. In this thesis, we employ covariance feature and propose an on-line learning classifier which combines naïve Bayes classifier and cascade support vector machine (SVM) to improve the precision and recall rate of pedestrian detection in a still image. Experimental results show that our on-line learning strategy can improve precision and recall rate about 14% in some difficult situations. Furthermore, even under the same initial training condition, our method outperforms HOG + AdaBoost in USC Pedestrian Detection Test Set, INRIA Person dataset and Penn-Fudan Database for Pedestrian Detection and Segmentation. 半監督式學習支持向量機單純貝氏分類器共變異描述子 Semi-supervised learning Support vector machine Naïve Bayes classifier Covariance descriptor
499	Studium a srovnávání hlavních typů evolučních algoritmů / Study and comparison of main kinds of evolutionary algorithms Štefan, Martin January 2012 (has links) Evolutionary algorithms belongs among the youngest and the most progressive methods of solving difficult optimization tasks. They received huge popularity mainly due to good experimental results in optimization, a simplicity of the implementation and a high modularity, which is an ability to be modified for different problems. Among the most frequently used Evolutionary algorithms belongs Genetic Algorithm, Differential Evolution and Evolutionary Strategy. It is able to apply these algorithms and theirs variants to both continuous, discrete and mixed optimization tasks. A subject of this theses is to compare three main types of algorithms on the catalyst optimization task with mixed variables, linear constraints and experimentally evaluated fitness function.
500	Contribuições à análise de outliers em modelos de equações estruturais / Contributions to the analysis of outliers in structural equation models Bulhões, Rodrigo de Souza 10 May 2013 (has links) O Modelo de Equações Estruturais (MEE) é habitualmente ajustado para realizar uma análise confirmatória sobre as conjecturas de um pesquisador acerca do relacionamento entre as variáveis observadas e latentes de algum estudo. Na prática, a maneira mais recorrente de avaliar a qualidade das estimativas de um MEE é a partir de medidas que buscam mensurar o quanto a usual matriz de covariâncias clássicas ou ordinárias se distancia da matriz de covariâncias do modelo ajustado, ou a magnitude do afastamento entre as funções de discrepância do modelo hipotético e do modelo saturado. Entretanto, elas podem não captar problemas no ajuste quando há muitos parâmetros a estimar ou bastantes observações. A fim de detectar irregularidades no ajustamento resultantes do impacto provocado pela presença de outliers no conjunto de dados, este trabalho contemplou alguns indicadores conhecidos na literatura, como também considerou alterações no Índice da Qualidade do Ajuste (ou GFI, de Goodness-of-Fit Index) e no Índice Corrigido da Qualidade do Ajuste (ou AGFI, de Ajusted Goodness-of-Fit Index), ambos nas expressões para estimação de parâmetros pelo método de Máxima Verossimilhança, que consistiram em substituir a tradicional matriz de covariâncias pelas matrizes de covariâncias computadas com os seguintes estimadores: Elipsoide de Volume Mínimo, Covariância de Determinante Mínimo, S, MM e Gnanadesikan-Kettenring Ortogonalizado (GKO). Através de estudos de simulação sobre perturbações de desvio de simetria e excesso de curtose, em baixa e alta frações de contaminação, em diferentes tamanhos de amostra e quantidades de variáveis observadas afetadas, foi possível constatar que as propostas de modificação do GFI e do AGFI adaptadas pelo estimador GKO foram as únicas que conseguiram ser informativas em todas essas situações, devendo-se escolher a primeira ou a segunda respectivamente quando a quantidade de parâmetros a serem estimados é baixa ou elevada. / The Structural Equation Model (SEM) is usually set to perform a confirmatory analysis on the assumptions of a researcher about the relationship between the observed variables and the latent variables of such a study. In practice, the most iterant way of evaluating the quality of the estimates of a SEM comes either from procedures of measuring how distant the usual classic or ordinary covariance matrix is from the covariance matrix of the adjusted model, or from the magnitude of the hiatus in discrepancy functions of both the hypothetical model and the saturated model. Nevertheless, they may fail to capture problems in the adjustment in the occurrence of either several parameters to estimate or several observations. This study included indicators known in the literature in order to detect irregularities in the adjustment resulting from the impact caused by the presence of outliers in the data set. This study has also considered changes in both the Goodness-of-Fit Index (GFI) and the Adjusted Goodness-of-Fit Index (AGFI) in the expressions for parameter estimation by Maximum Likelihood method, which consisted in replacing the traditional covariance matrix by the robust covariance matrices computed through the following estimators: Minimum Volume Ellipsoid, Minimum Covariance Determinant, S, MM and Orthogonalized Gnanadesikan-Kettenring (OGK). Through simulation studies on disturbances of both symmetry deviations and excess kurtosis in both low and high fractions of contamination in different sample sizes and quantities of affected observed variables it has become clear that the proposals of modification of both the GFI and the AGFI adapted by the OGK estimator were the only ones able to be informative in all these situations. It must be considered that GFI or AGFI must be used when the number of parameters to be estimated is either low or high, respectively. Adjusted Goodness-of-Fit Index AGFI AGFI GFI GFI Goodness-of-Fit Index Índice Corrigido da Qualidade do Ajuste Índice da Qualidade do Ajuste Matrizes de Covariâncias Robustas Modelo de Equações Estruturais Outliers Outliers Robust Covariance Matrices Structural Equation Model

Search results