Global ETD Search

141	Contributions à l'identification de modèles à temps continu à partir de données échantillonnées à pas variable / Contributions to the identification of continuous-time models from irregularly sampled data Chen, Fengwei 21 November 2014 (has links) Cette thèse traite de l’identification de systèmes dynamiques à partir de données échantillonnées à pas variable. Ce type de données est souvent rencontré dans les domaines biomédical, environnemental, dans le cas des systèmes mécaniques où un échantillonnage angulaire est réalisé ou lorsque les données transitent sur un réseau. L’identification directe de modèles à temps continu est l’approche à privilégier lorsque les données disponibles sont échantillonnées à pas variable ; les paramètres des modèles à temps discret étant dépendants de la période d’échantillonnage. Dans une première partie, un estimateur optimal de type variable instrumentale est développé pour estimer les paramètres d’un modèle Box-Jenkins à temps continu. Ce dernier est itératif et présente l’avantage de fournir des estimées non biaisées lorsque le bruit de mesure est coloré et sa convergence est peu sensible au choix du vecteur de paramètres initial. Une difficulté majeure dans le cas où les données sont échantillonnées à pas variable concerne l’estimation de modèles de bruit de type AR et ARMA à temps continu (CAR et CARMA). Plusieurs estimateurs pour les modèles CAR et CARMA s’appuyant sur l’algorithme Espérance-Maximisation (EM) sont développés puis inclus dans l’estimateur complet de variable instrumentale optimale. Une version étendue au cas de l’identification en boucle fermée est également développée. Dans la deuxième partie de la thèse, un estimateur robuste pour l'identification de systèmes à retard est proposé. Cette classe de systèmes est très largement rencontrée en pratique et les méthodes disponibles ne peuvent pas traiter le cas de données échantillonnées à pas variable. 
Le retard n’est pas contraint à être un multiple de la période d’échantillonnage, contrairement à l’hypothèse traditionnelle dans le cas de modèles à temps discret. L’estimateur développé est de type bootstrap et combine la méthode de variable instrumentale itérative pour les paramètres de la fonction de transfert avec un algorithme numérique de type gradient pour estimer le retard. Un filtrage de type passe-bas est introduit pour élargir la région de convergence pour l’estimation du retard. Tous les estimateurs proposés sont inclus dans la boîte à outils logicielle CONTSID pour Matlab et sont évalués à l’aide de simulation de Monte-Carlo / The output of a system is always corrupted by additive noise, so it is more practical to develop estimation algorithms that can handle noisy data. The effect of white additive noise has been widely studied, whereas colored additive noise has attracted less attention, especially continuous-time (CT) noise. Sampling issues of CT stochastic processes are reviewed in this thesis, and several sampling schemes are presented. Estimation of a CT stochastic process is then studied. An expectation-maximization (EM) based method for CT autoregressive and autoregressive moving average models is developed, which gives accurate estimates over a wide range of sampling intervals. Estimation of CT Box-Jenkins models is also considered, in which the noise part is modeled to improve the performance of plant model estimation. The proposed method for CT Box-Jenkins model identification uses a two-step, iterative framework. Two-step means that the plant and noise models are estimated separately and alternately, each being estimated while the other is held fixed. More specifically, the plant is estimated by the refined instrumental variable (RIV) method, while the noise is estimated by the EM algorithm. 
Iterative means that the proposed method repeats the estimation procedure several times until an optimal estimate is found. Many practical systems have an inherent time-delay. The problem of identifying time-delay systems is of great importance for analysis, prediction or control design. The presence of an unknown time-delay greatly complicates the parameter estimation problem, essentially because the model is not linear with respect to the time-delay. An approach to continuous-time model identification of time-delay systems, combining a numerical search algorithm for the delay with the RIV method for the dynamics, has been developed in this thesis. In the proposed algorithm, the system parameters and the time-delay are estimated alternately in a bootstrap manner. The time-delay is estimated by an adaptive gradient-based method, whereas the system parameters are estimated by the RIV method. Since a numerical search is used, the bootstrap method may converge to local optima; therefore a low-pass filter is used to enlarge the convergence region for the time-delay. The performance of the proposed algorithms is evaluated by numerical examples. Modèles à temps continu Estimateur de la variable instrumentale Modèle de Box-Jenkins Processus stochastique à temps continu Algorithme espérance-maximisation Systèmes à retard Continuous-time models Irregularly sampled data Instrumental variable Box-Jenkins models Continuous-time stochastic process Expectation-maximization algorithm Time-delay 658.500 285 003
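The abstract of record 141 notes that discrete-time model parameters depend on the sampling period, which is why a continuous-time model is preferred for irregularly sampled data. A minimal sketch of that point, assuming a hypothetical first-order system dx/dt = -a*x + b*u with a piecewise-constant (zero-order-hold) input; the function names and the example system are illustrative and are not taken from the thesis or from the CONTSID toolbox:

```python
import math

def zoh_coefficients(a, b, h):
    """Exact discretization of dx/dt = -a*x + b*u over one step of length h.

    Returns (phi, gamma) such that x(t + h) = phi * x(t) + gamma * u(t).
    Both coefficients change with h, while (a, b) do not.
    """
    phi = math.exp(-a * h)
    gamma = (b / a) * (1.0 - phi)
    return phi, gamma

def simulate_irregular(a, b, times, inputs, x0=0.0):
    """Simulate the system at arbitrary, possibly unevenly spaced, time stamps."""
    x = x0
    states = [x]
    for k in range(1, len(times)):
        h = times[k] - times[k - 1]          # step length varies from sample to sample
        phi, gamma = zoh_coefficients(a, b, h)
        x = phi * x + gamma * inputs[k - 1]  # input held constant over the step
        states.append(x)
    return states

# Example: a unit step input observed on an uneven time grid.
times = [0.0, 0.1, 0.35, 0.4, 1.0]
states = simulate_irregular(2.0, 4.0, times, [1.0] * len(times))
```

The same pair (a, b) describes every step, whereas the discrete-time pair (phi, gamma) has to be recomputed for each sampling interval.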
142	Expressing emotions through vibration for perception and control / Expressing emotions through vibration ur Réhman, Shafiq January 2010 (has links) This thesis addresses a challenging problem: “how to let the visually impaired ‘see’ others’ emotions”. We human beings depend heavily on facial expressions to express ourselves. A smile shows that the person you are talking to is pleased, amused, relieved etc. People use emotional information from facial expressions to switch between conversation topics and to determine the attitudes of individuals. Missing emotional information from facial expressions and head gestures makes it extremely difficult for the visually impaired to interact with others at social events. To enhance the social interaction ability of the visually impaired, this thesis works on the scientific topic of ‘expressing human emotions through vibrotactile patterns’. Delivering human emotions through touch is quite challenging, since our touch channel is very limited. We first investigated how to render emotions through a vibrator. We developed a real-time “lipless” tracking system to extract dynamic emotions from the mouth and employed mobile phones as a platform for the visually impaired to perceive primary emotion types. Later on, we extended the system to render more general dynamic media signals, for example rendering live football games through vibration on a mobile phone to improve the communication and entertainment experience of mobile users. To display more natural emotions (i.e. emotion type plus emotion intensity), we developed technology that enables the visually impaired to interpret human emotions directly. This was achieved by the use of machine vision techniques and a vibrotactile display. The display comprises a ‘vibration actuators matrix’ mounted on the back of a chair, and the actuators are sequentially activated to provide dynamic emotional information. 
The research focus has been on finding a global, analytical, and semantic representation for facial expressions to replace the state-of-the-art Facial Action Coding System (FACS) approach. We proposed to use the manifold of facial expressions to characterize dynamic emotions. The basic emotional expressions with increasing intensity become curves on the manifold extending from the center. The blends of emotions lie between those curves and can be defined analytically by the positions of the main curves. The manifold is the “Braille Code” of emotions. The developed methodology and technology have been extended to build assistive wheelchair systems to aid a specific group of disabled people, cerebral palsy or stroke patients (i.e. those lacking fine motor control skills), who are unable to access and control the wheelchair by conventional means, such as a joystick or chin stick. The solution is to extract the manifold of the head or tongue gestures for controlling the wheelchair. The manifold is rendered by a 2D vibration array to provide the wheelchair user with action information from gestures and with system status information, which is very important for enhancing the usability of such an assistive system. The current research work provides not only a foundation stone for vibrotactile rendering systems based on object localization but also a concrete step toward a new dimension of human-machine interaction. / Taktil Video Multimodal Signal Processing Mobile Communication Vibrotactile Rendering Locally Linear Embedding Object Detection Human Facial Expression Analysis Lip Tracking Object Tracking HCI Expectation-Maximization Algorithm Lipless Tracking Image Analysis Visually Impaired. Signal processing Signalbehandling Image analysis Bildanalys Computer science Datavetenskap Telecommunication Telekommunikation Systems engineering Systemteknik
143	確定提撥制退休金之評價:馬可夫調控跳躍過程模型下股價指數之實證 / Valuation of a defined contribution pension plan: evidence from stock indices under Markov-Modulated jump diffusion model 張玉華, Chang, Yu Hua Unknown Date (has links) 退休金是退休人未來生活的依靠，確保在退休後能得到適足的退休給付，政府在退休金上實施保證收益制度，此制度為最低保證利率與投資報酬率連結。本文探討退休金給付標準為確定提撥制，當退休金的投資報酬率是根據其連結之股價指數的表現來計算時，股價指數報酬率的模型假設為馬可夫調控跳躍過程模型，考慮市場狀態與布朗運動項、跳躍項的跳躍頻率相關，即為Elliot et al. (2007) 的模型特例。使用1999年至2012年的道瓊工業指數與S&P 500指數的股價指數對數報酬率作為研究資料，採用EM演算法估計參數及SEM演算法估計參數共變異數矩陣。透過概似比檢定說明馬可夫調控跳躍過程模型比狀態轉換模型、跳躍風險下狀態轉換模型更適合描述股價指數報酬率變動情形，也驗證馬可夫調控跳躍過程模型具有描述報酬率不對稱、高狹峰及波動叢聚的特性。最後，假設最低保證利率為固定下，利用Esscher轉換法計算不同模型下型I保證之確定提撥制退休金的評價公式，從公式中可看出受雇人提領的退休金價值可分為政府補助與個人帳戶擁有之退休金兩部分。以執行敏感度分析探討估計參數對於馬可夫調控跳躍過程模型評價公式的影響，而型II保證之確定提撥制退休金的價值則以蒙地卡羅法模擬並探討其敏感度分析結果。 / A pension plan provides people with a guaranteed livelihood in retirement. To ensure an adequate retirement benefit, the government attaches a guarantee to the pension plan that links a minimum guaranteed rate of return to the rate of return of the underlying asset. In this paper, we discuss the defined contribution (DC) pension plan. When the return on the asset is based on stock indices, the return is assumed to follow a Markov-modulated jump diffusion model (MMJDM) in which both the Brownian motion term and the jump rate depend on the market state. This model is a special case of the model of Elliot et al. (2007). The sample consists of logarithmic returns of the Dow Jones Industrial Average and the S&P 500 index from 1999 to 2012. We estimated the parameters by the Expectation-Maximization (EM) algorithm and calculated the covariance matrix of the estimates by the supplemented EM (SEM) algorithm. The likelihood ratio test (LRT) showed that the MMJDM fits the data better than the regime-switching model and the regime-switching model with jump risk. The empirical evidence also indicated that the MMJDM can capture the asymmetry, leptokurtosis and volatility clustering of asset returns. 
Finally, assuming a constant minimum guaranteed rate of return, we used the Esscher transform to derive valuation formulas under the different models for the DC pension plan with a type-I guarantee. The formulas show that the value of the pension withdrawn by the employee can be divided into two parts: the government supplement and the pension held in the employee's personal account. We then carried out a sensitivity analysis of the MMJDM valuation formula with respect to the estimated parameters. The value of the DC pension plan with a type-II guarantee was evaluated by Monte Carlo simulation, and the results of its sensitivity analysis are discussed. 確定提撥制退休金保證收益馬可夫調控跳躍過程模型 EM演算法 Esscher轉換法 defined contribution guarantee Markov-Modulated jump diffusion model expectation-maximization algorithm Esscher transformation
144	A class of bivariate Erlang distributions and ruin probabilities in multivariate risk models Groparu-Cojocaru, Ionica 11 1900 (has links) Nous y introduisons une nouvelle classe de distributions bivariées de type Marshall-Olkin, la distribution Erlang bivariée. La transformée de Laplace, les moments et les densités conditionnelles y sont obtenus. Les applications potentielles en assurance-vie et en finance sont prises en considération. Les estimateurs du maximum de vraisemblance des paramètres sont calculés par l'algorithme Espérance-Maximisation. Ensuite, notre projet de recherche est consacré à l'étude des processus de risque multivariés, qui peuvent être utiles dans l'étude des problèmes de la ruine des compagnies d'assurance avec des classes dépendantes. Nous appliquons les résultats de la théorie des processus de Markov déterministes par morceaux afin d'obtenir les martingales exponentielles, nécessaires pour établir des bornes supérieures calculables pour la probabilité de ruine, dont les expressions sont intraitables. / In this contribution, we introduce a new class of bivariate distributions of Marshall-Olkin type, called bivariate Erlang distributions. The Laplace transform, product moments and conditional densities are derived. Potential applications of bivariate Erlang distributions in life insurance and finance are considered. Further, our research project is devoted to the study of multivariate risk processes, which may be useful in analyzing ruin problems for insurance companies with a portfolio of dependent classes of business. We apply results from the theory of piecewise deterministic Markov processes in order to derive exponential martingales needed to establish computable upper bounds of the ruin probabilities, as their exact expressions are intractable. 
Distribution Erlang Algorithme Espérance-Maximisation Modèle de risque multivarié Probabilité de ruine Modèle de Poisson avec chocs communs Processus de renouvellement Copules Erlang distribution Expectation-Maximization algorithm Piecewise deterministic Markov processes Multivariate risk model Ruin probability Poisson model with common shocks Renewal processes Copulas
145	Estimation du taux d'erreurs binaires pour n'importe quel système de communication numérique DONG, Jia 18 December 2013 (has links) (PDF) This thesis is related to the Bit Error Rate (BER) estimation for any digital communication system. In many designs of communication systems, the BER is a Key Performance Indicator (KPI). The popular Monte-Carlo (MC) simulation technique is well suited to any system but at the expense of long time simulations when dealing with very low error rates. In this thesis, we propose to estimate the BER by using the Probability Density Function (PDF) estimation of the soft observations of the received bits. First, we have studied a non-parametric PDF estimation technique named the Kernel method. Simulation results in the context of several digital communication systems are proposed. Compared with the conventional MC method, the proposed Kernel-based estimator provides good precision even for high SNR with very limited number of data samples. Second, the Gaussian Mixture Model (GMM), which is a semi-parametric PDF estimation technique, is used to estimate the BER. Compared with the Kernel-based estimator, the GMM method provides better performance in the sense of minimum variance of the estimator. Finally, we have investigated the blind estimation of the BER, which is the estimation when the sent data are unknown. We denote this case as unsupervised BER estimation. The Stochastic Expectation-Maximization (SEM) algorithm combined with the Kernel or GMM PDF estimation methods has been used to solve this issue. By analyzing the simulation results, we show that the obtained BER estimate can be very close to the real values. This is quite promising since it could enable real-time BER estimation on the receiver side without decreasing the user bit rate with pilot symbols for example. BER estimation Monte-Carlo simulation Probability Density Function Soft observations Kernel method Gaussian Mixture Model
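The kernel-based idea in record 145 can be illustrated with a short sketch: once the density of the soft observations is estimated with Gaussian kernels, the error probability is the mass of that estimate on the wrong side of the decision threshold. The sketch assumes BPSK-like signalling with a threshold at 0, soft values from bits known to have been sent as +1 (the supervised case), and a user-chosen bandwidth; the function name and these choices are assumptions for illustration, not the estimator defined in the thesis.

```python
import math

def kernel_ber(soft_values, bandwidth):
    """Estimate P(soft value < 0) from a Gaussian kernel density estimate.

    soft_values: soft observations of bits assumed to have been sent as +1.
    bandwidth:   kernel smoothing parameter h (> 0), chosen by the user.

    Integrating a Gaussian kernel centred at x from -infinity to 0 gives
    Phi(-x / h) = 0.5 * erfc(x / (h * sqrt(2))), so the estimate is the
    average of these tail masses over all samples.
    """
    if bandwidth <= 0 or not soft_values:
        raise ValueError("need a positive bandwidth and at least one sample")
    scale = bandwidth * math.sqrt(2.0)
    tails = [0.5 * math.erfc(x / scale) for x in soft_values]
    return sum(tails) / len(tails)

# Example call with made-up soft values.
estimate = kernel_ber([0.9, 1.2, 0.4, 1.1], bandwidth=0.3)
```

Unlike counting errors, this estimate is nonzero even when no sample falls below the threshold, which is what makes density-based estimation attractive at very low error rates.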
146	在序列相關因子模型下探討動態模型化投資組合信用風險 / Dynamic modeling portfolio credit risk under serially dependent factor model 游智惇, Yu, Chih Tun Unknown Date (has links) 獨立因子模型廣泛的應用在信用風險領域，此模型可用來估計經濟資本與投資組合的損失率分配。然而獨立因子模型假設因子獨立地服從同分配，因而可能會得到估計不精確的違約機率與資產相關係數。因此我們在本論文中提出序列相關因子模型來改進獨立因子模型的缺失，同時可以捕捉違約率的動態行為與授信戶間相關性。我們也分別從古典與貝氏的角度下估計序列相關因子模型。首先，我們在序列相關因子模型下利用貝氏的方法應用馬可夫鍊蒙地卡羅技巧估計違約機率與資產相關係數，使用標準普爾違約資料進行外樣本資料預測，能夠證明序列相關因子模型是比獨立因子模型合理。第二，蒙地卡羅期望最大法與蒙地卡羅最大概似法這兩種估計方法也使用在本篇論文。從模擬結果發現，若違約資料具有較大的序列相關與資產相關特性，蒙地卡羅最大概似法能夠配適的比蒙地卡羅期望最大法好。 / The independent factor model has been widely used in the credit risk field and has been applied to estimating economic capital allocations and the loss rate distribution of a credit portfolio. However, this model assumes independent and identically distributed common factors, which may produce inaccurate estimates of default probabilities and asset correlations. In this thesis, we propose a serially dependent factor model (SDFM) to remedy this shortcoming. This model can capture both the dynamic behavior of default risk and the dependence among individual obligors. We also address the estimation of the SDFM from both the frequentist and the Bayesian points of view. First, we consider the Bayesian approach, applying Markov chain Monte Carlo (MCMC) techniques to estimate the default probability and asset correlation under the SDFM. The out-of-sample forecasts for S&P default data provide strong evidence that the SDFM is more reliable than the independent factor model. Second, we use two frequentist methods to estimate the default probability and asset correlation under the SDFM. One is the Monte Carlo Expectation Maximization (MCEM) method with a Gibbs sampler and an acceptance method, and the other is the Monte Carlo maximum likelihood (MCML) method with importance sampling techniques. 
序列相關因子模型投資組合信用風險貝氏分析蒙地卡羅最大概似法蒙地卡羅期望最大法 Serially dependent factor model Portfolio credit risk Bayesian inference Monte Carlo Expectation Maximization Monte Carlo maximum likelihood
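The abstract of record 146 does not give the model equations, so the following is only a sketch of one common way to write a one-factor credit model with a serially dependent factor: a Gaussian (probit) link for the conditional default probability and an AR(1) recursion for the common factor. The AR(1) form, the parameter names `pd`, `rho`, `phi`, and both function names are assumptions made for illustration; setting `phi = 0` recovers an independent, identically distributed factor.

```python
import math
import random
from statistics import NormalDist

_STD_NORMAL = NormalDist()

def conditional_pd(pd, rho, z):
    """Default probability conditional on the common factor value z.

    pd:  unconditional default probability, 0 < pd < 1.
    rho: asset correlation, 0 <= rho < 1.
    A negative z (bad state of the economy) raises the conditional probability.
    """
    threshold = _STD_NORMAL.inv_cdf(pd)
    return _STD_NORMAL.cdf((threshold - math.sqrt(rho) * z) / math.sqrt(1.0 - rho))

def ar1_factor_path(phi, n, seed=0):
    """Simulate a stationary AR(1) factor with unit variance (assumed form)."""
    rng = random.Random(seed)
    z = rng.gauss(0.0, 1.0)
    path = [z]
    for _ in range(n - 1):
        z = phi * z + math.sqrt(1.0 - phi * phi) * rng.gauss(0.0, 1.0)
        path.append(z)
    return path

# Example: a path of yearly conditional default probabilities.
factor = ar1_factor_path(phi=0.8, n=10)
yearly_pd = [conditional_pd(0.02, 0.2, z) for z in factor]
```

With a persistent factor, high-default years tend to cluster, which is the dynamic behavior an independent factor cannot reproduce.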
147	Probabilistic and Bayesian nonparametric approaches for recommender systems and networks / Approches probabilistes et bayésiennes non paramétriques pour les systemes de recommandation et les réseaux Todeschini, Adrien 10 November 2016 (has links) Nous proposons deux nouvelles approches pour les systèmes de recommandation et les réseaux. Dans la première partie, nous donnons d’abord un aperçu sur les systèmes de recommandation avant de nous concentrer sur les approches de rang faible pour la complétion de matrice. En nous appuyant sur une approche probabiliste, nous proposons de nouvelles fonctions de pénalité sur les valeurs singulières de la matrice de rang faible. En exploitant une représentation de modèle de mélange de cette pénalité, nous montrons qu’un ensemble de variables latentes convenablement choisi permet de développer un algorithme espérance-maximisation afin d’obtenir un maximum a posteriori de la matrice de rang faible complétée. L’algorithme résultant est un algorithme à seuillage doux itératif qui adapte de manière itérative les coefficients de réduction associés aux valeurs singulières. L’algorithme est simple à mettre en œuvre et peut s’adapter à de grandes matrices. Nous fournissons des comparaisons numériques entre notre approche et de récentes alternatives montrant l’intérêt de l’approche proposée pour la complétion de matrice à rang faible. Dans la deuxième partie, nous présentons d’abord quelques prérequis sur l’approche bayésienne non paramétrique et en particulier sur les mesures complètement aléatoires et leur extension multivariée, les mesures complètement aléatoires composées. Nous proposons ensuite un nouveau modèle statistique pour les réseaux creux qui se structurent en communautés avec chevauchement. Le modèle est basé sur la représentation du graphe comme un processus ponctuel échangeable, et généralise naturellement des modèles probabilistes existants à structure en blocs avec chevauchement au régime creux. 
Notre construction s’appuie sur des vecteurs de mesures complètement aléatoires, et possède des paramètres interprétables, chaque nœud étant associé un vecteur représentant son niveau d’affiliation à certaines communautés latentes. Nous développons des méthodes pour simuler cette classe de graphes aléatoires, ainsi que pour effectuer l’inférence a posteriori. Nous montrons que l’approche proposée peut récupérer une structure interprétable à partir de deux réseaux du monde réel et peut gérer des graphes avec des milliers de nœuds et des dizaines de milliers de connections. / We propose two novel approaches for recommender systems and networks. In the first part, we first give an overview of recommender systems and then concentrate on low-rank approaches to matrix completion. Building on a probabilistic approach, we propose novel penalty functions on the singular values of the low-rank matrix. By exploiting a mixture model representation of this penalty, we show that a suitably chosen set of latent variables makes it possible to derive an expectation-maximization algorithm that yields a maximum a posteriori estimate of the completed low-rank matrix. The resulting algorithm is an iterative soft-thresholding algorithm that iteratively adapts the shrinkage coefficients associated with the singular values. The algorithm is simple to implement and can scale to large matrices. We provide numerical comparisons between our approach and recent alternatives, showing the benefit of the proposed approach for low-rank matrix completion. In the second part, we first introduce some background on Bayesian nonparametrics, in particular on completely random measures (CRMs) and their multivariate extension, the compound CRMs. We then propose a novel statistical model for sparse networks with overlapping community structure. 
The model is based on representing the graph as an exchangeable point process, and naturally generalizes existing probabilistic models with overlapping block-structure to the sparse regime. Our construction builds on vectors of CRMs and has interpretable parameters, each node being assigned a vector representing its level of affiliation to some latent communities. We develop methods for simulating this class of random graphs, as well as for performing posterior inference. We show that the proposed approach can recover interpretable structure from two real-world networks and can handle graphs with thousands of nodes and tens of thousands of edges. Systèmes de recommandation Filtrage collaboratif Complétion de matrice de rang faible Modèles probabilistes Espérance-maximisation Réseaux Parcimonie Comportement en loi de puissance Structure en communautés Mesures complètement aléatoires Monte Carlo par chaîne de Markov Graphes Recommender systems Collaborative filtering Low-rank matrix completion Probabilistic models Expectation maximization Networks Graphs Sparsity Power-law behavior Community structure Bayesian nonparametrics Completely random measures Markov chain Monte Carlo
148	Individualization of fixed-dose combination regimens : Methodology and application to pediatric tuberculosis / Individualisering av design och dosering av kombinationstabletter : Metodologi och applicering inom pediatrisk tuberkulos Yngman, Gunnar January 2015 (has links) Introduction: No Fixed-Dose Combination (FDC) formulations currently exist for pediatric tuberculosis (TB) treatment. Earlier work implemented, in the software NONMEM, a rational method for optimizing the design and individualization of pediatric anti-TB FDC formulations based on patient body weight, but issues with parameter estimation, dosage strata heterogeneity and representative pharmacokinetics remained. Aim: To further develop the rational model-based methodology aiding the selection of appropriate FDC formulation designs and dosage regimens in pediatric TB treatment. Materials and Methods: The method was to be optimized with respect to the estimation of body weight breakpoints, and the heterogeneity of dosage groups with respect to treatment efficiency was to be reduced. Recently published pediatric pharmacokinetic parameters were implemented and the model was translated to MATLAB, where its performance was also evaluated by stochastic estimation and graphical visualization. Results: A logistic function was found to be better suited as an approximation of breakpoints. None of the estimation methods implemented in NONMEM was more suitable than the originally used FO method. The homogenization of dosage group treatment efficiency could not be achieved. The MATLAB translation was successful but required stochastic estimation and revealed a high density of local minima. Representative pharmacokinetics were successfully implemented. Conclusions: NONMEM was found suboptimal for the task due to problems with discontinuities and heterogeneity, but a stepwise method with representative pharmacokinetics was successfully implemented. 
MATLAB showed more promise in the search for a method also addressing the heterogeneity issue. pharmacometrics pharmacokinetics PK population pharmacokinetics optimal design FDC fixed-dose combination pediatrics tuberculosis rifampicin pyrazinamide isoniazid ethambutol modeling simulation estimation allometric scaling stochastical simulation and estimation NONMEM MATLAB FO FOCE Expectation-Maximization algorithm importance sampling Nelder-Mead method logistic function discontinuity euclidean distance Sammon mapping global minimum local minima Pharmaceutical Sciences Farmaceutiska vetenskaper Bioinformatics (Computational Biology) Bioinformatik (beräkningsbiologi)
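Record 148 reports that a logistic function was better suited as an approximation of body weight breakpoints. A hard breakpoint makes the dose a step function of weight, which is discontinuous and awkward for gradient-based estimation; a logistic curve gives a smooth surrogate. The sketch below shows the general idea only: the parameterization, the `steepness` value, and the example doses and weights are hypothetical and are not taken from the thesis.

```python
import math

def smooth_dose(weight, breakpoint, dose_low, dose_high, steepness=5.0):
    """Smooth surrogate for a dose that jumps from dose_low to dose_high at a
    body weight breakpoint. Larger steepness gives a sharper transition."""
    t = steepness * (weight - breakpoint)
    # Evaluate the logistic function in a form that cannot overflow.
    if t >= 0:
        s = 1.0 / (1.0 + math.exp(-t))
    else:
        e = math.exp(t)
        s = e / (1.0 + e)
    return dose_low + (dose_high - dose_low) * s

# Example: one tablet below a 15 kg breakpoint, two tablets above it.
doses = [smooth_dose(w, 15.0, 1.0, 2.0) for w in (5.0, 14.5, 15.0, 15.5, 25.0)]
```

Because the surrogate is differentiable in `breakpoint`, the breakpoint can be treated as an ordinary continuous parameter during estimation.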
149	Classification de données multivariées multitypes basée sur des modèles de mélange : application à l'étude d'assemblages d'espèces en écologie / Model-based clustering for multivariate and mixed-mode data : application to multi-species spatial ecological data Georgescu, Vera 17 December 2010 (has links) En écologie des populations, les distributions spatiales d'espèces sont étudiées afin d'inférer l'existence de processus sous-jacents, tels que les interactions intra- et interspécifiques et les réponses des espèces à l'hétérogénéité de l'environnement. Nous proposons d'analyser les données spatiales multi-spécifiques sous l'angle des assemblages d'espèces, que nous considérons en termes d'abondances absolues et non de diversité des espèces. Les assemblages d'espèces sont une des signatures des interactions spatiales locales des espèces entre elles et avec leur environnement. L'étude des assemblages d'espèces peut permettre de détecter plusieurs types d'équilibres spatialisés et de les associer à l'effet de variables environnementales. Les assemblages d'espèces sont définis ici par classification non spatiale des observations multivariées d'abondances d'espèces. Les méthodes de classification basées sur les modèles de mélange ont été choisies afin d'avoir une mesure de l'incertitude de la classification et de modéliser un assemblage par une loi de probabilité multivariée. Dans ce cadre, nous proposons : 1. une méthode d'analyse exploratoire de données spatiales multivariées d'abondances d'espèces, qui permet de détecter des assemblages d'espèces par classification, de les cartographier et d'analyser leur structure spatiale. Des lois usuelles, telle que la Gaussienne multivariée, sont utilisées pour modéliser les assemblages, 2. un modèle hiérarchique pour les assemblages d'abondances lorsque les lois usuelles ne suffisent pas. 
Ce modèle peut facilement s'adapter à des données contenant des variables de types différents, qui sont fréquemment rencontrées en écologie, 3. une méthode de classification de données contenant des variables de types différents basée sur des mélanges de lois à structure hiérarchique (définies en 2.). Deux applications en écologie ont guidé et illustré ce travail : l'étude à petite échelle des assemblages de deux espèces de pucerons sur des feuilles de clémentinier et l'étude à large échelle des assemblages d'une plante hôte, le plantain lancéolé, et de son pathogène, l'oïdium, sur les îles Aland en Finlande / In population ecology, species spatial patterns are studied in order to infer the existence of underlying processes, such as interactions within and between species, and species response to environmental heterogeneity. We propose to analyze spatial multi-species data by defining species abundance assemblages. Species assemblages are one of the signatures of the local spatial interactions between species and with their environment. Species assemblages are defined here by a non spatial classification of the multivariate observations of species abundances. Model-based clustering procedures using mixture models were chosen in order to have an estimation of the classification uncertainty and to model an assemblage by a multivariate probability distribution. We propose : 1. An exploratory tool for the study of spatial multivariate observations of species abundances, which defines species assemblages by a model-based clustering procedure, and then maps and analyzes the spatial structure of the assemblages. Common distributions, such as the multivariate Gaussian, are used to model the assemblages. 2. A hierarchical model for abundance assemblages which cannot be modeled with common distributions. This model can be easily adapted to mixed mode data, which are frequent in ecology. 3. A clustering procedure for mixed-mode data based on mixtures of hierarchical models. 
Two ecological case-studies guided and illustrated this work: the small-scale study of the assemblages of two aphid species on leaves of Citrus trees, and the large-scale study of the assemblages of a host plant, Plantago lanceolata, and its pathogen, the powdery mildew, on the Aland islands in south-west Finland Assemblage d'espèces Coexistence Données mixtes Données multivariées spatiales Modèle gaussien latent Modèle hiérarchique Monte Carlo EM Species assemblages Finite mixture models Coexistence Mixed mode data Multivariate data Latent gaussian model Hierarchical model Model-based clustering Spatial data
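Model-based clustering, as described in record 149, fits a mixture model and reads the classification uncertainty from the posterior membership probabilities. The sketch below is a deliberately simplified stand-in: plain EM for a two-component, one-dimensional Gaussian mixture, whereas the thesis works with multivariate and hierarchical models and lists Monte Carlo EM among its keywords. The function names, the min/max initialization and the variance floor are assumptions for illustration.

```python
import math

def normal_pdf(x, mean, var):
    """Density of a univariate normal distribution."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

def em_two_gaussians(data, iters=50, var_floor=1e-6):
    """Fit a two-component 1D Gaussian mixture by EM.

    Returns (weights, means, variances, responsibilities); responsibilities[i][k]
    is the posterior probability that observation i belongs to component k.
    """
    n = len(data)
    centre = sum(data) / n
    spread = max(sum((x - centre) ** 2 for x in data) / n, var_floor)
    weights, means, variances = [0.5, 0.5], [min(data), max(data)], [spread, spread]
    resp = []
    for _ in range(iters):
        # E-step: posterior membership probabilities under current parameters.
        resp = []
        for x in data:
            p = [weights[k] * normal_pdf(x, means[k], variances[k]) for k in range(2)]
            total = p[0] + p[1]
            resp.append([p[0] / total, p[1] / total] if total > 0 else [0.5, 0.5])
        # M-step: weighted maximum-likelihood updates.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            if nk < 1e-12:
                continue  # leave an empty component unchanged
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
            var = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, data)) / nk
            variances[k] = max(var, var_floor)
    return weights, means, variances, resp

# Example: abundances forming two well-separated groups.
weights, means, variances, resp = em_two_gaussians([-0.2, 0.0, 0.2, 9.8, 10.0, 10.2])
```

Each row of `resp` gives the soft assignment of one observation, which is the uncertainty measure that hard clustering methods do not provide.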
150	Performance analysis of EM-MPM and K-means clustering in 3D ultrasound breast image segmentation Yang, Huanyi 05 1900 (has links) Indiana University-Purdue University Indianapolis (IUPUI) / Mammographic density is an important risk factor for breast cancer, and detection and screening at an early stage could help save lives. To analyze the breast density distribution, a good segmentation algorithm is needed. In this thesis, we compared two widely used segmentation algorithms, EM-MPM and K-means clustering. We applied them to twenty cases of synthetic phantom ultrasound tomography (UST) and nine cases of clinical mammogram and UST images. From the synthetic phantom segmentation comparison, we found that EM-MPM achieves better segmentation accuracy than K-means clustering, because its segmentation result fits the ground truth data very well (with superior Tanimoto Coefficient and Parenchyma Percentage). EM-MPM is able to use a Bayesian prior assumption, which takes advantage of the 3D structure and finds a better localized segmentation. EM-MPM performs significantly better for highly dense tissue scattered within low-density tissue and for volumes with low contrast between high- and low-density tissues. For the clinical mammograms, the image segmentation comparison again shows that EM-MPM outperforms K-means clustering, since it identifies the dense tissue more clearly and accurately. The superior EM-MPM results shown in this study suggest a promising future application to density proportion and potential cancer risk evaluation. segmentation, EM-MPM, ultrasound Breast -- Ultrasonic imaging Image processing -- Digital techniques Bayesian statistical decision theory Tomography Digital images -- Noise Three-dimensional imaging in medicine Diagnostic imaging -- Data processing Cluster analysis -- Data processing Computer algorithms -- Research
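Record 150 scores segmentations against ground truth with the Tanimoto coefficient and uses K-means clustering as one of the two compared methods. The sketch below shows both ideas on a flat list of intensities: a two-class K-means that labels voxels as dense or not dense, and the Tanimoto coefficient (intersection over union) of two binary masks. The function names, the min/max initialization, and the convention that two empty masks score 1.0 are assumptions; EM-MPM itself is not implemented here.

```python
def kmeans_two_class(values, iters=20):
    """Label each intensity 0 (low) or 1 (high) with two-centre K-means.

    Centres start at the minimum and maximum intensity, so the result is
    deterministic for a given input.
    """
    low, high = min(values), max(values)
    labels = [0] * len(values)
    for _ in range(iters):
        # Assignment step: nearest centre wins, ties go to the low class.
        labels = [1 if abs(v - high) < abs(v - low) else 0 for v in values]
        # Update step: each centre moves to the mean of its members.
        high_members = [v for v, lab in zip(values, labels) if lab == 1]
        low_members = [v for v, lab in zip(values, labels) if lab == 0]
        if high_members:
            high = sum(high_members) / len(high_members)
        if low_members:
            low = sum(low_members) / len(low_members)
    return labels

def tanimoto(mask_a, mask_b):
    """Tanimoto coefficient |A and B| / |A or B| of two equal-length binary masks."""
    if len(mask_a) != len(mask_b):
        raise ValueError("masks must have the same length")
    both = sum(1 for a, b in zip(mask_a, mask_b) if a and b)
    either = sum(1 for a, b in zip(mask_a, mask_b) if a or b)
    return both / either if either else 1.0

# Example: segment made-up intensities and score them against a reference mask.
intensities = [0.1, 0.2, 0.15, 0.9, 0.8, 0.85]
score = tanimoto(kmeans_two_class(intensities), [0, 0, 0, 1, 1, 1])
```

K-means here looks only at each voxel's intensity, whereas the abstract attributes the advantage of EM-MPM to a Bayesian prior that exploits the 3D spatial structure.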