Global ETD Search

11	Geochemical Characterization of Tea Leaves (Camellia sinensis) and Soils for Provenance Studies based on Compositional Data Analysis Pospiech, Solveig 17 September 2018 (has links) No description available. 910 550 tea Camellia sinensis compositional data soil-plant interaction plant analysis plant ionom provenance Darjeeling
12	Penalized regression models for compositional data / Métodos de regressão penalizados para dados composicionais Shimizu, Taciana Kisaki Oliveira 10 December 2018 (has links) Compositional data consist of known vectors such as compositions whose components are positive and defined in the interval (0,1) representing proportions or fractions of a whole, where the sum of these components must be equal to one. Compositional data is present in different areas, such as in geology, ecology, economy, medicine, among many others. Thus, there is great interest in new modeling approaches for compositional data, mainly when there is an influence of covariates in this type of data. In this context, the main objective of this thesis is to address the new approach of regression models applied in compositional data. The main idea consists of developing a marked method by penalized regression, in particular the Lasso (least absolute shrinkage and selection operator), elastic net and Spike-and-Slab Lasso (SSL) for the estimation of parameters of the models. In particular, we envision developing this modeling for compositional data, when the number of explanatory variables exceeds the number of observations in the presence of large databases, and when there are constraints on the dependent variables and covariates. / Dados composicionais consistem em vetores conhecidos como composições cujos componentes são positivos e definidos no intervalo (0,1) representando proporções ou frações de um todo, sendo que a soma desses componentes totalizam um. Tais dados estão presentes em diferentes áreas, como na geologia, ecologia, economia, medicina entre outras. Desta forma, há um grande interesse em ampliar os conhecimentos acerca da modelagem de dados composicionais, principalmente quando há a influência de covariáveis nesse tipo de dado. Nesse contexto, a presente tese tem por objetivo propor uma nova abordagem de modelos de regressão aplicada em dados composicionais. A ideia central consiste no desenvolvimento de um método balizado por regressão penalizada, em particular Lasso, do inglês least absolute shrinkage and selection operator, elastic net e Spike-e-Slab Lasso (SSL) para a estimação dos parâmetros do modelo. Em particular, visionamos o desenvolvimento dessa modelagem para dados composicionais, com o número de variáveis explicativas excedendo o número de observações e na presença de grandes bases de dados, e além disso, quando há restrição na variável resposta e nas covariáveis. Compositional data Coordenadas log-razão isométricas Dados composicionais Isometric log-ratio coordinates Modelo de regressão Regression model Seleção de variáveis Variable selection
13	Uma análise bayesiana para dados composicionais. Obage, Simone Cristina 03 March 2005 (has links) Made available in DSpace on 2016-06-02T20:05:59Z (GMT). No. of bitstreams: 1 DissSCO.pdf: 3276753 bytes, checksum: eea407b94c282f57d7fb7e97200ee05a (MD5) Previous issue date: 2005-03-03 / Universidade Federal de Sao Carlos / Compositional data are given by vectors of positive numbers with sum equals to one. These kinds of data are common in many applications, as in geology, biology, economy among many others. In this paper, we introduce a Bayesian analysis for compositional data considering additive log-ratio (ALR) and Box-Cox transformations assuming a mul- tivariate normal distribution for correlated errors. These results generalize some existing Bayesian approaches assuming uncorrelated errors. We also consider the use of expo- nential power distributions for uncorrelated errors considering additive log-ratio (ALR) transformation. We illustrate the proposed methodology considering a real data set. / Dados Composicionais são dados por vetores com elementos positivos cuja soma é um. Exemplos típicos de dados desta natureza são encontrados nas mais diversas áreas; como em geologia, biologia, economia entre outras. Neste trabalho, introduzimos uma análise Bayesiana para dados composicionais considerando as transformações razão log-aditiva e Box-Cox, assumindo a distribuição normal multivariada para erros correlacionados. Estes resultados generalizam uma abordagem bayesiana assumindo erros não correlacionados. Também consideramos o uso da distribuição potência exponencial para erros não correla- cionados, assumindo a transformação razão log-aditiva. Nós ilustramos a metodologia proposta considerando um conjunto de dados reais. Estatística - análise Inferência bayesiana Dados composicionais Compositional data Correlated errors Bayesian Inference MCMC
14	A Hierarchical Bayesian Model for the Unmixing Analysis of Compositional Data subject to Unit-sum Constraints Yu, Shiyong 15 May 2015 (has links) Modeling of compositional data is emerging as an active area in statistics. It is assumed that compositional data represent the convex linear mixing of definite numbers of independent sources usually referred to as end members. A generic problem in practice is to appropriately separate the end members and quantify their fractions from compositional data subject to nonnegative and unit-sum constraints. A number of methods essentially related to polytope expansion have been proposed. However, these deterministic methods have some potential problems. In this study, a hierarchical Bayesian model was formulated, and the algorithms were coded in MATLABÒ. A test run using both a synthetic and real-word dataset yields scientifically sound and mathematically optimal outputs broadly consistent with other non-Bayesian methods. Also, the sensitivity of this model to the choice of different priors and structure of the covariance matrix of error were discussed. Applied Statistics
15	A remote sensing and geospatial statistical approach to understanding distribution and evolution of ignimbrites in the Central Andes with a focus on Southern Peru Brandmeier, Melanie 24 February 2014 (has links) No description available. 910 550 Andes ignimbrites remote sensing ASTER alteration mapping compositional data analysis geospatial statistics Ar/Ar dating Peru
16	Transformações em dados composicionais para a aplicação da análise de componentes principais / Transformations in compositional data for application of principal components analysis Messias, Ricardo Matioli 29 April 2016 (has links) A análise de dados composicionais está sendo amplamente utilizada nas diversas áreas do conhecimento como por exemplo na análise de sedimentos rochosos, na comparação de diferentes células e até na análise criminalística na comparação de evidências de crimes. Durante a história da análise deste tipo de dados existiram muitos tipos de ajustes utilizados para contornar o problema da soma constante das variáveis e ainda hoje não temos um consenso de qual a melhor solução a ser utilizada. Neste trabalho, temos como objetivo a enunciação das 7 transformações que mais foram utilizadas ao longo do tempo e suas vantagens e desvantagens. A análise de componentes principais foi escolhida para o comparativo destas transformações. Fizemos a aplicação destas transformações em três bancos de dados reais com características diferentes entre si, comparamos os resultados e analisamos qual das transformações apresentou o melhor desempenho em cada base de dados. Os critérios de comparação foram o percentual da variância explicada, as variáveis que foram mais importantes para a primeira componente principal, cargas das variáveis nas componentes principais mais importantes assim como suas correlações com as variáveis. Também, simulamos quatro estruturas de bases de dados composicionais para avaliar o desempenho das transformações. Para essas comparações e simulações, foram desenvolvidas algumas funções, utilizando o \\textit estatístico R, que visam facilitar a comparação entre as sete transformações, assim auxiliando na escolha de qual das transformações melhor se adapta aos dados. Pelos resultados obtidos notamos que: nas bases de dados reais, os resultados das explicações da variância das transformações são similares e as transformações Ref e Alr mostram melhores desempenhos que as demais; nas quatro estruturas simuladas as transformações Ref e Alr também possuem os melhores resultados na explicação da variância e a interpretação de suas componentes principais são parecidas, assim como as transformações Trad, Log e Clr. Com isso notamos que independentemente da aplicação do logaritmo nas transformações Alr e Log elas apresentaram resultados muitos similares às transformações Ref e Trad, respectivamente, tanto na explicação da variância como na interpretação das componentes principais. / The compositional data analysis is being widely used in several areas of knowledge such as the analysis of rocky sediments, to compare different biological cells and even in forensic analysis to compare crimes evidences. During the history of the analysis of such data, to circumvent the problem of variable\'s constant sum were used many types of adjustments. Until now, we do not have a consensus in which is the best solution to be used in this cases. In this paper, we aim to enunciate seven transformations that most were used over time and their advantages and disadvantages. The principal component analysis was chosen for the comparison of these transformations. We applied this transformations in three real databases with different characteristics, we hope to compare the results and analyze which transformation have the best performance in each database. The comparison criteria were the percentage of explained variance, the variables that were most important to the first principal component,variable\'s loads in the most important principal components as well their correlation with the variables. We also simulated four compositional data bases structures to evaluate the performance of the transformations. For these comparisons and simulations were developed some functions, using the statistical software R, to facilitate comparison between the seven transformations, thus assisting in choosing which of the best transformation fits to the data. From the results we note that: for the real databases, the results of the variance explanation of all transformations are similar, thus Ref and Alr transformations show better performances than the others; in the four simulated structures the Ref and Alr transformations also have the best results in the variance explanation and interpretation of its main components are similar, as well as the transformations Trad, Log and Clr. Thus we note that independently of applying logarithm in and Log and Alr transformations they present very similar results as Ref and Trad transformations, respectively, both in variance explanation and in the interpretation of the principal components. Additive log-ratio Additive log-ratio Center log-ratio Center log-ratio Componentes principais Compositional data Dados composionais Log transformation Principal component analysis Spherical transformation Transformação esférica Transformação log
17	Transformações em dados composicionais para a aplicação da análise de componentes principais / Transformations in compositional data for application of principal components analysis Ricardo Matioli Messias 29 April 2016 (has links) A análise de dados composicionais está sendo amplamente utilizada nas diversas áreas do conhecimento como por exemplo na análise de sedimentos rochosos, na comparação de diferentes células e até na análise criminalística na comparação de evidências de crimes. Durante a história da análise deste tipo de dados existiram muitos tipos de ajustes utilizados para contornar o problema da soma constante das variáveis e ainda hoje não temos um consenso de qual a melhor solução a ser utilizada. Neste trabalho, temos como objetivo a enunciação das 7 transformações que mais foram utilizadas ao longo do tempo e suas vantagens e desvantagens. A análise de componentes principais foi escolhida para o comparativo destas transformações. Fizemos a aplicação destas transformações em três bancos de dados reais com características diferentes entre si, comparamos os resultados e analisamos qual das transformações apresentou o melhor desempenho em cada base de dados. Os critérios de comparação foram o percentual da variância explicada, as variáveis que foram mais importantes para a primeira componente principal, cargas das variáveis nas componentes principais mais importantes assim como suas correlações com as variáveis. Também, simulamos quatro estruturas de bases de dados composicionais para avaliar o desempenho das transformações. Para essas comparações e simulações, foram desenvolvidas algumas funções, utilizando o \\textit estatístico R, que visam facilitar a comparação entre as sete transformações, assim auxiliando na escolha de qual das transformações melhor se adapta aos dados. Pelos resultados obtidos notamos que: nas bases de dados reais, os resultados das explicações da variância das transformações são similares e as transformações Ref e Alr mostram melhores desempenhos que as demais; nas quatro estruturas simuladas as transformações Ref e Alr também possuem os melhores resultados na explicação da variância e a interpretação de suas componentes principais são parecidas, assim como as transformações Trad, Log e Clr. Com isso notamos que independentemente da aplicação do logaritmo nas transformações Alr e Log elas apresentaram resultados muitos similares às transformações Ref e Trad, respectivamente, tanto na explicação da variância como na interpretação das componentes principais. / The compositional data analysis is being widely used in several areas of knowledge such as the analysis of rocky sediments, to compare different biological cells and even in forensic analysis to compare crimes evidences. During the history of the analysis of such data, to circumvent the problem of variable\'s constant sum were used many types of adjustments. Until now, we do not have a consensus in which is the best solution to be used in this cases. In this paper, we aim to enunciate seven transformations that most were used over time and their advantages and disadvantages. The principal component analysis was chosen for the comparison of these transformations. We applied this transformations in three real databases with different characteristics, we hope to compare the results and analyze which transformation have the best performance in each database. The comparison criteria were the percentage of explained variance, the variables that were most important to the first principal component,variable\'s loads in the most important principal components as well their correlation with the variables. We also simulated four compositional data bases structures to evaluate the performance of the transformations. For these comparisons and simulations were developed some functions, using the statistical software R, to facilitate comparison between the seven transformations, thus assisting in choosing which of the best transformation fits to the data. From the results we note that: for the real databases, the results of the variance explanation of all transformations are similar, thus Ref and Alr transformations show better performances than the others; in the four simulated structures the Ref and Alr transformations also have the best results in the variance explanation and interpretation of its main components are similar, as well as the transformations Trad, Log and Clr. Thus we note that independently of applying logarithm in and Log and Alr transformations they present very similar results as Ref and Trad transformations, respectively, both in variance explanation and in the interpretation of the principal components. Additive log-ratio Center log-ratio Componentes principais Dados composionais Transformação esférica Transformação log Additive log-ratio Center log-ratio Compositional data Log transformation Principal component analysis Spherical transformation
18	DirichletReg: Dirichlet Regression for Compositional Data in R Maier, Marco J. 18 January 2014 (has links) (PDF) Dirichlet regression models can be used to analyze a set of variables lying in a bounded interval that sum up to a constant (e.g., proportions, rates, compositions, etc.) exhibiting skewness and heteroscedasticity, without having to transform the data. There are two parametrization for the presented model, one using the common Dirichlet distribution's alpha parameters, and a reparametrization of the alpha's to set up a mean-and-dispersion-like model. By applying appropriate link-functions, a GLM-like framework is set up that allows for the analysis of such data in a straightforward and familiar way, because interpretation is similar to multinomial logistic regression. This paper gives a brief theoretical foundation and describes the implementation as well as application (including worked examples) of Dirichlet regression methods implemented in the package DirichletReg (Maier, 2013) in the R language (R Core Team, 2013). (author's abstract) / Series: Research Report Series / Department of Statistics and Mathematics RVK SK 840, QH 234
19	Evaluation de la biodisponibilité du nickel, cobalt et manganèse dans les poussières de sols ultramafiques et développement d'un outil de bioindication lichénique des poussières émises par les activités minières en Nouvelle Calédonie / Bioavailability estimation of nickel, cobalt and manganese in dust from ultramafic soils and development of a bioindication tool using lichen for dust emitted by mining activities in New caledonia Pasquet, Camille 13 December 2016 (has links) Les terrains ultramafiques de Nouvelle-Calédonie, riches en Ni, Co, Mn et Cr sont exploités par des mines à ciel ouvert, ce qui génère l'émission de poussières riches en métaux. L'objectif de ce travail est de développer des approches pour estimer le risque environnemental lié aux poussières émises par les mines à ciel ouvert et les usines de traitement du minerai de nickel. L'estimation de la fraction biodisponible des métaux contenus dans deux fractions granulométriques de poussières, celle inférieure à 100 ~m mobilisable par le vent (F<1 OO~m.) et celle susceptible de pénétrer le système respiratoire (PM1 0), a été réalisée par extractions cinétiques à I'EDTA. L'obtention des PM10 a nécessité la mise au point d'une technique de tri par transport des particules dans un tube horizontal grâce à un flux d'azote. Les extractions cinétiques ont permis de discrimtner trois fractions de métaux: rapidement extraite, lentement extraite et non biodisponibles. Les concentrations en métaux potentiellement biodisponibles sont toujours très élevées et la fraction lentement extraite est toujours la plus concentrée. Pour F<1 00 ~m. les constantes cinétiques de la fraction lentement extraite sont plus faibles pour les poussières de sols miniers que celles de sols forestiers. Les poussières issues de sols miniers seraient alors un réservoir en éléments métalliques biodisponibles à plus long terme. La bioindication lichénique avec traitement en données compositionnelles des concentrations en métaux permet de définir un indice de dispersion des poussières. Cette méthodologie pourrait appuyer les réseaux de surveillance de la qualité de l'air en Nouvelle-Calédonie. / Bioavailability estimation of nickel, cobalt and manganese in dust from ultramafic soils likely to be mobilized by wind and~eve lopment of a bioindication tool using lichen for dust emitted by mining activities in New Caledonia New Caledonian altered ultramafic soils, particularly rich in Ni, Co, Mn and Cr, are extracted by opencast mines which generale dust rich in metals. The objective of th is work is to develop approaches for environmental risk assessment of dust emitted by opencast mines and nickel ore metallurgical plants. The assessmentof metals' bioavailable fraction from two dust granulometrie size fractions, one less than 100 IJm which is mobilizable by wind (F<1001Jm,) and another one able to penetrate the respiratory system (PM 1 0), has been determined by kinetic extraction with EDT A. The development of a new separation deviee based on particle transport subjected to a nitrogen flux in a horizontal tube has been necessary for PM1 0 segregation. Kinetic extractions le ad to the distinction of th ree metal pools: rapidly labile, less rapidly labile and non-bioavailable. Trace metal potentially bioavailable concentrations were always high and the less rapidly labile pool is always the most concentrated pool. Concerning F<1 001Jm, the less rapidly kinetic constant of the less rapidly labile pool is weaker for mining soils than forest soils. F<1001Jm fractions from mining soils representa more durable reserve in trace metal than the same fraction from forest soils. Bioindication using lichens with compositional data analysis of their metal concentration allow defining an indicator of emission dispersion. This methodology could support air quality monitoring networks in New Caledonia. Ultramafique Éléments métalliques Lichen Mine Extraction cinétique Données compositionnelles Roches ultrabasiques Ultramafic Metalic element Lichen Mine Kinetic extraction Compositional Data Ultrabasic rocks 551.9 552.2 553.48 546.3 304.2 363.7 577.2 333.8 622

Search results