Global ETD Search

11	Bayesian and Frequentist Approaches for the Analysis of Multiple Endpoints Data Resulting from Exposure to Multiple Health Stressors. Nyirabahizi, Epiphanie 08 March 2010 (has links) In risk analysis, Benchmark dose (BMD)methodology is used to quantify the risk associated with exposure to stressors such as environmental chemicals. It consists of fitting a mathematical model to the exposure data and the BMD is the dose expected to result in a pre-specified response or benchmark response (BMR). Most available exposure data are from single chemical exposure, but living objects are exposed to multiple sources of hazards. Furthermore, in some studies, researchers may observe multiple endpoints on one subject. Statistical approaches to address multiple endpoints problem can be partitioned into a dimension reduction group and a dimension preservative group. Composite scores using desirability function is used, as a dimension reduction method, to evaluate neurotoxicity effects of a mixture of five organophosphate pesticides (OP) at a fixed mixing ratio ray, and five endpoints were observed. Then, a Bayesian hierarchical model approach, as a single unifying dimension preservative method is introduced to evaluate the risk associated with the exposure to mixtures chemicals. At a pre-specied vector of BMR of interest, the method estimates a tolerable area referred to as benchmark dose tolerable area (BMDTA) in multidimensional Euclidean plan. Endpoints defining the BMDTA are determined and model uncertainty and model selection problems are addressed by using the Bayesian Model Averaging (BMA) method. Benchmark dose tolerable area Multiple endpoints data Benchmark dose tolerable region Bayesian Hierarchical Models MCMC Bayesian Model averaging Composite scores. Biostatistics Physical Sciences and Mathematics Statistics and Probability
12	Méthodes bayésiennes en génétique des populations : relations entre structure génétique des populations et environnement / Bayesian methods for population genetics : relationships between genetic population structure and environment. Jay, Flora 14 November 2011 (has links) Nous présentons une nouvelle méthode pour étudier les relations entre la structure génétique des populations et l'environnement. Cette méthode repose sur des modèles hiérarchiques bayésiens qui utilisent conjointement des données génétiques multi-locus et des données spatiales, environnementales et/ou culturelles. Elle permet d'estimer la structure génétique des populations, d'évaluer ses liens avec des covariables non génétiques, et de projeter la structure génétique des populations en fonction de ces covariables. Dans un premier temps, nous avons appliqué notre approche à des données de génétique humaine pour évaluer le rôle de la géographie et des langages dans la structure génétique des populations amérindiennes. Dans un deuxième temps, nous avons étudié la structure génétique des populations pour 20 espèces de plantes alpines et nous avons projeté les modifications intra spécifiques qui pourront être causées par le réchauffement climatique. / We introduce a new method to study the relationships between population genetic structure and environment. This method is based on Bayesian hierarchical models which use both multi-loci genetic data, and spatial, environmental, and/or cultural data. Our method provides the inference of population genetic structure, the evaluation of the relationships between the structure and non-genetic covariates, and the prediction of population genetic structure based on these covariates. We present two applications of our Bayesian method. First, we used human genetic data to evaluate the role of geography and languages in shaping Native American population structure. Second, we studied the population genetic structure of 20 Alpine plant species and we forecasted intra-specific changes in response to global warming. STAR Structure génétique des populations Covariables environnementales Modèles bayésiens hiérarchiques Modèles à classes latentes MCMC Modèles bioclimatiques Population genetic structure Environmental covariates Bayesian hierarchical models Latent class models MCMC Bioclimatic models
13	Marketing Mix Modelling: A comparative study of statistical models / En jämförelsestudie av statistiska modeller i en Marketing Mix Modelling-kontext Wigren, Richard, Cornell, Filip January 2019 (has links) Deciding the optimal media advertisement spending is a complex issue that many companies today are facing. With the rise of new ways to market products, the choices can appear infinite. One methodical way to do this is to use Marketing Mix Modelling (MMM), in which statistical modelling is used to attribute sales to media spendings. However, many problems arise during the modelling. Modelling and mitigation of uncertainty, time-dependencies of sales, incorporation of expert information and interpretation of models are all issues that need to be addressed. This thesis aims to investigate the effectiveness of eight different statistical and machine learning methods in terms of prediction accuracy and certainty, each one addressing one of the previously mentioned issues. It is concluded that while Shapley Value Regression has the highest certainty in terms of coefficient estimation, it sacrifices some prediction accuracy. The overall highest performing model is the Bayesian hierarchical model, achieving both high prediction accuracy and high certainty. Marketing mix modelling Media mix modelling Model evaluation Time series regression XGBoost Bayesian structural time series Bayesian hierarchical models Computer Engineering Datorteknik
14	Quantification of Uncertainties in Urban Precipitation Extremes Chandra Rupa, R January 2017 (has links) (PDF) Urbanisation alters the hydrologic response of a catchment, resulting in increased runoff rates and volumes, and loss of infiltration and base flow. Quantification of uncertainties is important in hydrologic designs of urban infrastructure. Major sources of uncertainty in the Intensity Duration Frequency (IDF) relationships are due to insufficient quantity and quality of data leading to parameter uncertainty and, in the case of projections of future IDF relationships under climate change, uncertainty arising from use of multiple General Circulation Models (GCMs) and scenarios. The work presented in the thesis presents methodologies to quantify the uncertainties arising from parameters of the distribution fitted to data and the multiple GCMs using a Bayesian approach. High uncertainties in GEV parameters and return levels are observed at shorter durations for Bangalore City. Twenty six GCMs from the CMIP5 datasets, along with four RCP scenarios are considered for studying the effects of climate change. It is observed that the uncertainty in short duration rainfall return levels is high when compared to the longer durations. Further it is observed that parameter uncertainty is large compared to the model uncertainty. Disaggregation of precipitation extremes from larger time scales to smaller time scales when the extremes are modeled with the GPD is burdened with difficulties arising from varying thresholds for different durations. In this study, the scale invariance theory is used to develop a disaggregation model for precipitation extremes exceeding specified thresholds. A scaling relationship is developed for a range of thresholds obtained from a set of quantiles of non-zero precipitation of different durations. The disaggregation model is applied to precipitation datasets of Berlin City, Germany and Bangalore City, India. From both the applications, it is observed that the uncertainty in the scaling exponent has a considerable effect on uncertainty in scaled parameters and return levels of shorter durations. A Bayesian hierarchical model is used to obtain spatial distribution of return levels of precipitation extremes in urban areas and quantify the associated uncertainty. Applicability of the methodology is demonstrated with data from 19 telemetric rain gauge stations in Bangalore City, India. For this case study, it is inferred that the elevation and mean monsoon precipitation are the predominant covariates for annual maximum precipitation. For the monsoon maximum precipitation, it is observed that the geographic covariates dominate while for the summer maximum precipitation, elevation and mean summer precipitation are the predominant covariates. In this work, variation in the dependence structure of extreme precipitation within an urban area and its surrounding non-urban areas at various durations is studied. The Berlin City, Germany, with surrounding non-urban area is considered to demonstrate the methodology. For this case study, the hourly precipitation shows independence within the city even at small distances, whereas the daily precipitation shows a high degree of dependence. This dependence structure of the daily precipitation gets masked as more and more surrounding non-urban areas are included in the analysis. The geographical covariates are seen to be predominant within the city and the climatological covariates prevail when non-urban areas are added. These results suggest the importance of quantification of dependence structure of spatial precipitation at the sub-daily timescales, as well as the need to more precisely model spatial extremes within the urban areas. The work presented in this thesis thus contributes to quantification of uncertainty in precipitation extremes through developing methodologies for generating probabilistic future IDF relationships under climate change, spatial mapping of probabilistic return levels and modeling dependence structure of extreme precipitation in urban areas at fine resolutions. Urban Precipitation Extremes Intensity Duration Frequency General Circulation Models Urban Heat Island (UHI) Modelling Precipitation Extremes Modelling Urban Precipitation Bayesian Hierarchical Models (BHM) Generalized Pareto Distribution Civil Engineering
15	Méthodes statistiques pour la différenciation génotypique des plantes à l’aide des modèles de croissance / Statistical methods for the genotypic differentiation of plants using growth models Viaud, Gautier 22 January 2018 (has links) Les modèles de croissance de plantes peuvent être utilisés afin de prédire des quantités d’intérêt ou évaluer la variabilité génotypique au sein d’une population de plantes ; ce double usage est mis en évidence au sein de ce travail. Trois modèles de plantes sont ainsi considérés (LNAS pour la betterave et le blé, GreenLab pour Arabidopsis thaliana) au sein du cadre mathématique des modèles à espace d’états généraux.Une nouvelle plate-forme de calcul générique pour la modélisation et l’inférence statistique (ADJUSTIN’) a été développée en Julia, permettant la simulation des modèles de croissance de plantes considérés ainsi que l’utilisation de techniques d’estimation de pointe telles que les méthodes de Monte Carlo par chaînes de Markov ou de Monte Carlo séquentielles.L’inférence statistique au sein des modèles de croissance de plantes étant de première importance pour des applications concrètes telles que la prédiction de rendement, les méthodes d’estimation de paramètres et d’états au sein de modèles à espaces d’états et dans un cadre bayésien furent tout d’abord étudiées, et plusieurs cas d’étude pour les plantes considérées sont analysés pour le cas d’une plante individuelle.La caractérisation de la variabilité au sein d’une population de plantes est envisagée à travers les distributions des paramètres de population au sein de modèles hiérarchiques bayésiens. Cette approche requérant l’acquisition de nombreuses données pour chaque individu, un algorithme de segmentation-suivi pour l’analyse d’images d’Arabidopsis thaliana, obtenues grâce au Phénoscope, une plate-forme de phénotypage à haut rendement de l’INRA Versailles, est proposé.Finalement, l’intérêt de l’utilisation des modèles hiérarchiques bayésiens pour la mise en évidence de la variabilité au sein d’une population de plantes est discutée. D’abord par l’étude de différents scénarios sur des données simulées, et enfin en utilisant les données expérimentales obtenues à partir de l’analyse d’images pour une population d’Arabidopsis thaliana comprenant 48 individus. / Plant growth models can be used in order to predict quantities of interest or assess the genotypic variability of a population of plants; this dual use is emphasized throughout this work.Three plant growth models are therefore considered (LNAS for sugar beet and wheat, GreenLab for Arabidopsis thaliana) within the mathematical framework of general state space models.A new generic computing platform for modelling and statistical inference (ADJUSTIN’) has been developed in Julia, allowing to simulate the plant growth models considered as well as the use of state-of-the-art estimation techniques such as Markov chain Monte Carlo and sequential Monte Carlo methods.Statistical inference within plant growth models is of primary importance for concrete applications such as yield prediction, parameter and state estimation methods within general state-space models in a Bayesian framework were first studied and several case studies for the plants considered are then investigated in the case of an individual plant.The characterization of the variability of a population of plants is envisioned through the distributions of parameters using Bayesian hierarchical models. This approach requiring the acquisition of numerous data for each individual, a segmentation-tracking algorithm for the analysis of images of Arabidopsis thaliana, obtained thanks to the Phenoscope, a high-throughput phenotyping platform of INRA Versailles, is proposed.Finally, the interest of using Bayesian hierarchical models to evidence the variability of a population of plants is discussed. First through the study of different scenarios on simulated data, and then by using the experimental data acquired via image analysis for the population of Arabidopsis thaliana comprising 48 individuals. Modèles de croissance de plantes Inférence statistique Julia Analyse d'images Différentiation génotypique Modèles hiérarchiques bayésiens Plant growth models Statistical inference Julia Image analysis Genotypic differentiation Bayesian hierarchical models
16	Influência da estrutura da vegetação sobre a diversidade e detectabilidade das espécies de aves do Cerrado / Influence of vegetation structure on the diversity and detectability of Cerrado birds Rodrigues, Rodolpho Credo 12 August 2016 (has links) Em diversos estudos ao redor do globo, a estrutura e heterogeneidade da vegetação têm se mostrado um fator determinante na diversidade de espécies de aves e também de outros grupos de animais. O Cerrado é o segundo mais extenso e mais ameaçado bioma de ocorrência no Brasil. Este bioma também é caracterizado por um evidente gradiente ambiental de estrutura e heterogeneidade de vegetação. Na presente tese analisamos a influência da estrutura e heterogeneidade da vegetação sobre a diversidade em comunidades de aves do Cerrado. Nossa expectativa era corroborar a “Hipótese de Heterogeneidade de Habitats”, que propõe que quanto maior a estrutura e heterogeneidade da vegetação, maior será a diversidade de espécies. No primeiro capítulo, realizamos uma compilação sistemática de estudos publicados sobre a diversidade de aves em áreas ocupadas por algumas fisionomias típicas de Cerrado lato sensu, com o intuito de analisar o conhecimento obtido até então acerca da relação entre diversidade de aves e a estrutura da vegetação no Cerrado. Foram selecionadas 72 amostras de 22 estudos, sendo que estas amostras variaram quanto ao tipo fisionomia amostrada e o método amostral empregado, além de também estarem disponíveis em diferentes artigos e serem realizadas em diferentes regiões geográficas. Para análises destes dados, utilizamos a análise de modelos lineares generalizados de efeitos mistos (modelo com distribuição de erros poisson), que permite analisar os efeitos de variáveis fixas e aleatórias sobre a variável explicativa (riqueza de espécies). As variáveis fixas foram o tipo de vegetação amostrada (vegetação campestre, savânica e florestal) e o método amostral empregado (ponto fixo, transecto e redes de neblina). Já as variáveis de efeito aleatório utilizadas foram o estudo onde os dados foram publicados, o autor de cada estudo e a localidade geográfica. O efeito destas variáveis aleatórias poderiam afetar somente os interceptos das relações entre as variáveis fixas e a variável explicativa ou poderiam alterar a relação entre as variáveis fixas e explicativa. Construímos diversos modelos a partir da combinação de variáveis de efeito fixo e aleatório e a seleção do modelo mais parcimonioso foi feito por meio do critério AICc (critério de informação de Akaike corrigido para pequenas amostras). O modelo que apresentou menor valor de AICc (mais parcimonioso) foi aquele que incluiu os efeitos de ambas variáveis de efeito fixo (fisionomia e método amostral) e também um efeito da interação entre estas duas variáveis. Neste modelo também foram incluídos os efeitos das variáveis aleatórias estudo e localidade geográfica sobre os interceptos das relações entre as variáveis de efeito fixo e a variável explicativa. Estes resultados mostraram que a riqueza de espécies de aves em nosso estudo variou não só em função da fisionomia e do método amostral empregado, mas dependendo do método amostral utilizado a relação entre riqueza e fisionomia também foi alterada. Portanto, esta interação não permitiu que fosse estimada a relação entre fisionomia e riqueza sem considerar o efeito dos métodos. Já os efeitos das variáveis aleatórias mostraram que a variação estimada nos interceptos entre estudos foi duas vezes maior do que a variação estimada entre localidades geográficas. O efeito da interação entre as variáveis fisionomia e método amostral apontou para a existência de heterogeneidade de detecção entre locais com diferentes fisionomias, além também de um efeito das fisionomias na efetividade dos diferentes métodos amostrais. A influência dos métodos amostrais no número de espécies observadas em cada fisonomia pode ser esperada devido às diferenças intrínsecas dos métodos, já que ponto fixo e transecto são baseados em contatos visuais e auditivos com as espécies, enquanto que o método de rede de neblina consiste na captura passiva das espécies que voam na altura das redes. Assim, redes de neblina podem ser mais efetivas em habitats menos estruturados (por ex. campos limpos e sujos), onde a rede alcança quase todo os estratos de vegetação. No entanto, o método de transecto pode ser mais efetivo que o método de ponto fixo em áreas de florestas, pois nestes hábitats as espécies tendem a ter territórios menores e o deslocamento do observador proporciona ao observador cobrir um maior número de terrítórios. Por outro lado, o ponto fixo pode ser mais vantajoso por não produzir ruído e afugentar as espécies, o que pode ser uma desvantagem do método de transecto. Outros fatores, como a experiência e número de observadores, número de pontos amostrais, número de redes utilizadas e comprimento de transectos, podem explicar a grande variação estimada entre os estudos. Uma das maneiras de se contornar estes efeitos metodológicos é utilizar métodos desenvolvidos especialmente para lidar com diferentes probabilidades de detecção entre espécies, entre sítios e até métodos amostrais, o que poderia render dados mais confiáveis para o estudo da ecologia das espécies e para a elaboração de planos de manejo e/ou conservação. No segundo capítulo, a relação entre diversidade de aves e estrutura da vegetação foi analisada a partir de dados coletados em campo e utilizando um protocolo de amostragem específico para se estimar e considerar os efeitos da vegetação sobre a detecção das espécies. As amostragens foram realizadas em um dos maiores e mais preservados remanescentes de Cerrado (Parque Nacional Grande Sertão Veredas-PARNA GSV) e consistiram do registro das espécies de aves em 32 áreas dispostas em um gradiente de vegetação de Cerrado, que variaram desde campos limpos e sujos, campos cerrado a cerrados sensu stricto. O cálculo da riqueza de espécies de aves em cada sítio foi realizado através de modelos de ocupação-detecção, adaptados para estimar a riqueza de espécies em comunidades. A vegetação, por sua vez, foi medida a partir de estimativas de presença da vegetação entre 0 e 4 m de altura (16 intervalos de 22,5 cm cada um) e duas variáveis de estrutura foram obtidas a partir de uma análise de componentes principais, que foi aplicada para resumir a variação da presença de vegetação nos 16 intervalos de altura. Estas variáveis de vegetação foram relacionadas tanto com a ocupação quanto com a detecção das espécies, já que a estrutura da vegetação poderia influenciar não só a ocorrência mas também a detecção das espécies. O dia da amostragem e também a temperatura no momento da amostragem também foram incluídas como covariáveis que poderiam afetar a detecção. Após a estimativa da riqueza de espécies pelo modelo de ocupação-detecção para comunidades, esta riqueza estimada foi relacionada por uma função quadrática com a estrutura da vegetação usando um modelo bayesiano de metanálise, que permitiu incluir a incerteza nas estimativas de riqueza na análise. A título de comparação, também foi ajustado um modelo quadrático GLM (distribuição de erros normal) aos dados de riqueza observada. Os resultados mostraram que a riqueza estimada a partir dos dados das 38 espécies mais detectadas durante as amostragens teve uma fraca relação com as duas covariáveis de estrutura de vegetação, sendo que houve uma maior riqueza de espécies em sítios com vegetação intermediária em altura e uma maior riqueza de espécies de aves em sítios onde houve maior presença de vegetação abaixo de 2 m de altura. No entanto, as relações entre riqueza estimada e estas covariáveis foi menos intensa mas qualitativamente similar às relações entre a riqueza observada e as covariáveis de vegetação. A menor intensidade nas relações da riqueza estimada foi evidenciada principalmente em ambos os extremos do gradiente de estrutura vertical da vegetação e também nas áreas com menor presença de vegetação abaixo de 2 m. Estes resultados mostraram que o efeito da detecção pode alterar o efeito da relação entre riqueza de espécies e estrutura de vegetação. Além disso, ao menos para as 38 espécies mais comumente encontradas na área de estudo, os resultados apontam para a importância de todo o gradiente de estrutura da vegetação para a manutenção da riqueza de espécies de aves no Cerrado. Futuros estudos que visem aprimorar o uso destes modelos de ocupação e detecção para comunidades são fundamentais para permitir o uso dos dados de todas as espécies da comunidade. Além disto, outros estudos que se proponham a analisar a dinâmica e composição das comunidades de aves nestes gradientes de estrutura de vegetação são fundamentais para um maior conhecimento sobre a ecologia e conservação das aves no Cerrado / In several studies around the globe, the structure and diversity of vegetation have been shown to be a determining factor in the diversity of species of birds and also other groups of animals. The Cerrado is the second most extensive and most threatened biome occurrence in Brazil. This biome is also characterized by an obvious environmental gradient of vegetation structure and heterogeneity. In this thesis we analysed the influence of the structure and diversity of the vegetation on the diversity in the Cerrado bird communities. Our expectation was to support the “Habitat Heterogeneity Hypothesis” which suggests that the higher the structure and diversity of vegetation, the greater the diversity of species. In the first chapter, we conducted a systematic compilation of published studies on the diversity of birds in areas occupied by some typical physiognomy of Cerrado textit lato sensu, in order to analyze the knowledge obtained so far about the relationship between diversity of birds and the structure of the vegetation in the Cerrado. We selected 72 samples from 22 studies, and these samples varied as the sampled vegetation physiognomy, the sampling method used, and they also are available in different articles and be carried out in different geographical regions. We performed the analysis of generalized linear mixed effects models (model poisson distribution errors), which allows us to analyse the effects of fixed and random variables on the explanatory variable (species richness). Fixed variables were the type of sampled vegetation (grassland, savanna and forest) and the sample method employed (fixed point, transect and mist nets). The random variables used were the study where the data were published, the author of each study and geographic location. These random variables could only affect the intercepts of the relationship between fixed and variable explanatory variable or could alter the relationship between fixed and explanatory variables. We built several models from the combination of fixed and random effects variables and selection the most parsimonious model was made by the AIC criterion (Akaike information criterion corrected for small samples). The model that showed lower value of AIC (more parsimonious) was the one that included the effects of both fixed effect variables (physiognomy and sampling method) and also an effect of the interaction between these two variables. In this model were also included the effects of random variables study and geographic location of the intercepts of the relationship between the fixed effect variables and the explanatory variable. These results showed that the bird species richness in our study varied not only in terms of physiognomy and sample method, but depending on the sampling method used the relationship between richness and physiognomy has also changed. Therefore, this interaction does not allowed us to estimate the relationship between physiognomy and richness without considering the effect of the methods. Since the effects of random variables showed that the variation in the estimated intercept between studies was twice larger than the estimated variation between geographic locations. The effect of interaction between the vegetation physiognomy and sampling method variables pointed to the existence of heterogeneity detection between locations with different physiognomies, in addition also of an effect of the physiognomies in the effectiveness of different sampling methods. The influence of the sampling method in the number of species observed in each physiognomy may be expected due to intrinsic differences in the methods, since fixed point counts and transect are based on visual and aural contacts with the species, while the mist net method consists in passive capture of species flying at the time of the networks. Thus, mist nets may be more effective in less structured environments (eg. Clean and dirty fields) where the net reaches virtually all vegetation layers. However, transect method can be more effective than the fixed point method in areas of forests since in these habitats species tend to have smaller territory areas, and the observer movement provides the observer cover greater areas. On the other hand, the point counts can be more advantageous not to produce noise and chase species, which may be a disadvantage of transect method. Other factors, such as experience and number of observers, the number of sampling points, the number of nets used and length of transects, may explain the wide variation between studies estimated. One of the ways to overcome these methodological effects is to use methods developed especially to deal with different probabilities of detection of species, between sites and sampling methods, which could yield more reliable data for the ecological study of the species and the development of management plans and/or conservation. In the second chapter, the relationship between diversity of birds and vegetation structure was analysed from data collected in the field and using a specific sampling protocol to estimate and consider the effects of vegetation on the detection of species. The samples were taken in one of the largest and well preserved remnants of Cerrado (Grande Sertão Veredas National Park-PARNA GSV) and consisted of the record of bird speciesin 32 areas arranged in a Cerrado vegetation gradient, ranging from grasslands, open and dense savannas. The calculation of the bird species richness at each site was conducted using occupancy-detection models adapted to estimate the number of species in communities. The vegetation, in turn, was measured from estimates of the presence of vegetation in height intervals between 0 and 4 m (16 intervals of 22.5 cm each) and two structure variables were obtained from a principal component analysis applied to summarize the variation of the vegetation presence in height intervals. These vegetation variables were related to both the occupation and detection of species, since the vegetation structure could influence not only the occurrence but also the detection of species. The day of sampling and also the temperature at the time of sampling were also included as covariates that may a_ect the detection. After the estimation of species richness by model occupancy detection for communities, this estimated richness was related by a quadratic function with the vegetation structure using a Bayesian meta-analysis model, which allowed us include uncertainty in richness estimates. By way of comparison, we also fit a quadratic model GLM (normal distribution errors) to the observed richness data. The results showed that the richness estimated from the data of the 38 most detected species during sampling had a weak relationship with both covariates vegetation structure, and there was a greater number of species at sites with intermediate vegetation height and greater bird species richness in places where there was a greater presence of vegetation below 2 m in height. However, relations between estimated richness and these covariates was less intense but qualitatively similar to the relationship between observed richness and vegetation covariates. The lowest intensity in the estimated richness relationship was observed mainly at both ends of the vertical gradient of vegetation and also in areas with less presence of vegetation below 2 m. These results showed that the effect of detection can change the effect of the relationship between species richness and vegetation structure. Moreover, at least for the 38 species most commonly found in the study area, the results point to the importance of the entire vegetation structure gradient to maintain the bird species richness in Cerrado. Future studies aiming to improve the use of these models of occupation and detection for communities are essential to allow the use of data of all species in the community. In addition, other studies that propose to analyse the dynamics and composition of bird communities in these vegetation structure gradients are fundamental for a better understanding of the ecology and conservation of Cerrado birds Avifauna Avifauna Bayesian hierarchical models Biodiversity conservation Biodiversity hotspot Conservação Detecção Detection Environmental gradient Fitofisionomia Gradiente ambiental Habitat heterogeneity Heterogeneidade de habitats Hotspot Mixed-e_ects models Modelos bayesianos hierárquicos Modelos de efeitos mistos Occupancy Ocupação Savana Savanna Vegetation physiognomy
17	Aplicações em meta-análise sob um enfoque bayesiano usando dados médicos. Pissini, Carla Fernanda 21 March 2006 (has links) Made available in DSpace on 2016-06-02T20:06:11Z (GMT). No. of bitstreams: 1 DissCFP.pdf: 956101 bytes, checksum: e21a11e1dc4754a5751b0b0840943082 (MD5) Previous issue date: 2006-03-21 / Financiadora de Estudos e Projetos / In this work, we consider the use of Meta-analysis with a Bayesian approach. Meta-analysis is a statistical technique that combines the results of di¤erent independent studies with purpose to find general conclusions. This term was introduced by Glass (1976) and it has been used when the number of studies about some research project is small. Usually, the models for Meta-analysis assume a large number of parameters and the Bayesian approach using MCMC (Markov Chain Monte Carlo) methods is a good alternative to combine information of independent studies, to obtain accutrate inferences about a specified treatment. As illustration, we consider real medical data sets on di¤erent studies, in which, we consider fixed and random e¤ects models. We also assume mixture of normal distributions for the error of the models. Another application is considered with educational data. / Neste trabalho, consideramos o uso de Meta-análise sob um enfoque Bayesiano. Meta-análise é uma técnica estatística que combina resultados de diversos estudos in-dependentes, com o propósito de descrever conclusões gerais. Este termo foi introduzido por Glass (1976) usado quando o número de estudos sobre alguma pesquisa científica é pequeno. Os modelos propostos para Meta-análise usualmente assumem muitos parâmetros e o enfoque Bayesiano com MCMC (Monte Carlo em Cadeias de Markov) é uma alternativa apropriada para combinar informações de estudos independentes. O uso de modelos Bayesianos hierárquicos permite combinações de vários estudos independentes, para a obtenção de inferências precisas sobre um determinado tratamento. Como ilustração numérica consideramos conjuntos de dados médicos de diferentes estudos e, na análise, utilizamos modelos de efeitos fixos e aleatórios e mistura de distribuições normais para o erro do modelo de regressão. Em uma outra aplicação relacionamos Meta-análise e Educação, através do efeito da espectativa do professor associada ao QI dos estudantes. Análise bayesiana hierárquica MCMC Modelo bayesiano hierárquico Meta-análises Bayesian analysis Bayesian hierarchical models Meta-analysis E¤ects random models Mixture of distributions MCMC
18	Influência da estrutura da vegetação sobre a diversidade e detectabilidade das espécies de aves do Cerrado / Influence of vegetation structure on the diversity and detectability of Cerrado birds Rodolpho Credo Rodrigues 12 August 2016 (has links) Em diversos estudos ao redor do globo, a estrutura e heterogeneidade da vegetação têm se mostrado um fator determinante na diversidade de espécies de aves e também de outros grupos de animais. O Cerrado é o segundo mais extenso e mais ameaçado bioma de ocorrência no Brasil. Este bioma também é caracterizado por um evidente gradiente ambiental de estrutura e heterogeneidade de vegetação. Na presente tese analisamos a influência da estrutura e heterogeneidade da vegetação sobre a diversidade em comunidades de aves do Cerrado. Nossa expectativa era corroborar a “Hipótese de Heterogeneidade de Habitats”, que propõe que quanto maior a estrutura e heterogeneidade da vegetação, maior será a diversidade de espécies. No primeiro capítulo, realizamos uma compilação sistemática de estudos publicados sobre a diversidade de aves em áreas ocupadas por algumas fisionomias típicas de Cerrado lato sensu, com o intuito de analisar o conhecimento obtido até então acerca da relação entre diversidade de aves e a estrutura da vegetação no Cerrado. Foram selecionadas 72 amostras de 22 estudos, sendo que estas amostras variaram quanto ao tipo fisionomia amostrada e o método amostral empregado, além de também estarem disponíveis em diferentes artigos e serem realizadas em diferentes regiões geográficas. Para análises destes dados, utilizamos a análise de modelos lineares generalizados de efeitos mistos (modelo com distribuição de erros poisson), que permite analisar os efeitos de variáveis fixas e aleatórias sobre a variável explicativa (riqueza de espécies). As variáveis fixas foram o tipo de vegetação amostrada (vegetação campestre, savânica e florestal) e o método amostral empregado (ponto fixo, transecto e redes de neblina). Já as variáveis de efeito aleatório utilizadas foram o estudo onde os dados foram publicados, o autor de cada estudo e a localidade geográfica. O efeito destas variáveis aleatórias poderiam afetar somente os interceptos das relações entre as variáveis fixas e a variável explicativa ou poderiam alterar a relação entre as variáveis fixas e explicativa. Construímos diversos modelos a partir da combinação de variáveis de efeito fixo e aleatório e a seleção do modelo mais parcimonioso foi feito por meio do critério AICc (critério de informação de Akaike corrigido para pequenas amostras). O modelo que apresentou menor valor de AICc (mais parcimonioso) foi aquele que incluiu os efeitos de ambas variáveis de efeito fixo (fisionomia e método amostral) e também um efeito da interação entre estas duas variáveis. Neste modelo também foram incluídos os efeitos das variáveis aleatórias estudo e localidade geográfica sobre os interceptos das relações entre as variáveis de efeito fixo e a variável explicativa. Estes resultados mostraram que a riqueza de espécies de aves em nosso estudo variou não só em função da fisionomia e do método amostral empregado, mas dependendo do método amostral utilizado a relação entre riqueza e fisionomia também foi alterada. Portanto, esta interação não permitiu que fosse estimada a relação entre fisionomia e riqueza sem considerar o efeito dos métodos. Já os efeitos das variáveis aleatórias mostraram que a variação estimada nos interceptos entre estudos foi duas vezes maior do que a variação estimada entre localidades geográficas. O efeito da interação entre as variáveis fisionomia e método amostral apontou para a existência de heterogeneidade de detecção entre locais com diferentes fisionomias, além também de um efeito das fisionomias na efetividade dos diferentes métodos amostrais. A influência dos métodos amostrais no número de espécies observadas em cada fisonomia pode ser esperada devido às diferenças intrínsecas dos métodos, já que ponto fixo e transecto são baseados em contatos visuais e auditivos com as espécies, enquanto que o método de rede de neblina consiste na captura passiva das espécies que voam na altura das redes. Assim, redes de neblina podem ser mais efetivas em habitats menos estruturados (por ex. campos limpos e sujos), onde a rede alcança quase todo os estratos de vegetação. No entanto, o método de transecto pode ser mais efetivo que o método de ponto fixo em áreas de florestas, pois nestes hábitats as espécies tendem a ter territórios menores e o deslocamento do observador proporciona ao observador cobrir um maior número de terrítórios. Por outro lado, o ponto fixo pode ser mais vantajoso por não produzir ruído e afugentar as espécies, o que pode ser uma desvantagem do método de transecto. Outros fatores, como a experiência e número de observadores, número de pontos amostrais, número de redes utilizadas e comprimento de transectos, podem explicar a grande variação estimada entre os estudos. Uma das maneiras de se contornar estes efeitos metodológicos é utilizar métodos desenvolvidos especialmente para lidar com diferentes probabilidades de detecção entre espécies, entre sítios e até métodos amostrais, o que poderia render dados mais confiáveis para o estudo da ecologia das espécies e para a elaboração de planos de manejo e/ou conservação. No segundo capítulo, a relação entre diversidade de aves e estrutura da vegetação foi analisada a partir de dados coletados em campo e utilizando um protocolo de amostragem específico para se estimar e considerar os efeitos da vegetação sobre a detecção das espécies. As amostragens foram realizadas em um dos maiores e mais preservados remanescentes de Cerrado (Parque Nacional Grande Sertão Veredas-PARNA GSV) e consistiram do registro das espécies de aves em 32 áreas dispostas em um gradiente de vegetação de Cerrado, que variaram desde campos limpos e sujos, campos cerrado a cerrados sensu stricto. O cálculo da riqueza de espécies de aves em cada sítio foi realizado através de modelos de ocupação-detecção, adaptados para estimar a riqueza de espécies em comunidades. A vegetação, por sua vez, foi medida a partir de estimativas de presença da vegetação entre 0 e 4 m de altura (16 intervalos de 22,5 cm cada um) e duas variáveis de estrutura foram obtidas a partir de uma análise de componentes principais, que foi aplicada para resumir a variação da presença de vegetação nos 16 intervalos de altura. Estas variáveis de vegetação foram relacionadas tanto com a ocupação quanto com a detecção das espécies, já que a estrutura da vegetação poderia influenciar não só a ocorrência mas também a detecção das espécies. O dia da amostragem e também a temperatura no momento da amostragem também foram incluídas como covariáveis que poderiam afetar a detecção. Após a estimativa da riqueza de espécies pelo modelo de ocupação-detecção para comunidades, esta riqueza estimada foi relacionada por uma função quadrática com a estrutura da vegetação usando um modelo bayesiano de metanálise, que permitiu incluir a incerteza nas estimativas de riqueza na análise. A título de comparação, também foi ajustado um modelo quadrático GLM (distribuição de erros normal) aos dados de riqueza observada. Os resultados mostraram que a riqueza estimada a partir dos dados das 38 espécies mais detectadas durante as amostragens teve uma fraca relação com as duas covariáveis de estrutura de vegetação, sendo que houve uma maior riqueza de espécies em sítios com vegetação intermediária em altura e uma maior riqueza de espécies de aves em sítios onde houve maior presença de vegetação abaixo de 2 m de altura. No entanto, as relações entre riqueza estimada e estas covariáveis foi menos intensa mas qualitativamente similar às relações entre a riqueza observada e as covariáveis de vegetação. A menor intensidade nas relações da riqueza estimada foi evidenciada principalmente em ambos os extremos do gradiente de estrutura vertical da vegetação e também nas áreas com menor presença de vegetação abaixo de 2 m. Estes resultados mostraram que o efeito da detecção pode alterar o efeito da relação entre riqueza de espécies e estrutura de vegetação. Além disso, ao menos para as 38 espécies mais comumente encontradas na área de estudo, os resultados apontam para a importância de todo o gradiente de estrutura da vegetação para a manutenção da riqueza de espécies de aves no Cerrado. Futuros estudos que visem aprimorar o uso destes modelos de ocupação e detecção para comunidades são fundamentais para permitir o uso dos dados de todas as espécies da comunidade. Além disto, outros estudos que se proponham a analisar a dinâmica e composição das comunidades de aves nestes gradientes de estrutura de vegetação são fundamentais para um maior conhecimento sobre a ecologia e conservação das aves no Cerrado / In several studies around the globe, the structure and diversity of vegetation have been shown to be a determining factor in the diversity of species of birds and also other groups of animals. The Cerrado is the second most extensive and most threatened biome occurrence in Brazil. This biome is also characterized by an obvious environmental gradient of vegetation structure and heterogeneity. In this thesis we analysed the influence of the structure and diversity of the vegetation on the diversity in the Cerrado bird communities. Our expectation was to support the “Habitat Heterogeneity Hypothesis” which suggests that the higher the structure and diversity of vegetation, the greater the diversity of species. In the first chapter, we conducted a systematic compilation of published studies on the diversity of birds in areas occupied by some typical physiognomy of Cerrado textit lato sensu, in order to analyze the knowledge obtained so far about the relationship between diversity of birds and the structure of the vegetation in the Cerrado. We selected 72 samples from 22 studies, and these samples varied as the sampled vegetation physiognomy, the sampling method used, and they also are available in different articles and be carried out in different geographical regions. We performed the analysis of generalized linear mixed effects models (model poisson distribution errors), which allows us to analyse the effects of fixed and random variables on the explanatory variable (species richness). Fixed variables were the type of sampled vegetation (grassland, savanna and forest) and the sample method employed (fixed point, transect and mist nets). The random variables used were the study where the data were published, the author of each study and geographic location. These random variables could only affect the intercepts of the relationship between fixed and variable explanatory variable or could alter the relationship between fixed and explanatory variables. We built several models from the combination of fixed and random effects variables and selection the most parsimonious model was made by the AIC criterion (Akaike information criterion corrected for small samples). The model that showed lower value of AIC (more parsimonious) was the one that included the effects of both fixed effect variables (physiognomy and sampling method) and also an effect of the interaction between these two variables. In this model were also included the effects of random variables study and geographic location of the intercepts of the relationship between the fixed effect variables and the explanatory variable. These results showed that the bird species richness in our study varied not only in terms of physiognomy and sample method, but depending on the sampling method used the relationship between richness and physiognomy has also changed. Therefore, this interaction does not allowed us to estimate the relationship between physiognomy and richness without considering the effect of the methods. Since the effects of random variables showed that the variation in the estimated intercept between studies was twice larger than the estimated variation between geographic locations. The effect of interaction between the vegetation physiognomy and sampling method variables pointed to the existence of heterogeneity detection between locations with different physiognomies, in addition also of an effect of the physiognomies in the effectiveness of different sampling methods. The influence of the sampling method in the number of species observed in each physiognomy may be expected due to intrinsic differences in the methods, since fixed point counts and transect are based on visual and aural contacts with the species, while the mist net method consists in passive capture of species flying at the time of the networks. Thus, mist nets may be more effective in less structured environments (eg. Clean and dirty fields) where the net reaches virtually all vegetation layers. However, transect method can be more effective than the fixed point method in areas of forests since in these habitats species tend to have smaller territory areas, and the observer movement provides the observer cover greater areas. On the other hand, the point counts can be more advantageous not to produce noise and chase species, which may be a disadvantage of transect method. Other factors, such as experience and number of observers, the number of sampling points, the number of nets used and length of transects, may explain the wide variation between studies estimated. One of the ways to overcome these methodological effects is to use methods developed especially to deal with different probabilities of detection of species, between sites and sampling methods, which could yield more reliable data for the ecological study of the species and the development of management plans and/or conservation. In the second chapter, the relationship between diversity of birds and vegetation structure was analysed from data collected in the field and using a specific sampling protocol to estimate and consider the effects of vegetation on the detection of species. The samples were taken in one of the largest and well preserved remnants of Cerrado (Grande Sertão Veredas National Park-PARNA GSV) and consisted of the record of bird speciesin 32 areas arranged in a Cerrado vegetation gradient, ranging from grasslands, open and dense savannas. The calculation of the bird species richness at each site was conducted using occupancy-detection models adapted to estimate the number of species in communities. The vegetation, in turn, was measured from estimates of the presence of vegetation in height intervals between 0 and 4 m (16 intervals of 22.5 cm each) and two structure variables were obtained from a principal component analysis applied to summarize the variation of the vegetation presence in height intervals. These vegetation variables were related to both the occupation and detection of species, since the vegetation structure could influence not only the occurrence but also the detection of species. The day of sampling and also the temperature at the time of sampling were also included as covariates that may a_ect the detection. After the estimation of species richness by model occupancy detection for communities, this estimated richness was related by a quadratic function with the vegetation structure using a Bayesian meta-analysis model, which allowed us include uncertainty in richness estimates. By way of comparison, we also fit a quadratic model GLM (normal distribution errors) to the observed richness data. The results showed that the richness estimated from the data of the 38 most detected species during sampling had a weak relationship with both covariates vegetation structure, and there was a greater number of species at sites with intermediate vegetation height and greater bird species richness in places where there was a greater presence of vegetation below 2 m in height. However, relations between estimated richness and these covariates was less intense but qualitatively similar to the relationship between observed richness and vegetation covariates. The lowest intensity in the estimated richness relationship was observed mainly at both ends of the vertical gradient of vegetation and also in areas with less presence of vegetation below 2 m. These results showed that the effect of detection can change the effect of the relationship between species richness and vegetation structure. Moreover, at least for the 38 species most commonly found in the study area, the results point to the importance of the entire vegetation structure gradient to maintain the bird species richness in Cerrado. Future studies aiming to improve the use of these models of occupation and detection for communities are essential to allow the use of data of all species in the community. In addition, other studies that propose to analyse the dynamics and composition of bird communities in these vegetation structure gradients are fundamental for a better understanding of the ecology and conservation of Cerrado birds Avifauna Conservação Detecção Fitofisionomia Gradiente ambiental Heterogeneidade de habitats Hotspot Modelos bayesianos hierárquicos Modelos de efeitos mistos Ocupação Savana Avifauna Bayesian hierarchical models Biodiversity conservation Biodiversity hotspot Detection Environmental gradient Habitat heterogeneity Mixed-e_ects models Occupancy Savanna Vegetation physiognomy

Search results