• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 67
  • 31
  • 9
  • 7
  • 6
  • 4
  • 3
  • 2
  • 2
  • 2
  • 1
  • 1
  • 1
  • 1
  • Tagged with
  • 157
  • 157
  • 37
  • 32
  • 32
  • 31
  • 24
  • 24
  • 23
  • 22
  • 22
  • 19
  • 19
  • 19
  • 18
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
21

Bayesian Logistic Regression Model with Integrated Multivariate Normal Approximation for Big Data

Fu, Shuting 28 April 2016 (has links)
The analysis of big data is of great interest today, and this comes with challenges of improving precision and efficiency in estimation and prediction. We study binary data with covariates from numerous small areas, where direct estimation is not reliable, and there is a need to borrow strength from the ensemble. This is generally done using Bayesian logistic regression, but because there are numerous small areas, the exact computation for the logistic regression model becomes challenging. Therefore, we develop an integrated multivariate normal approximation (IMNA) method for binary data with covariates within the Bayesian paradigm, and this procedure is assisted by the empirical logistic transform. Our main goal is to provide the theory of IMNA and to show that it is many times faster than the exact logistic regression method with almost the same accuracy. We apply the IMNA method to the health status binary data (excellent health or otherwise) from the Nepal Living Standards Survey with more than 60,000 households (small areas). We estimate the proportion of Nepalese in excellent health condition for each household. For these data IMNA gives estimates of the household proportions as precise as those from the logistic regression model and it is more than fifty times faster (20 seconds versus 1,066 seconds), and clearly this gain is transferable to bigger data problems.
22

Modelo de calibração ultraestrutural / Ultrastructural calibration model

Talarico, Alina Marcondes 23 January 2014 (has links)
Os programas de Ensaios de Prociência (EP) são utilizados pela sociedade para avaliar a competência e a confiabilidade de laboratórios na execução de medições específicas. Atualmente, diversos grupos de EP foram estabelecidos pelo INMETRO, entre estes, o grupo de testes de motores. Cada grupo é formado por diversos laboratórios que medem o mesmo artefato e suas medições são comparadas através de métodos estatísticos. O grupo de motores escolheu um motor gasolina 1.0, gentilmente cedido pela GM Powertrain, como artefato. A potência do artefato foi medida em 10 pontos de rotação por 6 laboratórios. Aqui, motivados por este conjunto de dados, estendemos o modelo de calibração comparativa de Barnett (1969) para avaliar a compatibilidade dos laboratórios considerando a distribuição t de Student e apresentamos os resultados obtidos das aplicações e simulações a este conjunto de dados / Proficiency Testing (PT) programs are used by society to assess the competence and the reliability in laboratories execution of specific measurements. Nowadays many PT groups were established by INMETRO, including the motor\'s test group. Each group is formed by laboratories measuring the same artifact and their measurements are compared through statistic methods. The motor\'s group chose a gasoline engine 1.0, kindly provided by GM as an artifact. The artifact\'s power was measured at ten points of rotation by 6 laboratories. Here, motivated by this set data, we extend the Barnet comparative calibration model (1969) to assess the compatibility of the laboratories considering the Student-t distribution and show the results obtained from application and simulation of this set data
23

Comparison of Different Methods for Estimating Log-normal Means

Tang, Qi 01 May 2014 (has links)
The log-normal distribution is a popular model in many areas, especially in biostatistics and survival analysis where the data tend to be right skewed. In our research, a total of ten different estimators of log-normal means are compared theoretically. Simulations are done using different values of parameters and sample size. As a result of comparison, ``A degree of freedom adjusted" maximum likelihood estimator and Bayesian estimator under quadratic loss are the best when using the mean square error (MSE) as a criterion. The ten estimators are applied to a real dataset, an environmental study from Naval Construction Battalion Center (NCBC), Super Fund Site in Rhode Island.
24

Mean-field analysis of basal ganglia and thalamocortical dynamics

van Albada, Sacha Jennifer January 2009 (has links)
PhD / When modeling a system as complex as the brain, considerable simplifications are inevitable. The nature of these simplifications depends on the available experimental evidence, and the desired form of model predictions. A focus on the former often inspires models of networks of individual neurons, since properties of single cells are more easily measured than those of entire populations. However, if the goal is to describe the processes responsible for the electroencephalogram (EEG), such models can become unmanageable due to the large numbers of neurons involved. Mean-field models in which assemblies of neurons are represented by their average properties allow activity underlying the EEG to be captured in a tractable manner. The starting point of the results presented here is a recent physiologically-based mean-field model of the corticothalamic system, which includes populations of excitatory and inhibitory cortical neurons, and an excitatory population representing the thalamic relay nuclei, reciprocally connected with the cortex and the inhibitory thalamic reticular nucleus. The average firing rates of these populations depend nonlinearly on their membrane potentials, which are determined by afferent inputs after axonal propagation and dendritic and synaptic delays. It has been found that neuronal activity spreads in an approximately wavelike fashion across the cortex, which is modeled as a two-dimensional surface. On the basis of the literature, the EEG signal is assumed to be roughly proportional to the activity of cortical excitatory neurons, allowing physiological parameters to be extracted by inverse modeling of empirical EEG spectra. One objective of the present work is to characterize the statistical distributions of fitted model parameters in the healthy population. Variability of model parameters within and between individuals is assessed over time scales of minutes to more than a year, and compared with the variability of classical quantitative EEG (qEEG) parameters. These parameters are generally not normally distributed, and transformations toward the normal distribution are often used to facilitate statistical analysis. However, no single optimal transformation exists to render data distributions approximately normal. A uniformly applicable solution that not only yields data following the normal distribution as closely as possible, but also increases test-retest reliability, is described in Chapter 2. Specialized versions of this transformation have been known for some time in the statistical literature, but it has not previously found its way to the empirical sciences. Chapter 3 contains the study of intra-individual and inter-individual variability in model parameters, also providing a comparison of test-retest reliability with that of commonly used EEG spectral measures such as band powers and the frequency of the alpha peak. It is found that the combined model parameters provide a reliable characterization of an individual's EEG spectrum, where some parameters are more informative than others. Classical quantitative EEG measures are found to be somewhat more reproducible than model parameters. However, the latter have the advantage of providing direct connections with the underlying physiology. In addition, model parameters are complementary to classical measures in that they capture more information about spectral structure. Another conclusion from this work was that a few minutes of alert eyes-closed EEG already contain most of the individual variability likely to occur in this state on the scale of years. In Chapter 4, age trends in model parameters are investigated for a large sample of healthy subjects aged 6-86 years. Sex differences in parameter distributions and trends are considered in three age ranges, and related to the relevant literature. We also look at changes in inter-individual variance across age, and find that subjects are in many respects maximally different around adolescence. This study forms the basis for prospective comparisons with age trends in evoked response potentials (ERPs) and alpha peak morphology, besides providing a standard for the assessment of clinical data. It is the first study to report physiologically-based parameters for such a large sample of EEG data. The second main thrust of this work is toward incorporating the thalamocortical system and the basal ganglia in a unified framework. The basal ganglia are a group of gray matter structures reciprocally connected with the thalamus and cortex, both significantly influencing, and influenced by, their activity. Abnormalities in the basal ganglia are associated with various disorders, including schizophrenia, Huntington's disease, and Parkinson's disease. A model of the basal ganglia-thalamocortical system is presented in Chapter 5, and used to investigate changes in average firing rates often measured in parkinsonian patients and animal models of Parkinson's disease. Modeling results support the hypothesis that two pathways through the basal ganglia (the so-called direct and indirect pathways) are differentially affected by the dopamine depletion that is the hallmark of Parkinson's disease. However, alterations in other components of the system are also suggested by matching model predictions to experimental data. The dynamics of the model are explored in detail in Chapter 6. Electrophysiological aspects of Parkinson's disease include frequency reduction of the alpha peak, increased relative power at lower frequencies, and abnormal synchronized fluctuations in firing rates. It is shown that the same parameter variations that reproduce realistic changes in mean firing rates can also account for EEG frequency reduction by increasing the strength of the indirect pathway, which exerts an inhibitory effect on the cortex. Furthermore, even more strongly connected subcircuits in the indirect pathway can sustain limit cycle oscillations around 5 Hz, in accord with oscillations at this frequency often observed in tremulous patients. Additionally, oscillations around 20 Hz that are normally present in corticothalamic circuits can spread to the basal ganglia when both corticothalamic and indirect circuits have large gains. The model also accounts for changes in the responsiveness of the components of the basal ganglia-thalamocortical system, and increased synchronization upon dopamine depletion, which plausibly reflect the loss of specificity of neuronal signaling pathways in the parkinsonian basal ganglia. Thus, a parsimonious explanation is provided for many electrophysiological correlates of Parkinson's disease using a single set of parameter changes with respect to the healthy state. Overall, we conclude that mean-field models of brain electrophysiology possess a versatility that allows them to be usefully applied in a variety of scenarios. Such models allow information about underlying physiology to be extracted from the experimental EEG, complementing traditional measures that may be more statistically robust but do not provide a direct link with physiology. Furthermore, there is ample opportunity for future developments, extending the basic model to encompass different neuronal systems, connections, and mechanisms. The basal ganglia are an important addition, not only leading to unified explanations for many hitherto disparate phenomena, but also contributing to the validation of this form of modeling.
25

三維條件常態分配相容性的探討 / On the compatibility of three conditional normal distributions in three dimensions

何靉 Unknown Date (has links)
關於二維之變數,Arnold and Press (1989) 首先提出檢驗兩個條件分配是否滿足相容性的理論。本研究嘗試對n維之變數,探討n個條件分配滿足相容性的檢驗方式;並提出在三維聯合分配下,給定三個條件分配為常態(normal) 時,檢驗此三個條件分配滿足相容性的充分必要條件;最後,並推導出此三個條件分配滿足相容性時,其所對應的聯合機率密度函數之公式。若此三個條件分配其所對應的聯合機率密度函數進一步假設為常態時,檢驗其相容性的充分必要條件可更加以簡化。 / Arnold and Press (1989) first provide the theory about the compatibility of two conditional distributions in two dimensions. In this research, we extend the two dimensional cases to the high dimensional cases. In particular, we find the necessary and sufficient conditions of the compatibility of three conditional normal distributions in three dimensions. Furthermore, we also provide a formula to find the joint probability density function when three dimensional conditional normal distributions are compatible. Finally, simple sufficient and necessary conditions are also given when the joint distribution is further assumed to be normal.
26

Statistische Eigenschaften von Clusterverfahren / Statistical properties of cluster procedures

Schorsch, Andrea January 2008 (has links)
Die vorliegende Diplomarbeit beschäftigt sich mit zwei Aspekten der statistischen Eigenschaften von Clusterverfahren. Zum einen geht die Arbeit auf die Frage der Existenz von unterschiedlichen Clusteranalysemethoden zur Strukturfindung und deren unterschiedlichen Vorgehensweisen ein. Die Methode des Abstandes zwischen Mannigfaltigkeiten und die K-means Methode liefern ausgehend von gleichen Daten unterschiedliche Endclusterungen. Der zweite Teil dieser Arbeit beschäftigt sich näher mit den asymptotischen Eigenschaften des K-means Verfahrens. Hierbei ist die Menge der optimalen Clusterzentren konsistent. Bei Vergrößerung des Stichprobenumfangs gegen Unendlich konvergiert diese in Wahrscheinlichkeit gegen die Menge der Clusterzentren, die das Varianzkriterium minimiert. Ebenfalls konvergiert die Menge der optimalen Clusterzentren für n gegen Unendlich gegen eine Normalverteilung. Es hat sich dabei ergeben, dass die einzelnen Clusterzentren voneinander abhängen. / The following thesis describes two different views onto the statistical characterics of clustering procedures. At first it adresses the questions whether different clustering methods exist to ascertain the structure of clusters and in what ays the strategies of these methods differ from each other. The method of distance between the manifolds as well as the k-means method provide different final clusters based on equal initial data. The second part of the thesis concentrates on asymptotic properties of the k-means procedure. Here the amount of optimal clustering centres is consistent. If the size of the sample range is enlarged towards infinity, it also converges in probability towards the amount of clustering centres which minimized the whithin cluster sum of squares. Likewise the amount of optimal clustering centres converges for infinity towards the normal distribution. The main result shows that the individual clustering centres are dependent on each other.
27

The k-Sample Problem When k is Large and n Small

Zhan, Dongling 2012 May 1900 (has links)
The k-sample problem, i.e., testing whether two or more data sets come from the same population, is a classic one in statistics. Instead of having a small number of k groups of samples, this dissertation works on a large number of p groups of samples, where within each group, the sample size, n, is a fixed, small number. We call this as a "Large p, but Small n" setting. The primary goal of the research is to provide a test statistic based on kernel density estimation (KDE) that has an asymptotic normal distribution when p goes to infinity with n fixed. In this dissertation, we propose a test statistic called Tp(S) and its standardized version, T(S). By using T(S), we conduct our test based on the critical values of the standard normal distribution. Theoretically, we show that our test is invariant to a location and scale transformation of the data. We also find conditions under which our test is consistent. Simulation studies show that our test has good power against a variety of alternatives. The real data analyses show that our test finds differences between gene distributions that are not due simply to location.
28

A recursive formula for computing Taylor polynomial of quantile

Kuo, Chiu-huang 28 June 2004 (has links)
This paper presents a simple recursive formula to compute the Taylor polynomial of quantile for a continuous random variable. It is very easy to implement the formula in standard symbolic programming system, for example Mathematica (Wolfram, 2003). Applications of the formula to standard normal distribution and to the generation of random variables for continuous distribution with bounded support are illustrated.
29

Effect Of Estimation In Goodness-of-fit Tests

Eren, Emrah 01 September 2009 (has links) (PDF)
In statistical analysis, distributional assumptions are needed to apply parametric procedures. Assumptions about underlying distribution should be true for accurate statistical inferences. Goodness-of-fit tests are used for checking the validity of the distributional assumptions. To apply some of the goodness-of-fit tests, the unknown population parameters are estimated. The null distributions of test statistics become complicated or depend on the unknown parameters if population parameters are replaced by their estimators. This will restrict the use of the test. Goodness-of-fit statistics which are invariant to parameters can be used if the distribution under null hypothesis is a location-scale distribution. For location and scale invariant goodness-of-fit tests, there is no need to estimate the unknown population parameters. However, approximations are used in some of those tests. Different types of estimation and approximation techniques are used in this study to compute goodness-of-fit statistics for complete and censored samples from univariate distributions as well as complete samples from bivariate normal distribution. Simulated power properties of the goodness-of-fit tests against a broad range of skew and symmetric alternative distributions are examined to identify the estimation effects in goodness-of-fit tests. The main aim of this thesis is to modify goodness-of-fit tests by using different estimators or approximation techniques, and finally see the effect of estimation on the power of these tests.
30

Mean-field analysis of basal ganglia and thalamocortical dynamics

van Albada, Sacha Jennifer January 2009 (has links)
PhD / When modeling a system as complex as the brain, considerable simplifications are inevitable. The nature of these simplifications depends on the available experimental evidence, and the desired form of model predictions. A focus on the former often inspires models of networks of individual neurons, since properties of single cells are more easily measured than those of entire populations. However, if the goal is to describe the processes responsible for the electroencephalogram (EEG), such models can become unmanageable due to the large numbers of neurons involved. Mean-field models in which assemblies of neurons are represented by their average properties allow activity underlying the EEG to be captured in a tractable manner. The starting point of the results presented here is a recent physiologically-based mean-field model of the corticothalamic system, which includes populations of excitatory and inhibitory cortical neurons, and an excitatory population representing the thalamic relay nuclei, reciprocally connected with the cortex and the inhibitory thalamic reticular nucleus. The average firing rates of these populations depend nonlinearly on their membrane potentials, which are determined by afferent inputs after axonal propagation and dendritic and synaptic delays. It has been found that neuronal activity spreads in an approximately wavelike fashion across the cortex, which is modeled as a two-dimensional surface. On the basis of the literature, the EEG signal is assumed to be roughly proportional to the activity of cortical excitatory neurons, allowing physiological parameters to be extracted by inverse modeling of empirical EEG spectra. One objective of the present work is to characterize the statistical distributions of fitted model parameters in the healthy population. Variability of model parameters within and between individuals is assessed over time scales of minutes to more than a year, and compared with the variability of classical quantitative EEG (qEEG) parameters. These parameters are generally not normally distributed, and transformations toward the normal distribution are often used to facilitate statistical analysis. However, no single optimal transformation exists to render data distributions approximately normal. A uniformly applicable solution that not only yields data following the normal distribution as closely as possible, but also increases test-retest reliability, is described in Chapter 2. Specialized versions of this transformation have been known for some time in the statistical literature, but it has not previously found its way to the empirical sciences. Chapter 3 contains the study of intra-individual and inter-individual variability in model parameters, also providing a comparison of test-retest reliability with that of commonly used EEG spectral measures such as band powers and the frequency of the alpha peak. It is found that the combined model parameters provide a reliable characterization of an individual's EEG spectrum, where some parameters are more informative than others. Classical quantitative EEG measures are found to be somewhat more reproducible than model parameters. However, the latter have the advantage of providing direct connections with the underlying physiology. In addition, model parameters are complementary to classical measures in that they capture more information about spectral structure. Another conclusion from this work was that a few minutes of alert eyes-closed EEG already contain most of the individual variability likely to occur in this state on the scale of years. In Chapter 4, age trends in model parameters are investigated for a large sample of healthy subjects aged 6-86 years. Sex differences in parameter distributions and trends are considered in three age ranges, and related to the relevant literature. We also look at changes in inter-individual variance across age, and find that subjects are in many respects maximally different around adolescence. This study forms the basis for prospective comparisons with age trends in evoked response potentials (ERPs) and alpha peak morphology, besides providing a standard for the assessment of clinical data. It is the first study to report physiologically-based parameters for such a large sample of EEG data. The second main thrust of this work is toward incorporating the thalamocortical system and the basal ganglia in a unified framework. The basal ganglia are a group of gray matter structures reciprocally connected with the thalamus and cortex, both significantly influencing, and influenced by, their activity. Abnormalities in the basal ganglia are associated with various disorders, including schizophrenia, Huntington's disease, and Parkinson's disease. A model of the basal ganglia-thalamocortical system is presented in Chapter 5, and used to investigate changes in average firing rates often measured in parkinsonian patients and animal models of Parkinson's disease. Modeling results support the hypothesis that two pathways through the basal ganglia (the so-called direct and indirect pathways) are differentially affected by the dopamine depletion that is the hallmark of Parkinson's disease. However, alterations in other components of the system are also suggested by matching model predictions to experimental data. The dynamics of the model are explored in detail in Chapter 6. Electrophysiological aspects of Parkinson's disease include frequency reduction of the alpha peak, increased relative power at lower frequencies, and abnormal synchronized fluctuations in firing rates. It is shown that the same parameter variations that reproduce realistic changes in mean firing rates can also account for EEG frequency reduction by increasing the strength of the indirect pathway, which exerts an inhibitory effect on the cortex. Furthermore, even more strongly connected subcircuits in the indirect pathway can sustain limit cycle oscillations around 5 Hz, in accord with oscillations at this frequency often observed in tremulous patients. Additionally, oscillations around 20 Hz that are normally present in corticothalamic circuits can spread to the basal ganglia when both corticothalamic and indirect circuits have large gains. The model also accounts for changes in the responsiveness of the components of the basal ganglia-thalamocortical system, and increased synchronization upon dopamine depletion, which plausibly reflect the loss of specificity of neuronal signaling pathways in the parkinsonian basal ganglia. Thus, a parsimonious explanation is provided for many electrophysiological correlates of Parkinson's disease using a single set of parameter changes with respect to the healthy state. Overall, we conclude that mean-field models of brain electrophysiology possess a versatility that allows them to be usefully applied in a variety of scenarios. Such models allow information about underlying physiology to be extracted from the experimental EEG, complementing traditional measures that may be more statistically robust but do not provide a direct link with physiology. Furthermore, there is ample opportunity for future developments, extending the basic model to encompass different neuronal systems, connections, and mechanisms. The basal ganglia are an important addition, not only leading to unified explanations for many hitherto disparate phenomena, but also contributing to the validation of this form of modeling.

Page generated in 0.1521 seconds