1 |
Calculating One-sided P-value for TFisher Under Correlated DataFang, Jiadong 29 April 2018 (has links)
P-values combination procedure for multiple statistical tests is a common data analysis method in many applications including bioinformatics. However, this procedure is nontrivial when input P-values are dependent. For the Fisher€™s combination procedure, a classic method is the Brown€™s Strategy [1, Brown,1975], which is based empirical moment-matching of gamma distribution. In this project, we address a more general family of weighting-andtruncation p-value combination procedures called TFisher. We first study how to extend Brown€™s Strategy to this problem. Then we make further development in two directions. First, instead of using the empirical polynomial model-fitting strategy to find moments, we developed an analytical calculation strategy based on asymptotic approximation. Second, instead of using the gamma distribution to approximate the null distribution of TFisher, we propose to use a mixed gamma distribution or a shifted-mixed gamma distribution. We focus on calculating the one-sided p-value for TFisher, especially the soft-thresholding version of TFisher. Simulations show that our methods much improve the accuracy than the traditional strategy.
|
2 |
Statistical methods for genetic association studies: multi-cohort and rare genetic variants approachesChen, Han 23 September 2015 (has links)
Genetic association studies have successfully identified many genetic markers associated with complex human diseases and related quantitative traits. However, for most complex diseases and quantitative traits, all associated genetic markers identified to date only explain a small proportion of heritability. Thus, exploring the unexplained heritability in these traits will help us discover novel genetic determinants for these traits and better understand disease etiology and pathophysiology. Due to limited sample size, a single cohort study may not have sufficient power to identify novel genetic association with a small effect size, and meta-analysis approaches have been proposed and applied to combine results from multiple cohorts in large consortia, increasing the sample size and statistical power. Rare genetic variants and gene by environment interaction may both play a role in genetic association studies. In this dissertation, we develop statistical methods in meta-analysis, rare genetic variants analysis and gene by environment interaction analysis, conduct extensive simulation studies, and apply these methods in real data examples. First, we develop a method of moments estimator for the between-study covariance matrix in random effects model multivariate meta-analysis. Our estimator is the first such estimator in matrix form, and holds the invariance property to linear transformations. It has similar performance with existing methods in simulation studies and real data analysis. Next, we extend the Sequence Kernel Association Test (SKAT), a rare genetic variants analysis approach for unrelated individuals, to be applicable in family samples for quantitative traits. The extension is necessary, as the original test has inflated type I error when directly applied to related individuals, and selecting an unrelated subset from family samples reduces the sample size and power. Finally, we derive methods for rare genetic variants analysis in detecting gene by environment interaction on quantitative traits, in the context of univariate test on the interaction term parameter. We develop statistical tests in the settings of both burden test and SKAT, for both unrelated and related individuals. Our methods are relevant to genetic association studies, and we hope that they can facilitate research in this field and beyond.
|
Page generated in 0.085 seconds