Spelling suggestions: "subject:"genome wide association"" "subject:"fenome wide association""
131 |
Weedy rice (Oryza sativa ssp.): an untapped genetic resource for abiotic stress tolerant traits for rice improvementStallworth, Shandrea D. 06 August 2021 (has links)
Rice (Oryza sativa) is the staple food for more than 3.5 billion people worldwide. As the population continues to grow, rice yield will need to increase by 1% every year for the next 30 years to keep up with the growth. In the US, Arkansas accounts for more than 50% of rice production. Over the last 68 years, rice production has continued to grow in Mississippi, placing it in fourth place after Arkansas, Louisiana, and California. Due to increasing rice acreage, regionally and worldwide, the need to develop abiotic stress-tolerant rice has increased. Unfortunately, current rice breeding programs lack genetic diversity, and many traits have been lost through the domestication of cultivated rice. Currently, stressors stemming from the continued effects of climate change continue to impact rice. To counteract the impacts of climate change, research has shifted to evaluating wild and weedy relatives of rice to improve breeding techniques. Weedy rice (Oryza sativa ssp.) is a genetically similar, noxious weed in rice with increased competitive ability. Studies have demonstrated that weedy rice has increased genetic variability and inherent tolerance to abiotic stressors. The aims of this study were to 1) screen a weedy rice mini-germplasm for tolerance to cold, heat, and complete submergence-stress, 2) utilize simple sequence repeat (SSR) markers and single nucleotide polymorphisms to evaluate the genetic diversity of the weedy rice population, and 3) use genome-wide association (GWAS) to identify SNPs associated with candidate genes within the population.
|
132 |
Molecular Analysis of Host Resistance and Pathogenicity of Rice Blast in East Africa.Mgonja, Emmanuel Mohamed January 2016 (has links)
No description available.
|
133 |
A SNP Microarray Analysis Pipeline Using Machine Learning TechniquesEvans, Daniel T. January 2010 (has links)
No description available.
|
134 |
Linear Mixed Effects Model for a Longitudinal Genome Wide Association Study of Lipid Measures in Type 1 DiabetesWang, Tao 10 1900 (has links)
<p>Hypercholesterolemia is the presence of high levels of cholesterol in the blood, and it is one of the major factors for the development of long-term complications in T1D patients.</p> <p>In the thesis, we studied 1303 Caucasians with type 1 diabetes in the Diabetes Control and Complications Trial (DCCT). With the experience of diabetes study, many factors are associated with diabetes complications, they are age, gender, cohort, treatment, diabetes duration, body mass index (BMI), exercise, insulin dose, etc. We mainly focus on which factors are associated with total cholesterol (CHL) analysis in the thesis.</p> <p>Many measures were collected monthly, quarterly or yearly for average 6.5 years from 1983 to 1993. We used annually lipid measures of DCCT because of their values are sufficient and complete, and they belong to longitudinal data.</p> <p>Different methods are discussed in the study, and linear mixed effect models are the appropriate approach to the study. The details of model selection with CHL model analysis are shown, which includes fixed effect selection, random effects selection, and residual correlation structure selection. Then the SNPs were added on three models individually in GWAS. We found locus (rs7412) is not only genome-wide associated with CHL, but also genome-wide associated with LDL.</p> <p>We will assess whether these SNPs are diabetes-specific in the future, and we will add dietary data in the three models to identify locus are associated with the interaction of diet and SNPs.</p> / Master of Science (MSc)
|
135 |
Genomic Prediction and Genetic Dissection of Yield-Related Traits in Soft Red Winter WheatWard, Brian Phillip 02 May 2017 (has links)
In multiple species, genome-wide association (GWA) studies have become an increasingly prevalent method of identifying the quantitative trait loci (QTLs) that underlie complex traits. Despite this, relatively few GWA analyses using high-density genomic markers have been carried out on highly quantitative traits in wheat. We utilized single-nucleotide polymorphism (SNP) data generated via a genotyping-by-sequencing (GBS) protocol to perform GWA on multiple yield-related traits using a panel of 329 soft red winter wheat genotypes grown in four environments. In addition, the SNP data was used to examine linkage disequilibrium and population structure within the testing panel. The results indicated that an alien translocation from the species Triticum timopheevii was responsible for the majority of observed population structure. In addition, a total of 50 significant marker-trait associations were identified. However, a subsequent study cast some doubt upon the reproducibility and reliability of plant QTLs identified via GWA analyses. We used two highly-related panels of different genotypes grown in different sets of environments to attempt to identify highly stable QTLs. No QTLs were shared across panels for any trait, suggesting that QTL-by-environment and QTL-by-genetic background interaction effects are significant, even when testing across many environments. In light of the challenges involved in QTL mapping, prediction of phenotypes using whole-genome marker data is an attractive alternative. However, many evaluations of genomic prediction in crop species have utilized univariate models adapted from animal breeding. These models cannot directly account for genotype-by-environment interaction, and hence are often not suitable for use with lower-heritability traits assessed in multiple environments. We sought to test genomic prediction models capable of more ad-hoc analyses, utilizing highly unbalanced experimental designs consisting of individuals with varying degrees of relatedness. The results suggest that these designs can successfully be used to generate reasonably accurate phenotypic predictions. In addition, multivariate models can dramatically increase predictive accuracy for some traits, though this depends upon the quantity and characteristics of genotype-by-environment interaction. / Ph. D. / Quantitative traits are those traits that can display a wide range of variability within a population of individuals. These traits are influenced by the interaction of many different genes, and are also influenced by the environment to varying degrees. Traditionally, geneticists who studied quantitative traits had to rely on statistical models, while the biological causes of variation in the expression of these traits remained largely unknown. However, the advent of DNA marker technology granted geneticists the ability to identify specific regions of the genome that highly influence quantitative traits. Many studies have since attempted to find these <i>quantitative trait loci</i> (QTLs) across a wide range of traits and species. However, we are faced with something of a paradox when we attempt to find QTLs. Theory tells us that an idealized, truly quantitative trait arises due to the effects of many genes, each with an infinitesimal effect on the trait in question. Therefore, the more quantitative a trait, the fewer QTLs we should expect to find. In addition, QTLs may not be reliable, due to the effects of different environments and different genetic backgrounds within a population. A more recent trend involves using all available marker data simultaneously to predict a particular line’s performance. This method entails ignoring the genomic underpinnings of a trait, and instead focusing solely on its expression, much like classical quantitative genetics. The obvious downside of this method is that it cannot be used to increase our understanding of what is giving rise to the variations in the trait’s expression that we observe. The studies described in this dissertation were designed to 1) test whether we could identify QTLs for highly quantitative yield-related traits in winter wheat, 2) test the reliability of identified QTLs, and 3) use the DNA marker data to instead generate predictions of line performance. The results indicate that while we can identify QTLs for highly quantitative traits in winter wheat, these QTLs may not be very reliable. Therefore, predictive models may be a good alternative to identifying QTLs, and these methods can be readily implemented within breeding programs.
|
136 |
Enrichment of inflammatory bowel disease and colorectal cancer risk variants in colon expression quantitative trait lociHulur, Imge, Gamazon, Eric R., Skol, Andrew D., Xicola, Rosa M., Llor, Xavier, Onel, Kenan, Ellis, Nathan A., Kupfer, Sonia S. January 2015 (has links)
BACKGROUND: Genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) associated with diseases of the colon including inflammatory bowel diseases (IBD) and colorectal cancer (CRC). However, the functional role of many of these SNPs is largely unknown and tissue-specific resources are lacking. Expression quantitative trait loci (eQTL) mapping identifies target genes of disease-associated SNPs. This study provides a comprehensive eQTL map of distal colonic samples obtained from 40 healthy African Americans and demonstrates their relevance for GWAS of colonic diseases. RESULTS: 8.4 million imputed SNPs were tested for their associations with 16,252 expression probes representing 12,363 unique genes. 1,941 significant cis-eQTL, corresponding to 122 independent signals, were identified at a false discovery rate (FDR) of 0.01. Overall, among colon cis-eQTL, there was significant enrichment for GWAS variants for IBD (Crohn's disease [CD] and ulcerative colitis [UC]) and CRC as well as type 2 diabetes and body mass index. ERAP2, ADCY3, INPP5E, UBA7, SFMBT1, NXPE1 and REXO2 were identified as target genes for IBD-associated variants. The CRC-associated eQTL rs3802842 was associated with the expression of C11orf93 (COLCA2). Enrichment of colon eQTL near transcription start sites and for active histone marks was demonstrated, and eQTL with high population differentiation were identified. CONCLUSIONS: Through the comprehensive study of eQTL in the human colon, this study identified novel target genes for IBD- and CRC-associated genetic variants. Moreover, bioinformatic characterization of colon eQTL provides a tissue-specific tool to improve understanding of biological differences in diseases between different ethnic groups.
|
137 |
Mechanisms of Type 2 diabetes susceptibilityTravers, Mary E. January 2013 (has links)
Type 2 diabetes (T2D) has a genetic component which is only partially understood. The majority of genetic variance in disease susceptibility is unaccounted for, whilst the precise transcripts and molecular mechanisms through which most risk variants exert their effect is unclear. A complete understanding of T2D susceptibility mechanisms could have benefits in risk prediction, and in drug discovery through the identification of novel therapeutic targets. Work presented in this thesis aims to define relevant transcripts and disease mechanisms at known susceptibility loci, and to identify disease association with classes of genetic variation other than common single nucleotide polymorphisms (SNPs). KCNQ1 contains intronic variants associated with T2D susceptibility and β-cell dysfunction, but only maternally-inherited alleles confer increased disease risk. It maps within an imprinted domain with an established role in congenital and islet-specific growth phenotypes. Using human adult islet and foetal pancreas samples, I refined the transcripts and developmental stage at which T2D susceptibility must be conferred by demonstrating developmentally plastic monoallelic and biallelic expression. I identified a potential risk mechanism through the effect of T2D risk alleles upon DNA methylation. The disease-associated regions identified through genome-wide association (GWA) studies often contain multiple transcripts. I performed mRNA expression profiling of genes within loci associated with raised proinsulin/insulin ratios in human islets and metabolically relevant tissues. Some genes (notably CT62) were not expressed and therefore excluded from consideration for a risk effect, whilst others (for example C2CD4A) were highlighted as good regional candidates due to specific expression in relevant tissues. GWA studies for T2D risk have focused predominantly upon common single nucleotide polymorphisms. As part of a consortium conducing GWA analysis for copy number variation (CNV) and T2D risk, I optimised and compared alternative methods of CNV genotyping, before using this information to validate two signals of disease association. I genotyped three rare single nucleotide variants emerging from an association study with T2D risk based on imputed data, providing an indication of imputation accuracy and more powerful disease association analysis. These data underscore the challenge of translating association signals to causal mechanisms, and of identifying alternative forms of genomic variation which contribute to T2D risk. My work highlights candidates for functional analysis around proinsulin-associated loci, and makes significant progress towards uncovering risk mechanisms at the KCNQ1 locus.
|
138 |
Statistical Inference for Propagation Processes on Complex NetworksManitz, Juliane 12 June 2014 (has links)
Die Methoden der Netzwerktheorie erfreuen sich wachsender Beliebtheit, da sie die Darstellung von komplexen Systemen durch Netzwerke erlauben. Diese werden nur mit einer Menge von Knoten erfasst, die durch Kanten verbunden werden. Derzeit verfügbare Methoden beschränken sich hauptsächlich auf die deskriptive Analyse der Netzwerkstruktur. In der hier vorliegenden Arbeit werden verschiedene Ansätze für die Inferenz über Prozessen in komplexen Netzwerken vorgestellt. Diese Prozesse beeinflussen messbare Größen in Netzwerkknoten und werden durch eine Menge von Zufallszahlen beschrieben. Alle vorgestellten Methoden sind durch praktische Anwendungen motiviert, wie die Übertragung von Lebensmittelinfektionen, die Verbreitung von Zugverspätungen, oder auch die Regulierung von genetischen Effekten. Zunächst wird ein allgemeines dynamisches Metapopulationsmodell für die Verbreitung von Lebensmittelinfektionen vorgestellt, welches die lokalen Infektionsdynamiken mit den netzwerkbasierten Transportwegen von kontaminierten Lebensmitteln zusammenführt. Dieses Modell ermöglicht die effiziente Simulationen verschiedener realistischer Lebensmittelinfektionsepidemien. Zweitens wird ein explorativer Ansatz zur Ursprungsbestimmung von Verbreitungsprozessen entwickelt. Auf Grundlage einer netzwerkbasierten Redefinition der geodätischen Distanz können komplexe Verbreitungsmuster in ein systematisches, kreisrundes Ausbreitungsschema projiziert werden. Dies gilt genau dann, wenn der Ursprungsnetzwerkknoten als Bezugspunkt gewählt wird. Die Methode wird erfolgreich auf den EHEC/HUS Epidemie 2011 in Deutschland angewandt. Die Ergebnisse legen nahe, dass die Methode die aufwändigen Standarduntersuchungen bei Lebensmittelinfektionsepidemien sinnvoll ergänzen kann. Zudem kann dieser explorative Ansatz zur Identifikation von Ursprungsverspätungen in Transportnetzwerken angewandt werden. Die Ergebnisse von umfangreichen Simulationsstudien mit verschiedenstensten Übertragungsmechanismen lassen auf eine allgemeine Anwendbarkeit des Ansatzes bei der Ursprungsbestimmung von Verbreitungsprozessen in vielfältigen Bereichen hoffen. Schließlich wird gezeigt, dass kernelbasierte Methoden eine Alternative für die statistische Analyse von Prozessen in Netzwerken darstellen können. Es wurde ein netzwerkbasierter Kern für den logistischen Kernel Machine Test entwickelt, welcher die nahtlose Integration von biologischem Wissen in die Analyse von Daten aus genomweiten Assoziationsstudien erlaubt. Die Methode wird erfolgreich bei der Analyse genetischer Ursachen für rheumatische Arthritis und Lungenkrebs getestet. Zusammenfassend machen die Ergebnisse der vorgestellten Methoden deutlich, dass die Netzwerk-theoretische Analyse von Verbreitungsprozessen einen wesentlichen Beitrag zur Beantwortung verschiedenster Fragestellungen in unterschiedlichen Anwendungen liefern kann.
|
139 |
Bayesian and frequentist methods and analyses of genome-wide association studiesVukcevic, Damjan January 2009 (has links)
Recent technological advances and remarkable successes have led to genome-wide association studies (GWAS) becoming a tool of choice for investigating the genetic basis of common complex human diseases. These studies typically involve samples from thousands of individuals, scanning their DNA at up to a million loci along the genome to discover genetic variants that affect disease risk. Hundreds of such variants are now known for common diseases, nearly all discovered by GWAS over the last three years. As a result, many new studies are planned for the future or are already underway. In this thesis, I present analysis results from actual studies and some developments in theory and methodology. The Wellcome Trust Case Control Consortium (WTCCC) published one of the first large-scale GWAS in 2007. I describe my contribution to this study and present the results from some of my follow-up analyses. I also present results from a GWAS of a bipolar disorder sub-phenotype, and a recent and on-going fine mapping experiment. Building on methods developed as part of the WTCCC, I describe a Bayesian approach to GWAS analysis and compare it to widely used frequentist approaches. I do so both theoretically, by interpreting each approach from the perspective of the other, and empirically, by comparing their performance in the context of replicated GWAS findings. I discuss the implications of these comparisons on the interpretation and analysis of GWAS generally, highlighting the advantages of the Bayesian approach. Finally, I examine the effect of linkage disequilibrium on the detection and estimation of various types of genetic effects, particularly non-additive effects. I derive a theoretical result showing how the power to detect a departure from an additive model at a marker locus decays faster than the power to detect an association.
|
140 |
On genetic variants underlying common diseaseHechter, Eliana January 2011 (has links)
Genome-wide association studies (GWAS) exploit the correlation in ge- netic diversity along chromosomes in order to detect effects on disease risk without having to type causal loci directly. The inevitable downside of this approach is that, when the correlation between the marker and the causal variant is imperfect, the risk associated with carrying the predisposing allele is diluted and its effect is underestimated. This thesis explores four different facets of this risk dilution: (1) estimating true effect sizes from those observed in GWAS; (2) asking how the context of a GWAS, including the population studied, the genotyping chip employed, and the use of im- putation, affects risk estimates; (3) assessing how often the best-associated SNP in a GWAS coincides with the causal variant; and (4) quantifying how departures from the simplest disease risk model at a causal variant distort the observed disease risk model. Using simulations, where we have information about the true risk at the causal locus, we show that the correlation between the marker and the causal variant is the primary driver of effect size underestimation. The extent of the underestimation depends on a number of factors, including the population in which the study is conducted, the genotyping chip employed, whether imputation is used, and the strength, frequency, and disease model of the risk allele. Suppose that a GWAS study is conducted in a European population, with an Affymetrix 6.0 genotyping chip, without imputation, and that the causal loci have a modest effect on disease risk, are common in the population, and follow an additive disease risk model. In such a study, we show that the risk estimated from the most associated SNP is very close to the truth approximately two-thirds of the time (although we predict that fine mapping of GWAS loci will infrequently identify causal variants with considerably higher risk), and that the best-associated variant is very often perfectly or nearly-perfectly correlated with, and almost always within 0.1cM of, the causal variant. However, the strong correlations among nearby loci mean that the causal and best-associated variants coincide infrequently, less than one-fifth of the time, even if the causal variant is genotyped. We explore ways in which these results change quantitatively depending on the parameters of the GWAS study. Additionally, we demonstrate that we expect to identify substantial deviations from the additive disease risk model among loci where association is detected, even though power to detect departures from the model drops off very quickly as the correlation between the marker and causal loci decreases. Finally, we discuss the implications of our results for the design and interpretation of future GWAS studies.
|
Page generated in 0.1345 seconds