• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • No language data
  • Tagged with
  • 2
  • 2
  • 2
  • 2
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

Mechanisms of genome regulation in human islets and their role in the pathogenesis of type 2 diabetes

van de Bunt, Gerrit Martinus January 2014 (has links)
Genome-wide association studies (GWAS) have made substantial progress in implicating genomic regions in type 2 diabetes (T2D) susceptibility. Whilst attributing causal mechanisms to loci has proved non trivial, these studies have provided insights into the genetic architecture underlying the disease. GWAS findings indicate a causal role for gene regulatory processes, and suggest that pancreatic beta-cells play a pivotal role in mediating common T2D association. Work presented in this thesis therefore sought to generate novel regulatory annotations from human islets, and to assess whether T2D-associated loci can be accurately fine-mapped using statistical approaches, with the aim of improving understanding of causal mechanisms underlying these associations through integration of the two approaches. Using small RNA sequencing in human islets and enriched beta-cell populations (both n=3) and mRNA sequencing in a large number of human islets (n=130), I increased the number of available human islet annotations. These studies identified high or islet-specific expression in many micro RNAs (miRNAs) without previously known roles in human islets. It also provided the largest study of quantitative trait loci (eQTLs) and allele-specific expression (ASE) in human islets to date, identifying significant eQTLs for 1,636 genes and significant ASE at 8,754 genes. There was enrichment of active islet chromatin, compared to other tissues, at the best eQTL variant for each gene, but also substantial sharing of significant eQTLs between islets and other tissues. Simulations were used to assess the utility of fine-mapping approaches for refining common disease-associated loci to smaller intervals or sets of variants likely to include the causal variant. The results demonstrated that fine-mapping can indeed refine these loci to sets or intervals of a size more amenable to functional follow-up or focussed intersection with high quality annotations. Furthermore, using an approximated Bayesian approach, I was able to refine twenty-one of the known common T2D-associated loci. Finally, using the newly generated annotations, I demonstrated enrichment of T2D association signal for regulatory RNA annotations (islet lncRNAs and miRNA target gene sets). I also identified examples in which these types of annotation overlap common and rare variation suggestive of a role in T2D pathogenesis. Using further islet annotations, I also uncovered potential causal mechanisms at four of the twentyone fine-mapped common T2D loci. These data therefore provide many novel islet regulatory annotations that can be intersected with T2D genetics, and provide a first example of how such an approach can lead to novel potential causal mechanisms underlying association loci.
2

Myson Burch Thesis

Myson C Burch (16637289) 08 August 2023 (has links)
<p>With the completion of the Human Genome Project and many additional efforts since, there is an abundance of genetic data that can be leveraged to revolutionize healthcare. Now, there are significant efforts to develop state-of-the-art techniques that reveal insights about connections between genetics and complex diseases such as diabetes, heart disease, or common psychiatric conditions that depend on multiple genes interacting with environmental factors. These methods help pave the way towards diagnosis, cure, and ultimately prediction and prevention of complex disorders. As a part of this effort, we address high dimensional genomics-related questions through mathematical modeling, statistical methodologies, combinatorics and scalable algorithms. More specifically, we develop innovative techniques at the intersection of technology and life sciences using biobank scale data from genome-wide association studies (GWAS) and machine learning as an effort to better understand human health and disease. <br> <br> The underlying principle behind Genome Wide Association Studies (GWAS) is a test for association between genotyped variants for each individual and the trait of interest. GWAS have been extensively used to estimate the signed effects of trait-associated alleles, mapping genes to disorders and over the past decade about 10,000 strong associations between genetic variants and one (or more) complex traits have been reported. One of the key challenges in GWAS is population stratification which can lead to spurious genotype-trait associations. Our work proposes a simple clustering-based approach to correct for stratification better than existing methods. This method takes into account the linkage disequilibrium (LD) while computing the distance between the individuals in a sample. Our approach, called CluStrat, performs Agglomerative Hierarchical Clustering (AHC) using a regularized Mahalanobis distance-based GRM, which captures the population-level covariance (LD) matrix for the available genotype data.<br> <br> Linear mixed models (LMMs) have been a popular and powerful method when conducting genome-wide association studies (GWAS) in the presence of population structure. LMMs are computationally expensive relative to simpler techniques. We implement matrix sketching in LMMs (MaSk-LMM) to mitigate the more expensive computations. Matrix sketching is an approximation technique where random projections are applied to compress the original dataset into one that is significantly smaller and still preserves some of the properties of the original dataset up to some guaranteed approximation ratio. This technique naturally applies to problems in genetics where we can treat large biobanks as a matrix with the rows representing samples and columns representing SNPs. These matrices will be very large due to the large number of individuals and markers in biobanks and can benefit from matrix sketching. Our approach tackles the bottleneck of LMMs directly by using sketching on the samples of the genotype matrix as well as sketching on the markers during the computation of the relatedness or kinship matrix (GRM). <br> <br> Predictive analytics have been used to improve healthcare by reinforcing decision-making, enhancing patient outcomes, and providing relief for the healthcare system. These methods help pave the way towards diagnosis, cure, and ultimately prediction and prevention of complex disorders. The prevalence of these complex diseases varies greatly around the world. Understanding the basis of this prevalence difference can help disentangle the interaction among different factors causing complex disorders and identify groups of people who may be at a greater risk of developing certain disorders. This could become the basis of the implementation of early intervention strategies for populations at higher risk with significant benefits for public health.<br> <br> This dissertation broadens our understanding of empirical population genetics. It proposes a data-driven perspective to a variety of problems in genetics such as confounding factors in genetic structure. This dissertation highlights current computational barriers in open problems in genetics and provides robust, scalable and efficient methods to ease the analysis of genotype data.</p>

Page generated in 0.0959 seconds