Return to search

Bioinformatic analyses of microarray experiments on genetic control of gene expression level

The advent of microarray technology, allowing measurement of gene expression levels for thousands of genes in parallel, has made possible experiments designed to investigate the genetic control of variation in gene expression level (described in the literature as ???genetical genomics??? or ???eQTL??? experiments). Published results from these studies, in yeast and in mice, show that genetic variation is an important factor in gene regulation, and furthermore that individual polymorphisms modify the expression level of many genes. The concern of this thesis is the bioinformatic analyses of the expression level and genotype data sets that are the raw material for these studies. In particular this thesis addresses the two issues of detection of artefactual effects, and maximizing the information that can be extracted from the data. It is shown that while a polymorphism affecting the expression of many genes may be readily detected, care must be taken to determine whether the detected effect is genuinely one of genetic control of expression level, rather than the effect of correlations in measured expression level not of genetic cause. A significance test is devised to distinguish between these cases. The detection of artefactual correlation is explored further in the reanalysis of the published data from a large yeast study. A critique is given of the permutation method used to ascribe genetic control as the cause of inter gene expression level correlation. The presence of some degree of artefactual correlation is shown, and novel methods are presented for identifying such artefacts. To extend the analyses that may be applied to eQTL data, an algorithm is presented for determining secondary eQTLs for gene expression level (as opposed to a single primary QTL), along with a significance test for the putative QTL found. The technique is demonstrated on a large public data set. In addition to the use for which they are intended, the data sets generated for eQTL studies provide opportunities for additional analyses. In this thesis a method is developed for calculating a genome wide map of meiotic recombination frequency from the genotype data for multiple segregant strains. The method is demonstrated on the published genotype data generated for a large yeast eQTL study.

Identiferoai:union.ndltd.org:ADTP/187157
Date January 2006
CreatorsKirk, Michael, School of Biotechnology & Biomolecular Science, UNSW
PublisherAwarded by:University of New South Wales. School of Biotechnology and Biomolecular Science
Source SetsAustraliasian Digital Theses Program
LanguageEnglish
Detected LanguageEnglish
RightsCopyright Michael Kirk, http://unsworks.unsw.edu.au/copyright

Page generated in 0.0019 seconds