Since the completion of the sequencing of the human genome at the turn of the century, genomics has revolutionized the study of biology and medicine by providing high-throughput and quantitative methods for measuring molecular activities. Microarray and next generation sequencing emerged as important inflection points where the rate of data generation skyrocketed. The high dimensionality nature and the rapid growth in the volume of data precipitated a unique computational challenge in massive data analysis and interpretation. Noise and signal structure in the data varies significantly across types of data and technologies; thus, the context of the data generation process itself plays an important role in detecting key and oftentimes subtle signals. In this dissertation, we discuss four areas where contextualizing the data aids discoveries of disease-causing variants, complex relationships in the human microecology, interplay between gene and environment, and genetic regulation of gene expression. These studies, each in its own unique way, have helped made possible discoveries and expanded the horizon of our understanding of the human body, in health and disease.
Identifer | oai:union.ndltd.org:harvard.edu/oai:dash.harvard.edu:1/12274187 |
Date | 06 June 2014 |
Creators | Sathirapongsasuti, Jarupon Fah |
Contributors | Quackenbush, John |
Publisher | Harvard University |
Source Sets | Harvard University |
Language | en_US |
Detected Language | English |
Type | Thesis or Dissertation |
Rights | open |
Page generated in 0.0018 seconds