• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 2
  • Tagged with
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

Probabilistic Models for the Analysis of Gene Expression Profiles

Quon, Gerald 16 August 2013 (has links)
Gene expression profiles are some of the most abundant sources of data about the cellular state of a collection of cells in an organism. Comparison of the expression profiles of multiple samples allows biologists to find associations between observations at the molecular level and the phenotype of the samples. A key challenge is to distinguish variation in expression due to biological factors of interest from variation due to confounding factors that can arise for unrelated technical or biological reasons. This thesis presents models that can explicitly adjust the comparison of expression profiles to account for specific types of confounding factors. One such confounding factor arises when comparing tissue-specific expression profiles across multiple organisms to identify differences in expression that are indicative of changes in gene function. When the organisms are separated by long evolutionary distances, tissue functions may be re-distributed and introduce expression changes unrelated to changes in gene function. We developed Brownian Factor Phylogenetic Analysis, a model that can account for such re-distribution of function, and demonstrate that removing this confounding factor improves tasks such as predicting gene function. Another confounding factor arises because current protocols for expression profiling require RNA extracts from multiple cells. Often biological samples are heterogeneous mixtures of multiple cell types, so the measured expression profile is an average of the RNA levels of the constituent cells. When the biological sample contains both cells of interest and nuisance cells, the confounding expression from the nuisance cells can mask the expression of the cells of interest. We developed ISOLATE and ISOpure, two models for addressing the heterogeneity of tumor samples. We demonstrated that modeling tumor heterogeneity leads to an improvement in two tasks: identifying the site of origin of metastatic tumors, and predicting the risk of death of lung cancer patients.
2

Probabilistic Models for the Analysis of Gene Expression Profiles

Quon, Gerald 16 August 2013 (has links)
Gene expression profiles are some of the most abundant sources of data about the cellular state of a collection of cells in an organism. Comparison of the expression profiles of multiple samples allows biologists to find associations between observations at the molecular level and the phenotype of the samples. A key challenge is to distinguish variation in expression due to biological factors of interest from variation due to confounding factors that can arise for unrelated technical or biological reasons. This thesis presents models that can explicitly adjust the comparison of expression profiles to account for specific types of confounding factors. One such confounding factor arises when comparing tissue-specific expression profiles across multiple organisms to identify differences in expression that are indicative of changes in gene function. When the organisms are separated by long evolutionary distances, tissue functions may be re-distributed and introduce expression changes unrelated to changes in gene function. We developed Brownian Factor Phylogenetic Analysis, a model that can account for such re-distribution of function, and demonstrate that removing this confounding factor improves tasks such as predicting gene function. Another confounding factor arises because current protocols for expression profiling require RNA extracts from multiple cells. Often biological samples are heterogeneous mixtures of multiple cell types, so the measured expression profile is an average of the RNA levels of the constituent cells. When the biological sample contains both cells of interest and nuisance cells, the confounding expression from the nuisance cells can mask the expression of the cells of interest. We developed ISOLATE and ISOpure, two models for addressing the heterogeneity of tumor samples. We demonstrated that modeling tumor heterogeneity leads to an improvement in two tasks: identifying the site of origin of metastatic tumors, and predicting the risk of death of lung cancer patients.

Page generated in 0.1331 seconds