Spelling suggestions: "subject:"biunctional enrichment analysis"" "subject:"5functional enrichment analysis""
1 |
Knowledge Based Gene Set analysis (KB-GSA) : A novel method for gene expression analysisJadhav, Trishul January 2010 (has links)
Microarray technology allows measurement of the expression levels of thousand of genes simultaneously. Several gene set analysis (GSA) methods are widely used for extracting useful information from microarrays, for example identifying differentially expressed pathways associated with a particular biological process or disease phenotype. Though GSA methods like Gene Set Enrichment Analysis (GSEA) are widely used for pathway analysis, these methods are solely based on statistics. Such methods can be awkward to use if knowledge of specific pathways involved in particular biological processes are the aim of the study. Here we present a novel method (Knowledge Based Gene Set Analysis: KB-GSA) which integrates knowledge about user-selected pathways that are known to be involved in specific biological processes. The method generates an easy to understand graphical visualization of the changes in expression of the genes, complemented with some common statistics about the pathway of particular interest.
|
2 |
Comparisons of statistical modeling for constructing gene regulatory networksChen, Xiaohui 11 1900 (has links)
Genetic regulatory networks are of great importance in terms of scientific interests and practical medical importance. Since a number of high-throughput
measurement devices are available, such as microarrays and
sequencing techniques, regulatory networks have been intensively studied
over the last decade. Based on these high-throughput data sets, statistical interpretations of these billions of bits are crucial for biologist to extract meaningful results. In this thesis, we compare a variety of existing
regression models and apply them to construct regulatory networks which
span trancription factors and microRNAs. We also propose an extended
algorithm to address the local optimum issue in finding the Maximum A
Posterjorj estimator. An E. coli mRNA expression microarray data set with
known bona fide interactions is used to evaluate our models and we show
that our regression networks with a properly chosen prior can perform comparably
to the state-of-the-art regulatory network construction algorithm.
Finally, we apply our models on a p53-related data set, NCI-60 data. By
further incorporating available prior structural information from sequencing
data, we identify several significantly enriched interactions with cell proliferation
function. In both of the two data sets, we select specific examples
to show that many regulatory interactions can be confirmed by previous
studies or functional enrichment analysis. Through comparing statistical
models, we conclude from the project that combining different models with
over-representation analysis and prior structural information can improve
the quality of prediction and facilitate biological interpretation.
Keywords: regulatory network, variable selection, penalized maximum
likelihood estimation, optimization, functional enrichment analysis.
|
3 |
Knowledge Based Gene Set analysis (KB-GSA) : A novel method for gene expression analysisJadhav, Trishul January 2010 (has links)
<p>Microarray technology allows measurement of the expression levels of thousand of genes simultaneously. Several gene set analysis (GSA) methods are widely used for extracting useful information from microarrays, for example identifying differentially expressed pathways associated with a particular biological process or disease phenotype. Though GSA methods like Gene Set Enrichment Analysis (GSEA) are widely used for pathway analysis, these methods are solely based on statistics. Such methods can be awkward to use if knowledge of specific pathways involved in particular biological processes are the aim of the study. Here we present a novel method <strong><em>(Knowledge Based Gene Set Analysis: KB-GSA</em></strong>) which integrates knowledge about user-selected pathways that are known to be involved in specific biological processes. The method generates an easy to understand graphical visualization of the changes in expression of the genes, complemented with some common statistics about the pathway of particular interest.</p>
|
4 |
Comparisons of statistical modeling for constructing gene regulatory networksChen, Xiaohui 11 1900 (has links)
Genetic regulatory networks are of great importance in terms of scientific interests and practical medical importance. Since a number of high-throughput
measurement devices are available, such as microarrays and
sequencing techniques, regulatory networks have been intensively studied
over the last decade. Based on these high-throughput data sets, statistical interpretations of these billions of bits are crucial for biologist to extract meaningful results. In this thesis, we compare a variety of existing
regression models and apply them to construct regulatory networks which
span trancription factors and microRNAs. We also propose an extended
algorithm to address the local optimum issue in finding the Maximum A
Posterjorj estimator. An E. coli mRNA expression microarray data set with
known bona fide interactions is used to evaluate our models and we show
that our regression networks with a properly chosen prior can perform comparably
to the state-of-the-art regulatory network construction algorithm.
Finally, we apply our models on a p53-related data set, NCI-60 data. By
further incorporating available prior structural information from sequencing
data, we identify several significantly enriched interactions with cell proliferation
function. In both of the two data sets, we select specific examples
to show that many regulatory interactions can be confirmed by previous
studies or functional enrichment analysis. Through comparing statistical
models, we conclude from the project that combining different models with
over-representation analysis and prior structural information can improve
the quality of prediction and facilitate biological interpretation.
Keywords: regulatory network, variable selection, penalized maximum
likelihood estimation, optimization, functional enrichment analysis.
|
5 |
Comparisons of statistical modeling for constructing gene regulatory networksChen, Xiaohui 11 1900 (has links)
Genetic regulatory networks are of great importance in terms of scientific interests and practical medical importance. Since a number of high-throughput
measurement devices are available, such as microarrays and
sequencing techniques, regulatory networks have been intensively studied
over the last decade. Based on these high-throughput data sets, statistical interpretations of these billions of bits are crucial for biologist to extract meaningful results. In this thesis, we compare a variety of existing
regression models and apply them to construct regulatory networks which
span trancription factors and microRNAs. We also propose an extended
algorithm to address the local optimum issue in finding the Maximum A
Posterjorj estimator. An E. coli mRNA expression microarray data set with
known bona fide interactions is used to evaluate our models and we show
that our regression networks with a properly chosen prior can perform comparably
to the state-of-the-art regulatory network construction algorithm.
Finally, we apply our models on a p53-related data set, NCI-60 data. By
further incorporating available prior structural information from sequencing
data, we identify several significantly enriched interactions with cell proliferation
function. In both of the two data sets, we select specific examples
to show that many regulatory interactions can be confirmed by previous
studies or functional enrichment analysis. Through comparing statistical
models, we conclude from the project that combining different models with
over-representation analysis and prior structural information can improve
the quality of prediction and facilitate biological interpretation.
Keywords: regulatory network, variable selection, penalized maximum
likelihood estimation, optimization, functional enrichment analysis. / Science, Faculty of / Graduate
|
6 |
Functional association networks for disease gene predictionGuala, Dimitri January 2017 (has links)
Mapping of the human genome has been instrumental in understanding diseasescaused by changes in single genes. However, disease mechanisms involvingmultiple genes have proven to be much more elusive. Their complexityemerges from interactions of intracellular molecules and makes them immuneto the traditional reductionist approach. Only by modelling this complexinteraction pattern using networks is it possible to understand the emergentproperties that give rise to diseases.The overarching term used to describe both physical and indirect interactionsinvolved in the same functions is functional association. FunCoup is oneof the most comprehensive networks of functional association. It uses a naïveBayesian approach to integrate high-throughput experimental evidence of intracellularinteractions in humans and multiple model organisms. In the firstupdate, both the coverage and the quality of the interactions, were increasedand a feature for comparing interactions across species was added. The latestupdate involved a complete overhaul of all data sources, including a refinementof the training data and addition of new class and sources of interactionsas well as six new species.Disease-specific changes in genes can be identified using high-throughputgenome-wide studies of patients and healthy individuals. To understand theunderlying mechanisms that produce these changes, they can be mapped tocollections of genes with known functions, such as pathways. BinoX wasdeveloped to map altered genes to pathways using the topology of FunCoup.This approach combined with a new random model for comparison enables BinoXto outperform traditional gene-overlap-based methods and other networkbasedtechniques.Results from high-throughput experiments are challenged by noise and biases,resulting in many false positives. Statistical attempts to correct for thesechallenges have led to a reduction in coverage. Both limitations can be remediedusing prioritisation tools such as MaxLink, which ranks genes using guiltby association in the context of a functional association network. MaxLink’salgorithm was generalised to work with any disease phenotype and its statisticalfoundation was strengthened. MaxLink’s predictions were validatedexperimentally using FRET.The availability of prioritisation tools without an appropriate way to comparethem makes it difficult to select the correct tool for a problem domain.A benchmark to assess performance of prioritisation tools in terms of theirability to generalise to new data was developed. FunCoup was used for prioritisationwhile testing was done using cross-validation of terms derived fromGene Ontology. This resulted in a robust and unbiased benchmark for evaluationof current and future prioritisation tools. Surprisingly, previously superiortools based on global network structure were shown to be inferior to a localnetwork-based tool when performance was analysed on the most relevant partof the output, i.e. the top ranked genes.This thesis demonstrates how a network that models the intricate biologyof the cell can contribute with valuable insights for researchers that study diseaseswith complex genetic origins. The developed tools will help the researchcommunity to understand the underlying causes of such diseases and discovernew treatment targets. The robust way to benchmark such tools will help researchersto select the proper tool for their problem domain. / <p>At the time of the doctoral defense, the following papers were unpublished and had a status as follows: Paper 5: Manuscript. Paper 6: Manuscript.</p>
|
Page generated in 0.132 seconds