1 |
Inference dynamics in transcriptional regulationAsif, Hafiz Muhammad Shahzad January 2012 (has links)
Computational systems biology is an emerging area of research that focuses on understanding the holistic view of complex biological systems with the help of statistical, mathematical and computational techniques. The regulation of gene expression in gene regulatory network is a fundamental task performed by all known forms of life. In this subsystem, modelling the behaviour of the components and their interactions can provide useful biological insights. Statistical approaches for understanding biological phenomena such as gene regulation are proving to be useful for understanding the biological processes that are otherwise not comprehensible due to multitude of information and experimental difficulties. A combination of both the experimental and computational biology can potentially lead to system level understanding of biological systems. This thesis focuses on the problem of inferring the dynamics of gene regulation from the observed output of gene expression. Understanding of the dynamics of regulatory proteins in regulating the gene expression is a fundamental task in elucidating the hidden regulatory mechanisms. For this task, an initial fixed structure of the network is obtained using experimental biology techniques. Given this network structure, the proposed inference algorithms make use of the expression data to predict the latent dynamics of transcription factor proteins. The thesis starts with an introductory chapter that familiarises the reader with the physical entities in biological systems; then we present the basic framework for inference in transcriptional regulation and highlight the main features of our approach. Then we introduce the methods and techniques that we use for inference in biological networks in chapter 2; it sets the foundation for the remaining chapters of the thesis. Chapter 3 describes four well-known methods for inference in transcriptional regulation with pros and cons of each method. Main contributions of the thesis are presented in the following three chapters. Chapter 4 describes a model for inference in transcriptional regulation using state space models. We extend this method to cope with the expression data obtained from multiple independent experiments where time dynamics are not present. We believe that the time has arrived to package methods like these into customised software packages tailored for biologists for analysing the expression data. So, we developed an open-sources, platform independent implementation of this method (TFInfer) that can process expression measurements with biological replicates to predict the activities of proteins and their influence on gene expression in gene regulatory network. The proteins in the regulatory network are known to interact with one another in regulating the expression of their downstream target genes. To take this into account, we propose a novel method to infer combinatorial effect of the proteins on gene expression using a variant of factorial hidden Markov model. We describe the inference mechanism in combinatorial factorial hidden model (cFHMM) using an efficient variational Bayesian expectation maximisation algorithm. We study the performance of the proposed model using simulated data analysis and identify its limitation in different noise conditions; then we use three real expression datasets to find the extent of combinatorial transcriptional regulation present in these datasets. This constitutes chapter 5 of the thesis. In chapter 6, we focus on problem of inferring the groups of proteins that are under the influence of same external signals and thus have similar effects on their downstream targets. Main objectives for this work are two fold: firstly, identifying the clusters of proteins with similar dynamics indicate their role is specific biological mechanisms and therefore potentially useful for novel biological insights; secondly, clustering naturally leads to better estimation of the transition rates of activity profiles of the regulatory proteins. The method we propose uses Dirichlet process mixtures to cluster the latent activity profiles of regulatory proteins that are modelled as latent Markov chain of a factorial hidden Markov model; we refer to this method as DPM-FHMM. We extensively test our methods using simulated and real datasets and show that our model shows better results for inference in transcriptional regulation compared to a standard factorial hidden Markov model. In the last chapter, we present conclusions about the work presented in this thesis and propose future directions for extending this work.
|
2 |
Application of machine learning for the clustering of wheat transcription factor proteins into families and sub-familiesSameer, Haleemath Sameena January 2022 (has links)
Wheat plays an important role in ensuring the global food security. Salinity of soil and water poses a major threat to its production and it affects both growth and development of wheat in a negative way. Wheat plants uses certain molecular mechanisms to adapt themselves under the salt stress.Transcription factor proteins are the proteins that control the response of the wheat towards abiotic stress like salinity.There are 56 transcription factor protein families in the wheat genome. However these transcription factor protein families are not classified into subfamilies.The main goal of this research study is to understand how machine learning algorithm can be used to identify and cluster the transcription factor proteins into sub families that can help in associating them with specific biological processes like salt stress. In this project K Mean Clustering method is used to cluster the WRKY transcription factor family into subfamilies. WRKY is identified and clustered into three distinct clusters. Cluster validation is performed using external validation and resulted in 90% validation score. This method can be applied to other transcription factor families also. This can ultimately be helpful in producing salt-tolerant varieties of the wheat that are resistant to abiotic stress like salinity and this can help to improve crop yield.
|
3 |
New quinazoline analogues as NF-κB activation inhibitorsXu, Lu 01 January 2013 (has links)
NF-κB is a transcription factor protein complex that plays an important role in some cancers and inflammatory responses. It can enhance the proliferation rate, reduce apoptosis, as well as create more blood flow to ensure the survival of cancer. Thus blocking the NF-κB pathway has potential therapeutic benefit. We designed a series of compounds based on quinazoline scaffold pharmacophore model which may have high binding affinity with p50 subunit of NF-κB. The compound series with phenyl substitution at position 2 of quinazoline proved to be more effective at inhibiting NF-κB function both theoretically and experimentally. These compounds also reduce the proliferation of numerous tumor cell lines and the mean GI50 for representative compound 2a is 2.88μM on NCI 60 cell lines. Compound 2a can induce significant apoptosis at the concentration of 1μM. The exploration of the mechanism of action of these compounds found that 2a does not inhibit kinases upstream of NF-κB and does not inhibit p65 translocation from the cytosol to the nucleus as 2b does. However 2a inhibits NF-κB dependent Luciferase expression as well as NF-κB target genes better than 2b. This may suggest that 2a inhibits the NF-κB pathway by directly blocking gene transcription, while 2b acts at cytoplasmic stage.
|
Page generated in 0.1315 seconds