Global ETD Search

1	A widespread impact on small RNAs and gene networks in rice MSP1/OsTDL1a mutants, partners with key roles in anther development Fei, Qili 09 May 2017 (has links) <p> Dissection of the genetic pathways and mechanisms by which anther development occurs in grasses is crucial for both a basic understanding of plant development and for traits of agronomic importance like male sterility. In rice, MULTIPLE SPOROCYTES1 (MSP1), a leucine-rich-repeat receptor kinase, play an important role in anther development by limiting the number of sporocytes. OsTDL1a (a TPD1-like gene in rice) encodes a small protein which acts as a cofactor of MSP1 in the same regulatory pathway. In this study, we analyzed small RNA and mRNA changes in different stages of spikelets from wildtype rice, and from msp1 and ostdl1a mutants. Analysis across different stages of rice spikelets of the small RNA data identified miRNAs demonstrating differential abundances. miR2275 was depleted in the two rice mutants; this miRNA is specifically enriched in anthers and functions to trigger the production of 24-nt phased secondary siRNAs (phasiRNAs) from <i>PHAS</i> loci. We observed that the 24-nt phasiRNAs as well as their precursor <i>PHAS</i> mRNAs were also depleted in the two mutants. Based on comparisons of transcript levels across the spikelet stages and mutants, we identified 22 transcription factors as candidates to have roles specific to anther development, potentially acting downstream of the OsTDL1a-MSP1 pathway. An analysis of co-expression identified three Argonaute-encoding genes (<i>OsAGO1d, OsAGO2b,</i> and <i> OsAGO18</i>) that accumulate transcripts coordinately with phasiRNAs, suggesting a functional relationship. By mRNA in situ analysis, we demonstrated a strong correlation between the spatiotemporal pattern of accumulation of these OsAGO transcripts with previously-published phasiRNA accumulation patterns from maize.</p> Biology\|Bioinformatics
2	Intron loss and gain in Eukaryotes Coulombe-Huntington, Jasmin January 2008 (has links) No description available. Biology - Bioinformatics
3	Investigating non-canonical functions of gamma-tubulin by using genome scale structure-function (GSSF) analysis Nguyen, Thi Thu Thao January 2010 (has links) No description available. Biology - Bioinformatics
4	Modeling Cancer Progression on the Pathway Level Edelman, Elena Jane 11 December 2008 (has links) <p>Over the past several decades, many genes have been discovered that govern important functions in the development of a variety of diﬀerent cancers. However, biological insight from the list of genes is still limited and the underlying mechanisms that occur in the cell during tumorigenesis have not been well established. Studying cancer progression in terms of the oncogenic pathways that are responsible for speciﬁc actions that change normal cells into tumors is a means for bringing insight onto these issues. The work presented here will uncover mechanisms that are occurring at the pathway level that ﬁrst initiate tumor formation and then continue through cancer progression and ﬁnally metastasis. This knowledge will allow for drug treatment that is better targeted towards an individual.</p><p>Microarray technology has allowed for the collection of gene expression datasets from clinical cancer and other studies. These datasets can be used to study how expression levels of individual genes or groups of related genes are altered in individuals from diﬀerent phenotypic groups. Statistical methods exist which assay pathway enrichment by phenotypic class but do not describe individual variation. In order to study this individual variation, we developed a formal statistical method called ASSESS which measures the enrichment of a gene set in each sample in an expression dataset.</p><p>As cancer advances through the stages of initiation, progression, and proliferation, multiple pathways experience disruptions at various times. However, there is still much unknown on these particular pathways that evidence gene expression changes throughout tumorigenesis. Using gene expression datasets comprised of individuals with tumors classiﬁed by location and stage, we applied ASSESS in order to study the data on the pathway level. We then utilized novel statistical methods to uncover the pathways that play a role in cancer progression and in what order the pathways become perturbed.</p><p>These analyses can give a basis for how genetic disruptions serve to alter actions in speciﬁc cell types. The results may provide insight that will lead to treatments of existing tumors and prevention of incipient cancers from forming. Treatments for existing tumors will use multiple drugs to target the pathways that show an altered state of activity.</p> / Dissertation Biology, Bioinformatics
5	The Geometry of Cancer Guinney, Justin January 2009 (has links) <p>Cancer is a complex, multifaceted disease that operates through dynamic changes in the genome. Cancer is best understood through the process that generates it -- random mutations operated on by natural selection -- and several global hallmarks that describe its broad mechanisms. While many genes, protein interactions, and pathways have been enumerated as a kind of ``parts'' list for cancer, researchers are attempting to synthesize broader models for inferring and predicting cancer behavior using high-throughput data and integrative analyses. </p><p>The focus of this thesis is on the development of two novel methods that are optimized for the analysis of complex cancer phenotypes. The first method incorporates ideas from gradient learning with multitask learning to assess statistical dependencies across multiple related data sets. The second method integrates multiscale analysis on graphs and manifolds developed in applied harmonic analysis with sparse factor models, a mainstay of applied statistics. This method generates multiscale factors that are used for inferring hierarchical associations within complex biological networks. The primary biological focus is the inference of gene and pathway dependencies associated with cancer progression and metastatic disease in prostate cancer. Significant findings include evidence of Skp2 degradation of the cell-cycle regulator p27, and the upstream deregulation of the TGF-beta pathway, driving prostate cancer recurrence.</p> / Dissertation Biology, Bioinformatics
6	Computational DNA motif discovery in plant promoters Fauteux, François January 2010 (has links) The regulation of gene expression is driven primarily by transcription factors binding to short DNA sequences. Here three studies related to promoter cis-regulatory motif discovery in plant promoters are presented. In the first study, an exact discriminative seeding DNA motif discovery addressing key issues associated with popular DNA motif discovery algorithms is proposed. The Seeder algorithm outperforms popular motif discovery tools on biological benchmark data. In the second study, the algorithm is applied to the identification of cis-regulatory motifs in seed storage protein gene promoters. Known and new motifs are discovered. In the third study, groups of orthologous genes are identified among five dicotyledonous plant species, and DNA motif discovery is carried out in the proximal promoter sequence within each group. The presence of three large clusters of groups of orthologous promoters sharing similar motifs is revealed. / L'expression des gènes est régulée, en grande partie, par la liaison des facteurs de transcription à de courtes séquences d'ADN. Trois études sont présentées, portant sur l'identification in silico de motifs régulateurs dans les séquences promotrices de gènes végétaux. Dans la première étude, un algorithme d'initiation discriminative exacte est présenté. L'algorithme surpasse plusieurs algorithmes populaires lorsque appliqué à des données biologiques de référence. Dans la deuxième étude, l'algorithme est utilisé pour l'identification de motifs cis-régulateurs conservés dans les promoteurs de gènes de protéines de réserve des graines chez diverses espèces végétales. Des motifs connus ainsi que de nouveaux motifs sont identifiés. Dans la troisième étude, des groupes de gènes orthologues sont identifiés chez cinq espèces dicotylédones, et une recherche de motifs cis-régulateurs est réalisée dans les séquences promotrices proximales pour chaque groupe. La présence de trois larges grappes de groupes d'orthologues partageant des motifs similaires est mise en évidence. Biology - Bioinformatics
7	Role of non-signaling (decoy) chemokine receptors in regulating cell migration: the mathematical model Qu, Yiding January 2013 (has links) Chemokines belong to a family of important chemoattractants that guide the directional migration of the cell. The cognate chemokine receptor on the cell senses the chemokine gradient and the cell moves towards the signal of increasing chemokine concentration. However, several chemokine receptors were recently identified as non-signaling (decoy), based on their ability to bind the chemokine but produce no measurable signal for the cell. The function of these decoy receptors is yet unknown. We hypothesized that the ligand binding by the decoy receptor may help maintaining a sharper chemokine gradient and thus stimulate the cell migration. We first assessed if the expression of decoy and corresponding signaling receptors changes when cancer cells acquire migratory phenotype – become metastatic. Using publically available database of gene expression in normal prostate, carcinoma and metastatic prostate cancer samples, we have found that the expression of decoy receptors CCX-CKR and OPG increased in metastatic cancer cells compared to normal prostate and positively correlated with the expression of signaling receptors CCR7 and RANK respectively. We next developed mathematical model that described the dynamics of chemokine ligand, normal receptor and decoy receptor as well as subsequent cell movement. Using this model we first assessed how the cells expressing signaling receptors only migrate towards the source of ligand given at different concentrations. At low levels of ligand, cell migration increased with the increase in ligand concentration. However, at higher concentrations, when the ligand levels exceeded the signaling receptor capacity, further increase in ligand resulted in the decrease the distance of cell migration. Importantly, at high levels of ligand the presence of the decoy receptor improved the speed and distance of cell migration. This study suggests the novel function for the non-signaling chemokine receptors in maintaining the chemokine gradient and positively regulating directional cell migration. / Les chimiokines appartiennent à une importante famille de ligands chimiotactiques qui guident la direction migratoire des cellules. Sur une cellule-cible, des récepteurs spécifiques à une chimiokine donnée répondent à un gradient du ligand, provoquant la migration cellulaire vers le signal avec une concentration croissante. Cependant, quelques récepteurs pouvant liés des chimiokines ont récemment été identifiés comme muets (leurre) parce que la liaison du ligand ne stimule pas de signalisation mesurable dans la cellule. La fonction de ces récepteurs-leurres n'est pas connue actuellement.Nous avons émis l'hypothèse que l'interaction des chimiokines à ces récepteurs-leurres contribue à maintenir un gradient de ligand plus prononcé et donc stimule les cellules à migrer. Afin de tester cette hypothèse, nous avons en premier comparé l'expression de récepteurs signalant et de récepteurs-leurres pour un même ligand, quand des cellules deviennent métastatiques. En utilisant des bases de données publiques sur l'expression des gènes dans des échantillons de prostate normale, de carcinomes prostatiques, et de métastases prostatiques, nous avons remarqué que l'expression des récepteurs-leurres CCX-CKR et OPG est augmentée dans les cellules métastatiques lorsque comparée avec les cellules de prostate normales. Nous avons aussi trouvé une corrélation positive avec les niveaux d'expression des récepteurs signalants CCR7 et RANK. Par la suite, nous avons développé un modèle mathématique qui prédit la dynamique des concentrations de chimiokines, l'expression des récepteurs signalants, des récepteurs-leurres, et des mouvements de la cellule résultants. Nous avons tout d'abord utilisé ce modèle afin de prédire comment des cellules exprimant seulement des récepteurs signalant migrent vers la source du ligand selon sa concentration. En présence de faibles concentrations de ligand, la migration cellulaire augmente proportionnellement à l'augmentation de la concentration du ligand. Cependant, à des concentrations plus élevées dépassant la capacité de liaison du récepteur signalant, une augmentation subséquente diminue la distance migrée par la cellule. L'expression concomittante de récepteurs-leurres améliore la vitesse et la distance de la migration cellulaire lorsque la concentration du ligand est élevée. Cette étude suggère donc que les récepteurs-leurres des chimiokines contribuent au gradient chimiotactique et augmentent la migration des cellules. Biology - Bioinformatics
8	Closing the gap between genome analysis and the biologist Forgetta, Vincenzo January 2013 (has links) Bioinformatics is a crucial component of genomics research because it enables the analyses of large and complex data sets. Conventionally, these analyses involve the use of sophisticated software, and are largely performed by those with prior experience in bioinformatics using adequate computational resources.Massively parallel DNA sequencing (MPS) platforms have democratized genome sequencing, making it affordable to the biologist. For many biologists this will be their first venture into bioinformatics and genomics. Consequently, they may be unfamiliar with bioinformatics or lack the necessary computer resources. For these biologists, the potential of using MPS platforms for genome analysis is half fulfilled; providing affordable genomic data without the means to easily analyze it. One approach to close this gap is to build software oriented towards those with limited bioinformatics expertise or resources.This dissertation describes a paradigm to close the gap between genome analysis and the biologist. Using this paradigm, I have developed software tools for three bioinformatics tasks in genome analysis: [i] assessment of a genome assembly, [ii] display and integrated analysis of genomic data, and [iii] deriving biological insight using public information. The first tool I developed was cgb, a program that creates custom UCSC Genome Browsers, allowing biologists to use this browser for genome sequences obtained from MPS platforms. Using cgb for a comparative genomics study of Clostridium difficile assisted us to identify diagnostic DNA markers associated with disease severity and to estimate that the pan-genome is larger than previously estimated. Next I developed contiGo, a general purpose tool to inspect genome assemblies via a web browser, thus bypassing the need for the biologist to install software, satisfy hardware requirements, and download large datasets. Along with cgb, this program enabled us to evaluate the performance of the Roche/454 Genome Sequencer-FLX MPS platform across five sequencing core facilities, and to produce a high quality genome sequence of the fungus Ophiostoma novo-ulmi. Lastly, I developed BL!P, a program to automate NCBI BLAST searches and explore the results in a dynamic interface. This program was inspired by my work on characterizing the genome of a multi-drug resistant and pathogenic strain of Escherichia fergusonii, for which cgb and contiGo were also used in data analysis. These applications have been used in other genomics projects by users with a range of bioinformatics expertise and resources. Other data-intensive fields of science could benefit from a similar software development paradigm. / La bioinformatique fait maintenant partie intégrante de la recherche en génomique, car elle permet des analyses de bases de données larges et complexes. Conventionnellement, ces analyses impliquent l'utilisation de logiciels sophistiqués et sont généralement faites par des personnes expérimentées en bioinformatique qui utilisent des ressources informatiques adéquates.Les plateformes de séquençage haut débit d'ADN ont démocratisé le séquençage du génome, le rendant ainsi accessible aux biologistes. Pour de nombreux biologistes, ce sera leur première incursion dans les domaines de la bioinformatique et de la génomique. Par conséquent, ils ne sont probablement pas familiers avec la bioinformatique ou n'ont pas les ressources informatiques nécessaires afin d'analyser les résultats. Pour ces biologistes, l'utilisation des plateformes de séquençage haut débit permet l'obtention abordable de données génomiques, mais n'offre pas les outils pour les analyser facilement. Le développement de logiciels ciblant les chercheurs ayant une expertise en bioinformatique limitée ou avec peu de ressources permettrait de combler cet écart.Cette dissertation décrit un paradigme visant à réduire, voire même à fermer, l'écart entre l'analyse du génome et le biologiste. En utilisant ce paradigme, j'ai développé des outils informatiques pour trois tâches facilitant l'analyse génomique : [i] l'évaluation de l'assemblage du génome, [ii] l'affichage et l'analyse intégrée des données génomiques, et [iii] l'obtention de connaissances biologiques utilisant de l'information publique. Le premier outil que j'ai développé était cgb, un programme qui crée des navigateurs personnalisés « UCSC Genome ». Il permet aux biologistes d'utiliser ces navigateurs pour évaluer les séquences obtenues à partir de plateformes de séquençage haut débit. L'utilisation de cgb lors d'une étude génomique comparative de Clostridium difficile nous a permis d'identifier des marqueurs diagnostics d'ADN associés à la gravité de la maladie et de démontrer que son pan-génome est plus grand qu'estimé précédemment. Ensuite, j'ai développé contiGo, un outil d'usage général pour réviser les assemblages de séquences génomiques par l'intermédiaire d'un navigateur web. Cette application permet aux biologistes de contourner la nécessité d'installer un logiciel, de satisfaire les exigences de l'équipement informatique, et de télécharger des larges bases de données. Conjointement avec cgb, ce programme nous a permis d'évaluer la performance de la plateforme de séquençage haut débit Roche/454 Genome Sequencer FLX, à travers cinq installations de séquençage, ainsi qu'à générer une séquence génomique de grande qualité du champignon Ophiostoma novo-ulmi. Finalement, j'ai développé BL!P, un programme pour automatiser les recherches BLAST NCBI et pour explorer les résultats obtenus dans une interface dynamique. Ce programme a été inspiré par mon travail sur la caractérisation du génome d'une souche pathogène et multi résistante d'Escherichia fergusonii, et pour laquelle cgb et contiGo ont également été utilisés dans l'analyse des données. Ces applications ont été utilisées dans d'autres projets de génomique par des utilisateurs possédant un éventail de compétences et de ressources bioinformatiques. D'autres domaines scientifiques générant des multitudes de données pourraient bénéficier d'un paradigme similaire de développement de logiciel informatique. Biology - Bioinformatics
9	WRAPS -- a system for determining the probability of prokaryotic protein annotation correctness Nelson, Benjamin K. 21 May 2013 (has links) <p> Advances in sequencing technology have resulted in the sequencing of whole genomes from many simple organisms such as fungi and bacteria, while allowing the assembly of much more complex genomes like humans and chimpanzees. Consequently, association of segments of newly sequenced genomes to specific function (i.e. annotation) is being completed by comparative study of protein coding regions from previously annotated genome data. While this is an ideal procedure to process and annotate huge number of available genomic sequences, this approach can potentially lead to propagating erroneous annotation in a public sequence repository and vastly diminish the integrity of these new annotation of genome sequences. In this project, the WRongly Annotated Protein identification System (WRAPS) has been created to analyze previously annotated proteins quickly and efficiently. The likeliness that the protein is correctly annotated is determined by weighted scoring schema based on conservation of protein domain, the domains present in different reading frames, and isoelectric point. A study of 88,023 proteins of Yersinia, Staphylococcus, and Bacillus using WRAPS show that there are several proteins that can be considered wrongly annotated, as well as the correctness of annotation among these proteins. </p> Biology, Bioinformatics
10	Implications of host ULP1-like domains in DNA transposons Sabry, Nadia Hesham January 2010 (has links) Transposable elements are known to occasionally capture host genetic material. In the case of DNA transposons, such transduplications result in pseudogenic sequences that are eventually lost. An exception is the conserved transduplicated ubiquitin-like-specific protease 1 (ULP1)-like conserved domain (CD) previously described in Kiaonashi (KI) and other MULE (Mutator) transposons, its conservation suggesting it confers an advantage to the transposons and/or their hosts. In the present study, we describe and characterize a previously unreported ULP1-like domain (ONI) found in the Rim2/Hipa transposons of the CACTA superfamily in rice. We show that ONI is conserved amongst the elements, is more related to the KI domain in Arabidopsis than to host gene ULP1 CDs, is most closely related to ULP1 CDs recently found in grapevine CACTA elements and was most likely independently acquired in each of the CACTA and MULE transposons lineages. The discovery of a new conserved ULP1-like transduplicated domain in a different DNA transposon superfamily than previously supports the proposal that the ULP1- CD containing genes in these transposons contribute an evolutionary advantage benefit to the transposons. This suggests that transduplication may be then an evolutionarily significant mechanism for transposons, providing a source of diversity. / Des éléments transposables sont connus pour capturer de temps en temps le matériel génétique de leur hôte. Dans le cas des transposons d'ADN, de tels transduplications ont comme conséquence des pseudogênes qui sont par la suite perdus. Une exception est le domaine transduplicaté conservé d'ULP1 précédemment décrit dans Kionashi (KI) et d'autres transposons de MULE, sa conservation le suggérant confère un avantage aux transposons et/ou à leurs hôtes. Dans la présente étude, nous décrivons et caractérisons un domaine précédemment non rapporté d'ULP1 (ONI) trouvé dans les transposons de Rim2/Hipa du famille superbe de CACTA en riz. Nous prouvons qu'ONI est conservé parmi les éléments, davantage est lié au domaine de KI dans Arabidopsis qu'aux domaines conservés (DC) du gène ULP1 de l'hôte, le plus étroitement est lié aux DC ULP1 récemment trouvés dans des éléments de la vigne CACTA et était le plus susceptible indépendamment acquis dans chacun des lignées de CACTA et de MULE de transposons. La découverte d'un nouveau domaine transduplicaté par ULP1 conservé dans un super-famillie différent de transposon d'ADN qu'appuie précédemment la proposition que les gènes contenant du DC d'ULP1- dans ces transposons contribuent un avantage évolutionnaire avantageux aux transposons. Ceci suggère que le transduplication puisse être alors un mécanisme évolutionnaire significatif pour des transposons, fournissant une source de diversité. Biology - Bioinformatics

Search results