• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 90
  • 34
  • 27
  • 13
  • 4
  • 4
  • 4
  • 1
  • Tagged with
  • 209
  • 209
  • 57
  • 51
  • 45
  • 35
  • 35
  • 35
  • 27
  • 23
  • 19
  • 15
  • 14
  • 14
  • 14
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
11

Analysis of gene evolution in the long-lived Heliconius butterfly genus

Ward, LeAnn 12 May 2023 (has links) (PDF)
The butterfly genus Heliconius has an unusually long lifespan (six to eight months compared to only several weeks) and the unique ability to feed on pollen. The additional amino acids provided by a pollen diet seem intrinsically tied to longer lifespan in Heliconius. Because of these traits, Heliconius may be a valuable model species for future aging studies. As a preliminary analysis, we investigated differences in gene evolution between Heliconius and seven short-lived butterflies. We observed that the Heliconius genes had significant differences in evolutionary rates, in the direction of selection, and in the amounts of site-specific positive selection compared to the short-lived butterflies. We also investigated functional differences in gene complements between the Heliconius and short-lived butterflies but found only small differences. With this research, we present exploratory evidence supporting differences in gene evolution of the Heliconius to establish targets for future analysis of Heliconius as models for aging.
12

Pseudomonas aeruginosa Prairie Epidemic Strain Population Dynamics and Evolution of Disease in Cystic Fibrosis Airways of Adult Patients

Szymkiewicz, Rachelle January 2018 (has links)
The lower airways of patients with chronic airway diseases including cystic fibrosis (CF) are colonized by diverse communities of microorganisms. Over-time the airways of some 60% of CF patients become permanently colonized and dominated by Pseudomonas aeruginosa. Chronic infection of P. aeruginosa has been associated with a decline in pulmonary function, worse prognosis, and eventual patient mortality. Although P. aeruginosa evolves within the CF airways resulting in complex populations, the mechanism by which these complex populations contribute to disease progression is not well understood. Here we show diversity among isolates by observed changes in genome sequences of a strain of P. aeruginosa, known as Prairie Epidemic Strain (PES). Using whole genome sequencing and comparative genomics we identified a large core genome across 195 PES isolates from 57 CF patients of the Calgary Adult Cystic Fibrosis Clinic (CACFC) where 88% of the pangenome was categorized as core genes. Single nucleotide polymorphism (SNPs) mutations were shown to be the largest contributor of diversity at the nucleotide level compared to other polymorphism types consisting of 87% of the total polymorphisms present across the 195 PES isolates. CRISPR arrays and mobile elements such as prophage and plasmids demonstrate this strain of P. aeruginosa was stable over 30 years. In a second aim, I show variation in the populations of P. aeruginosa across an exacerbation event further highlighting the complexity of the lung bacterial community. Distinct populations of P. aeruginosa at the onset and resolution of an exacerbation within a single CF patient were identified by SNPs. These results a model where adaptive radiation as well as natural mutations contribute to the heterogeneity and diversification within populations of P. aeruginosa in CF patients. Understanding the evolution and population structure of PES through the identification of important genes and mutations through the clinical course of an exacerbation can aid in identifying new targets for patient treatment of P. aeruginosa in CF. / Thesis / Master of Science (MSc) / Cystic fibrosis is a life-threatening disease characterized by cycles of stability and respiratory illness. Bacterial species within the lungs of these patients are the main contributor to disease progression. I investigated a specific transmissible epidemic strain, Pseudomonas aeruginosa Prairie Epidemic Strain, using a unique collection of samples provided by collaborators at the adult cystic fibrosis clinic in Calgary. Using these samples, I first explored the differences between patients over a period of 34 years. I hypothesized that similar changes in genome sequences will be observed in multiple patients with a possible commonality in disease progression. Second, I explored the role this bacterial pathogen may play in cycles of respiratory illness. I hypothesize that a specific bacterial subpopulation could initiate these cycles and be identified by changes at the genome level. This research provides further knowledge of an epidemic strain of cystic fibrosis.
13

Comparative Genome Analysis of Three Brucella spp. and a Data Model for Automated Multiple Genome Comparison

Sturgill, David Matthew 09 October 2003 (has links)
Comparative analysis of multiple genomes presents many challenges ranging from management of information about thousands of local similarities to definition of features by combination of evidence from multiple analyses and experiments. This research represents the development stage of a database-backed pipeline for comparative analysis of multiple genomes. The genomes of three recently sequenced species of Brucella were compared and a superset of known and hypothetical coding sequences was identified to be used in design of a discriminatory genomic cDNA array for comparative functional genomics experiments. Comparisons were made of coding regions from the public, annotated sequence of B. melitensis (GenBank) to the annotated sequence of B. suis (TIGR) and to the newly-sequenced B. abortus (personal communication, S. Halling, National Animal Disease Center, USDA). A systematic approach to analysis of multiple genome sequences is described including a data model for storage of defined features is presented along with necessary descriptive information such as input parameters and scores from the methods used to define features. A collection of adjacency relationships between features is also stored, creating a unified database that can be mined for patterns of features which repeat among or within genomes. The biological utility of the data model was demonstrated by a detailed analysis of the multiple genome comparison used to create the sample data set. This examination of genetic differences between three Brucella species with different virulence patterns and host preferences enabled investigation of the genomic basis of virulence. In the B. suis genome, seventy-one differentiating genes were found, including a contiguous 17.6 kb region unique to the species. Although only one unique species-specific gene was identified in the B. melitensis genome and none in the B. abortus genome, seventy-nine differentiating genes were found to be present in only two of the three Brucella species. These differentiating features may be significant in explaining differences in virulence or host specificity. RT-PCR analysis was performed to determine whether these genes are transcribed in vitro. Detailed comparisons were performed on a putative B. suis pathogenicity island (PAI). An overview of these genomic differences and discussion of their significance in the context of host preference and virulence is presented. / Master of Science
14

Mold Allergomics: Comparative and Machine Learning Approaches

Dang, Ha Xuan 05 June 2014 (has links)
Fungi are one of the major organisms that cause allergic disease in human. A number of proteins from fungi have been found to be allergenic or possess immunostimulatory properties. Identifying and characterizing allergens from fungal genomes will help facilitate our understanding of the mechanism underlying host-pathogen interactions in allergic diseases. Currently, there is a lack of tools that allow us to rapidly and accurately predict allergens from whole genomes. In the context of whole genome annotation, allergens are rare compared to non-allergens and thus the data is considered highly skewed. In order to achieve a confident set of predicted allergens from a genome, false positive rates must be lowered. Current allergen prediction tools often produce many false positives when applied to large-scale data set such as whole genomes, and thus lower the precision. Moreover, the most accurate tools are relatively slow because they use sequence alignment to construct feature vectors for allergen classifiers. This dissertation presents computational approaches in characterizing the allergen repertoire in fungal genomes as part of the whole genome studies of Alternaria, an important allergenic/opportunistic human pathogenic fungus and necrotrophic plant parasite. In these studies, the genomes of multiple Alternaria species were characterized for the first time. Functional elements (e.g. genes, proteins) were first identified and annotated from these genomes using computational tools. Protein annotation and comparative genomics approaches revealed the link between Alternaria genotypes and its prolific saprophytic lifestyle that provides at least a partial explanation for the development of pathological relationships between Alternaria and humans. A machine learning based tool (Allerdictor) was developed to address the neglected problem of allergen prediction in highly skewed large-scale data sets. Allerdictor exhibited high precision over high recall at fast speed and thus it is a more practical tool for large-scale allergen annotation compared with existing tools. Allerdictor was then used together with a comparative genomics approach to survey the allergen repertoire of known allergenic fungi. We predicted a number of mold allergens that have not been experimentally characterized. These predicted allergens are potential candidates for further experimental and clinical validation. Our approaches will not only facilitate the study of allergens in the increasing number of sequenced fungal genomes but also will be useful for allergen annotation in other species and rapid prescreening of synthesized sequences for potential allergens. / Ph. D.
15

Comparative Genomics Insights into Speciation and Evolution of Hawaiian Drosophila

Kang, Lin 01 May 2017 (has links)
Speciation and adaptation have always been of great interest to biologists. The Hawaiian archipelago provides a natural arena for understanding adaptive radiation and speciation, and genomics and bioinformatics offer new approaches for studying these fundamental processes. The mode of speciation should have profound impacts on the genomic architecture and patterns of reproductive isolation of new species. The Hawaiian Drosophila are a spectacular example of sequential colonization, adaptive radiation, and speciation in the islands with nearly 1,000 estimated species, of which more than 500 have been described to date. This dissertation gives an overview of the Hawaiian Drosophila system (Chapter 1), new insights into genomes of three recently diverged species of Hawaiian picture-winged Drosophila (Chapter 2), as well as estimated gene flow patterns (Chapter 3). Additionally, I present a new approach of mapping genomic scaffolds onto chromosomes, based on NextGen sequencing from chromosomal microdissections (Chapter 4), and gene expression profiles of backcross hybrids and their parental forms (Chapter 5). Overall, obtained results were used to address such fundamental questions as the role of adaptive changes, founder effects (small effective population size in isolation), and genetic admixture during speciation. / Ph. D.
16

Structural Comparative Genomics of Four African Species of Oryza

Goicoechea, Jose Luis January 2009 (has links)
Rice is one of the most important crops in the world and is the first whose genome was completely sequenced. This landmark accomplishment placed O. sativa as a leading model in plant biology, especially for cereals. The genus Oryza includes 23 species, two of them independently domesticated in Asia and Africa. Wild species of Oryza contain a reservoir of useful agronomical traits which could be exploited for the benefit of rice agriculture, which is facing global problems as other crops, mainly due to a rampant increase in the human population and progressive deterioration of soils and water supplies. The Oryza Mapping Alignment Project has opened great opportunities to tap the genetic potential encapsulated in these species. Four BAC libraries generated from the African species of Oryza: O. barthii, O. glaberrima (AA genome), O. punctata (BB genome) and O. brachyantha (FF genome) were fully characterized and shown to provide enough coverage to represent their respective genomes. BAC clones from these libraries were fingerprinted and end-sequenced to assemble physical maps that were heavily manually edited using the sequence of O. sativa as a reference genome. The physical maps showed high coverage for all the species across all chromosomes. Both, BAC libraries and physical maps were used to investigate synteny and structural variation. The four species show high colinearity to the reference genome, although synteny perturbations were detected, including contractions, expansions, and putative inversions and translocations, which potential have an important impact in the evolution of these species.
17

Genomic Exploration of Transcriptional Regulation and Evolution in Vertebrates

Chan, Esther T. M. 16 March 2011 (has links)
All cellular processes depend on the coordinate expression of genes and their interactions. Regulatory sequences encoded in the genome stipulate the necessary instructions interpreted by sequence-specific transcription factors (TFs) to control the spatial-temporal output of gene expression. Detection of cis-regulatory signals is challenging, owing to the lack of distinguishing features such as open reading frames and an overwhelming excess of spurious to functional TF binding site matching sequences embedded within the vast non-coding regions of vertebrate genomes. From an evolutionary standpoint, functional alterations in cis-regulatory architecture are thought to be important in diversifying morphology and physiology in the evolution of vertebrates, which share a similar body plan and complement of genes. Correspondingly, recent studies have highlighted the plasticity of cis-regulatory architecture organization over evolutionary time, finding associations with examples of both diverged and conserved patterns of gene expression. These observations underscore the gap in our collective knowledge with respect to the rules by which TFs recognize and bind their targets in vivo, as well as how this process evolves in vertebrates, and serve as a motivating basis for this thesis work. To begin, I probed the extent of conservation and divergence of sequence and expression profiles across tissues of diverse vertebrate species, identifying thousands of candidate genes with conserved expression by microarray analysis. However, corresponding conservation of non-exonic and potentially regulatory sequence was lacking, suggestive of binding site turnover over evolutionary time. Next, I analyzed the sequence specificity of a wide array of mouse and yeast TFs, finding great diversity and complexity in their binding preferences, with many factors recognizing multiple distinct motifs. Furthermore, comparative analysis of orthologous TFs suggest well conserved binding specificities. I also demonstrate the likely biological relevance of sequences highly preferred by these TFs by revealing distinctive signatures in their distribution and organization within putative regulatory regions in each genome. Lastly, I have begun to explore the organization of cis-regulatory sequences active in vertebrate tissues by high-throughput sequencing of open chromatin. Together, these data help illuminate the organization and evolution of vertebrate regulatory architectures, providing a useful toolkit for the testing of new models and hypotheses.
18

Alinhamento múltiplo de genomas de eucariotos com montagens altamente fragmentadas / Multiple alignment of large eukaryotic genomes with highly fragmented assemblies

Epamino, George Willian Condomitti 04 August 2017 (has links)
O advento do sequenciamento de nova geração (NGS - Next Generation Sequencing) nos últimos anos proporcionou um aumento expressivo no número de projetos genômicos. De maneira simplificada, as máquinas sequenciadoras geram como resultado fragmentos de DNA que são utilizados por programas montadores de genoma. Esses programas tentam juntar os fragmentos de DNA de modo a obter a representação completa da sequência genômica (por exemplo um cromossomo) da espécie sendo sequenciada. Em alguns casos o processo de montagem pode ser executado com maior facilidade para organismos com genomas de tamanhos pequenos (por exemplo bactérias com genoma em torno de 5Mpb), através de pipelines que automatizam a maior parte da tarefa. Um cenário mais complicado surge quando a espécie possui genoma com grande comprimento (acima de 1Gpb) e elementos repetidos, como no caso de alguns eucariotos. Nesses casos o resultado da montagem é geralmente composto por milhares de fragmentos (chamados de contigs), uma ordem de magnitude muito superior ao número de cromossomos estimado para um organismo (comumente da ordem de dois dígitos), dando origem a uma montagem altamente fragmentada. Uma atividade comum nesses projetos é a comparação da montagem com a de outro genoma como forma de validação e também para identificação de regiões conservadas entre os organismos. Embora o problema de alinhamento par-a-par de genomas grandes seja bem contornado por abordagens existentes, o alinhamento múltiplo (AM) de genomas grandes em estado fragmentado ainda é uma tarefa de difícil resolução, por demandar alto custo computacional e grande quantidade de tempo. Este trabalho consiste em uma metologia para fazer alinhamento múltiplo de genomas grandes de eucariotos com montagens altamente fragmentadas. Nossa implementação, baseada em alinhamento estrela, se mostrou capaz de fazer AM de grupos de montagens com diversos níveis de fragmentação. O maior deles, um conjunto de 5 genomas de répteis, levou 14 horas de processamento para fornecer um mapa de regiões conservadas entre as espécies. O algoritmo foi implementado em um software que batizamos de FROG (FRagment Overlap multiple Genome alignment), de código aberto e disponível sob licença GPLv3. / The advent of Next Generation Sequencing (NGS) in recent years has led to an expressive increase in the number of genomic projects. In a simplified way, sequencing machines generate DNA fragments that are used by genome assembler software. These programs try to merge the DNA fragments to obtain the complete representation of the genomic sequence (for example a chromosome) of the species being sequenced. In some cases the assembling process can be performed more easily for organisms with small-sized genomes (e.g. bacteria with a genome length of approximately 5Mpb) through pipelines that automate most of the task. A trickier scenario arises when the species has a very large genome (above 1Gbp) and complex elements, as in the case of some eukaryotes. In those cases the result of the assembly is usually composed of thousands of fragments (called contigs), an order of magnitude much higher than the number of chromosomes estimated for an organism (usually in the order two digits), giving rise to a highly fragmented assembly. A common activity in these projects is the comparison of the assembly with that of another genome as a form of validation and also to identify common elements between organisms. Although the problem of pairwise alignment of large genomes is well circumvented by existing approaches, multiple alignment of large genomes with highly fragmented assemblies remains a difficult task due to its time and computational requirements. This work consists of a methodology for doing multiple alignment of large eukaryotic genomes with highly fragmented assemblies, a problem that few solutions are able to cope with. Our star alignment-based implementation, was able to accomplish a MSA of groups of assemblies with different levels of fragmentation. The largest of them, a set of 5 reptilian genomes where the B. jararaca assembly (800,000 contigs, N50 of 3.1Kbp) was used as anchor, took 14 hours of execution time to provide a map of conserved regions among the participating species. The algorithm was implemented in a software named FROG (FRagment Overlap multiple Genome alignment), available under the General Public License v3 (GPLv3) terms.
19

Diversité génomique des espèces bactériennes du genre Flavobacterium / Genomic diversity of Flavobacterium species

Barbier, Paul 13 November 2013 (has links)
Les bactéries du genre Flavobacterium sont retrouvées dans des types d’habitats très divers. Ce genre contient trois espèces ichtyopathogènes : columnare, branchiophilum et psychrophilum qui est responsable de pertes économiques importantes pour l’élevage des salmonidés. Un projet de séquençage et de comparaison des génomes de plusieurs flavobactéries pathogènes de poissons ainsi qu’isolées de différents environnements a été mis en place pour améliorer les connaissances sur ce genre. Les objectifs étaient l’identification des déterminants de virulence et la caractérisation de différents marqueurs moléculaires des traits phénotypiques associés à leur mode de vie. L’analyse des génomes de F. psychrophilum a permis de mettre en évidence une diversité des structures chromosomiques au sein de l’espèce et d’identifier des cibles moléculaires prometteuses pour le développement de tests de diagnostic ainsi que des cibles vaccinales potentielles. Le génome de F. branchiophilum a permis d’identifier des mécanismes moléculaires de virulence originaux. Les caractéristiques du génome de F. indicum révèlent un mode de vie environnemental : sa petite taille et ses faibles capacités de dégradation des bio-polymères suggèrent que F. indicum est adapté à une niche écologique restreinte. Ces nouvelles données ont permis de caractériser in silico des marqueurs moléculaires de caractères phénotypiques. En particulier, un groupe de gènes (dnd) rare et responsable d’une modification étonnante de la molécule d’ADN a été décrit pour la première fois chez les Flavobacteriaceae. Ce projet a permis d’enrichir les connaissances sur les bactéries du genre Flavobacterium et a contribué au développement d’outils pour la santé animale. / Flavobacterium species occur in a wide range of habitats. This genus includes three fish-pathogenic species, namely F. columnare, F. branchiophilum and F. psychrophilum. The latter is responsible for serious economic losses for salmonids farming in France and worlwide. A comparative genomics project including several fish-pathogenic flavobacteria as well as various environmental species has been set up in order to improve the knowledge on this poorly studied genus. Our aims were the identification of virulence determinants associated with pathogenicity and the characterization of various molecular elements reflecting phenotypes associated with their life-style. Analysis of the genomes of several F. psychrophilum isolates revealed the diversity of chromosomal structures within the species and identified in silico promising molecular targets for the development of diagnostic tests as well as potential vaccines targets. Analysis of the F. branchiophilum genome enabled to identify particular molecular virulence mechanisms. The features of the F. indicum genome reflected its environmental lifestyle : its small size and its limited bio-polymers degrading abilities suggested that F. indicum is adapted to a quite narrow ecological niche. These new data have allowed the in silico identification of many molecular elements reflecting phenotypic traits. In particular, a rare gene cluster (dnd) responsible for an unusual DNA structure modification was described for the first time within members of the family Flavobacteriaceae. This project enriched the knowledge on Flavobacterium species and contributed to the development of tools for animal health.
20

Population analysis of bacterial pathogens on distinct temporal and spatial scales

McAdam, Paul R. January 2014 (has links)
Bacteria have been the causative agents of major infectious disease pandemics throughout human history. Over the past 4 decades, a combination of changing medical practices, industrialization, and globalisation have led to a number of emergences and re-emergences of bacterial pathogens. The design of rational control programs and bespoke therapies will require an enhanced understanding of the dynamics underpinning the emergence and transmission of pathogenic clones. The recent development of new technologies for sequencing bacterial genomes rapidly and economically has led to a greatly enhanced understanding of the diversity of bacterial populations. This thesis describes the application of whole genome sequencing of 2 bacterial pathogens, Staphylococcus aureus and Legionella pneumophila, in order to understand the dynamics of bacterial infections on different temporal and spatial scales. The first study involves the examination of S. aureus evolution during a chronic infection of a single patient over a period of 26 months revealing differences in antibiotic resistance profiles and virulence factor expression over time. The genetic variation identified correlated with differences in growth rate, haemolytic activity, and antibiotic sensitivity, implying a profound effect on the ecology of S. aureus. Importantly, polymorphisms were identified in global regulators of virulence, with a high frequency of polymorphisms within the SigB locus identified, suggesting this region may be under selection in this patient. The identification of genes under diversifying selection during long-term infection may inform the design of novel therapeutics for the control of refractory chronic infections. Secondly, the emergence and transmission of 3 pandemic lineages derived from S. aureus clonal complex 30 (CC30) were investigated. Independent origins for each pandemic lineage were identified, with striking molecular correlates of hospital- or community-associated pandemics represented by mobile genetic elements, such as bacteriophage and Staphylococcal pathogenicity islands, and non-synonymous mutations affecting antibiotic resistance and virulence. Hospitals in large cities were identified as hubs for the transmission of MRSA to regional health care centres. In addition, comparison of whole genome sequences revealed that at least 3 independent acquisitions of TSST-1 have occurred in CC30, but a single distinct clade of diverse community-associated CC30 strains was responsible for the TSS epidemic of the late 1970s, and for subsequent cases of TSS in the UK and USA. Finally, whole genome sequencing was used as a tool for investigating a recent outbreak of legionellosis in Edinburgh. An unexpectedly high level of genomic diversity was identified among the outbreak strains, with respect to core genome polymorphisms, and accessory genome content. The data indicate that affected individuals may be infected with heterogeneous strains. The findings highlight the complexities in identifying environmental sources and suggest possible differences in pathogenic potential among isolates from a single outbreak. Taken together, the findings demonstrate applications of bacterial genome sequencing leading to enhanced understanding of bacterial pathogen evolution, emergence, and transmission, which may ultimately inform appropriate infection control measures.

Page generated in 0.0758 seconds