Spelling suggestions: "subject:"[een] GENOMICS"" "subject:"[enn] GENOMICS""
191 |
Estudo do desequilíbrio de ligação e estimativa do tamanho efetivo em uma população da raça gir selecionada para crescimento pós-desmama /Toro Ospina, Alejandra Maria. January 2017 (has links)
Orientador: Josineudson Augusto II de Vasconcelos Silva / Banca: Lenira El faro Zadra / Banca: Rusbel Raul Aspilcueta Borquis / Resumo: O objetivo deste estudo foi estimar o desequilíbrio de ligação (r2) nas distâncias de 25-50kb, 50-100kb, 100-500kb, 0,5-1Mb e o tamanho efetivo (Ne) nas gerações 0, 5, 10, 15, 20 em população da raça Gir selecionada para crescimento pós-desmama. Os animais utilizados no presente estudo foram provenientes do rebanho fechado do Instituto de Zootecnia, Sertãozinho, SP. Foram obtidos os genótipos de 155 animais com o painel BovineDL 33kb e 18 com painel HD imputado onde realizou-se controle de qualidade (CQ) para alelo de menor frequência (MAF) < 0,02 e call rate < 0,1. Depois do CQ permaneceram 27.236 SNPs e 155 animais do painel de 33 kb e 732.962 SNPs e 173 animais do painel HD Imputado. As análises de r2 foram realizadas pelo programa Plink e programa estatístico R Studio e o Ne por meio do DL. Os resultados das distâncias 25-50kb, 50-100kb, 100-500kb e 0,5-1Mb do r2 para o painel 33kb foram iguais a 0,29, 0,25, 0,16 e 0,032 respectivamente, e 0,35, 0,29, 0,18, 0,032 para o painel HD imputado demostrando que o DL permaneceu nas distâncias menores a 100kb, decaindo com o aumento das distâncias. Estes resultados foram maiores aos descritos na literatura para animais zebuínos, sugerindo como causa os segmentos longos de haplótipos que compartilham os animais aparentados. O Ne foi igual a 9, 17, 24, 30 e 30 animais nas gerações 0, 5, 10, 15, 20, observa-se que o Ne é maior na geração 20, com 30 animais, e decai drasticamente a partir da 5 geração com 17 animais, e sendo de 9 an... (Resumo completo, clicar acesso eletrônico abaixo) / Abstract: The aim of this study was to estimate the linkage disequilibrium (r2) at distances of 25-50kb, 50-100kb, 100-500kb, 0,5-1Mb and the effective population size (Ne) in generations 0, 5, 10, 15, 20 in population of the selected Gir for yearling growth. The animals used in this study were from the closed herd Animal Science Institute, Sertãozinho, SP. the genotypes of 155 animals were obtained with BovineDL 33kb and 18 animals of panel HD, where quality control was held (QC) for minor allele frequency (MAF) <0.02 and call rate <0.1. After QC remained 27,236 SNPs and 155 animals to panel 33 kb, 732.962 SNPs and 173 the panel HD imputation. The r2 analyzes were performed by Plink program and R Studio statistical program and Ne through the DL. The results of r2 for distances 25-50kb, 50-100kb, 100-500kb and 0,5-1Mb were equal to 0.29, 0.25, 0.16 and 0.032, respectively, showing that the DL remained in smaller distances 100kb, decreasing with increasing distances. These results were higher than those reported in the literature for Zebu animals, suggesting a cause to long haplotype segments that share the related animals. Ne is equal to 9, 17, 24, 30 and 30 in the generations 0, 5, 10, 15, 20, it is observed that Ne is higher in generation 20 with 30 animals and decays sharply from 5 Generation 17 animals, and with 9 animals the latest generation, small size for a population. The values found in this study to DL and Ne, explain the selection system and the structure of the populatio... (Complete abstract click electronic access below) / Mestre
|
192 |
Bioinformatic analysis of viral genomic sequences and concepts of genome-specific national vaccine designUnknown Date (has links)
This research is concerned with analyzing a set of viral genomes to elucidate the underlying characteristics and determine the information-theoretic aspects of the genomic signatures. The goal of this study thereof, is tailored to address the following: (i) Reviewing various methods available to deduce the features and characteristics of genomic sequences of organisms in general, and particularly focusing on the genomes pertinent to viruses; (ii) applying the concepts of information-theoretics (entropy principles) to analyze genomic sequences; (iii) envisaging various aspects of biothermodynamic energetics so as to determine the framework and architecture that decide the stability and patterns of the subsequences in a genome; (iv) evaluating the genomic details using spectral-domain techniques; (v) studying fuzzy considerations to ascertain the overlapping details in genomic sequences; (vi) determining the common subsequences among various strains of a virus by logistically regressing the data obtained via entropic, energetics and spectral-domain exercises; (vii) differentiating informational profiles of coding and non-coding regions in a DNA sequence to locate aberrant (cryptic) attributes evolved as a result of mutational changes and (viii) finding the signatures of CDS of genomes of viral strains toward rationally conceiving plausible designs of vaccines. Commensurate with the topics indicated above, necessary simulations are proposed and computational exercises are performed (with MatLabTM R2009b and other software as needed). Extensive data gathered from open-literature are used thereof and, simulation results are verified. Lastly, results are discussed, inferences are made and open-questions are identified for future research. / by Sharmistha P. Chatterjee. / Thesis (Ph.D.)--Florida Atlantic University, 2013. / Includes bibliography. / Mode of access: World Wide Web. / System requirements: Adobe Reader.
|
193 |
The genomics of acute myeloid leukaemia : an investigation into the molecular pathogenesis of acute myeloid leukaemia with t(8;21)Mannari, Deepak January 2012 (has links)
Acute myeloid leukaemia is a clonal disorder characterised by recurrent chromosomal translocations. One of the commonest, is the t(8;21) which results in part of the AML1 gene being juxtaposed to most of the ETO gene with the resultant chimeric protein, AML1-ETO, acting predominantly as a transcriptional repressor. Despite the extensive literature available, the exact mechanism by which the chimeric protein results in AML has not been fully elucidated. By using exon arrays and high throughput sequencing as tools it was hoped to gain further insights into the molecular basis of this disease. Gene expression profiling using the exon arrays highlighted molecular pathways and specific genes that play a key role in the pathogenesis in t(8;21). Exon arrays were also used to profile individual exon expression of the ETO gene. This demonstrated that the genomic breakpoint of ETO in the t(8;21) is variable between different patients. This technique also resulted in the discovery of a new exon in the ETO gene. This novel exon results in formation of alternative transcripts of AML1-ETO and was shown in mouse models to play a key role in leukaemogenesis. Chromatin immunoprecipitation followed by high throughput sequencing revealed novel aspects of AML1-ETO binding. A number of novel genes that bind AML1-ETO were recognized and in conjunction with the expression data, a number of hypothesis on how AML1-ETO binding effects gene expression are made.
|
194 |
Genome-wide analyses using bead-based microarraysDunning, Mark J. January 2008 (has links)
Microarrays are now an established tool for biological research and have a wide range of applications. In this thesis I investigate the BeadArray microarray technology developed by Illumina. The design of this technology is unique and gives rise to many computational and statistical challenges. However, I show how knowledge from other microarray technologies can be used to our advantage. I describe the beadarray software package, which is now used by researchers around the world. The development of this software was motivated by the fact that Illumina's software (BeadStudio) gives a summarised view of Illumina data and does not gives users any control over several processing steps that were found to be crucial for other microarray technologies. A main feature of beadarray is the ability to access raw data. The advantages of such data include the ability to perform more detailed quality assessment and greater control over the analysis at all stages. The analysis of a control experiment shows that the processing steps used in BeadStudio can be improved. In particular, utilising variances calculated from the raw data can increase the ability to detect genes which have different expression levels between samples, a common goal for microarray studies. The data from the control experiment are made available for other researchers to use and validate their own analysis methods. One issue discovered during the analysis of the control experiment was that only half of the intended genes could be reliably measured due to problems in the design of the probes targetting particular genes. By considering a large set of publicly available Illumina arrays, I show how such unreliable measurements can affect the analysis of Illumina data. I also show how potential problems can be identified in advance of an experiment and incorporated into an analysis pipeline.
|
195 |
Unearthing the genome of the earthworm Lumbricus rubellusElsworth, Benjamin Lloyd January 2013 (has links)
The earthworm has long been of interest to biologists, most notably Charles Darwin, who was the first to reveal their true role as eco-engineers of the soil. However, to fully understand an animal one needs to combine observational data with the fundamental building blocks of life, DNA. For many years, sequencing a genome was an incredibly costly and time-consuming process. Recent advances in sequencing technology have led to high quality, high throughput data being available at low cost. Although this provides large amounts of sequence data, the bioinformatics knowledge required to assemble and annotate these new data are still in their infancy. This bottleneck is slowly opening up, and with it come the first glimpses into the new and exciting biology of many new species. This thesis provides the first high quality draft genome assembly and annotation of an earthworm, Lumbricus rubellus. The assembly process and resulting data highlight the complexity of assembling a eukaryotic genome using short read data. To improve assembly, a novel approach was created utilising transcripts to scaffold the genome (https://github.com/elswob/SCUBAT). The annotation of the assembly provides the draft of the complete proteome, which is also supported by the first RNA-Seq generated transcriptome. These annotations have enabled detailed analysis of the protein coding genes including comparative analysis with two other annelids (a leech and a polychaete worm) and a symbiont (Verminephrobacter). This analysis identified four key areas which appear to be either highly enhanced or unique to L. rubellus. Three of these may be related to the unique environment from which the sequenced worms originated and add to the mounting evidence for the use of earthworms as bioindicators of soil quality. All data is stored in relational databases and available to search and browse via a website at www.earthworms.org. It is hoped that this genome will provide a springboard for many future investigations into the earthworm and continue research into this wonderful animal.
|
196 |
Visualising Plasmodium falciparum functional genomic data in MaGnET : malaria genome exploration toolSharman, Joanna Louise January 2009 (has links)
Malaria affects the lives of 500 million people around the world each year. The disease is caused by protozoan parasites of the genus Plasmodium, whose ability to evade the immune system and quickly evolve resistance to drugs poses a major challenge for disease control. The results of several Plasmodium genome sequencing projects have revealed how little is known about the function of their genes (over half of the approximately 5400 genes in Plasmodium falciparum, the most deadly human parasite, are annotated as hypothetical ). Recently, several large-scale studies have attempted to shed light on the processes in which genes are involved; for example, the use of DNA microarrays to profile the parasite s gene expression. With the emergence of varied types of functional genomic data comes a need for effective tools that allow biologists (and bioinformaticians) to explore these data. The goal of exploration/browsing-style analyses will typically be to derive clues towards the function of thus far uncharacterised gene products, and to formulate experimentally testable hypotheses. Graphic interfaces to individual data sets are obviously beneficial in this endeavour. However, effective visual data exploration requires also that interfaces to different functional genomic data are integrated and that the user can carry forward a selected group of genes (not merely one at a time) across a variety of data sets. Non-expert users especially benefit from workbenchlike tools offering access to the data in this way. Still, only very few of the contemporary publicly available software have implemented such functionality. This work introduces a novel software tool for the integrated visualisation of functional genomic data relating to P. falciparum: the Malaria Genome Exploration Tool (MaGnET). MaGnET consists of a light-weight Java program for effective visualisation linked to a MySQL database for data storage. In order to maximise accessibility, the program is publicly available over the World Wide Web (http://www.malariagenomeexplorer.org/). MaGnET incorporates a Genome Viewer for visualising the location of genomic features, a Protein-Protein Interaction Viewer for visualising networks of experimentally determined interactions and an Expression Data Viewer for displaying mRNA and protein expression data. Complex database queries can easily be constructed in the Data Analysis Viewer. An advantage over most other tools is that all sections are fully integrated, allowing users to carry selected groups of genes across different datasets. Furthermore, MaGnET provides useful advanced visualisation features, including mapping of expression data onto genomic location or protein-protein interaction network. The inclusion of available third-party Java software has expanded the visualisation capability of MaGnET; for example, the Jmol viewer has been incorporated for viewing 3-D protein structures. An effort has been made to only include data in MaGnET that is at least of reasonable quality. The MaGnET database collates experimental data from various public Plasmodium resources (e.g. PlasmoDB) and from published functional genomic studies, such as DNA microarrays. In addition, through careful filtering and labelling we have been able to include some predicted annotation that has not been experimentally confirmed, such as Gene Ontology and InterPro functional assignments and modelled protein structures. The application of MaGnET to malaria biology is demonstrated through a series of small studies. Initial examples show how MaGnET can be used to effectively demonstrate results from previously published analyses. This is followed up by using MaGnET to make a set of predictions about the possible functions of selected uncharacterised genes and suggesting follow-up experiments.
|
197 |
Comparative and functional analysis of alternative splicing in eukaryotic genomesChen, Lu January 2012 (has links)
Alternative splicing (AS) is a common post-transcriptional process in eukaryotic organisms, by which multiple distinct functional transcripts are produced from a single gene. Because of its potential role in expanding transcript diversity, interest in alternative splicing has been increasing over the last decade, ever since the release of the human genome draft showed it contained little more than the number of genes of a worm. Although recent studies have shown that 94% human multi-exon genes undergo AS while aberrant AS may cause disease or cancer, evolution of AS in eukaryotic genomes remains largely unexplored mainly due to the lack of comparable AS estimates. In this thesis I built a Eukaryote Comprehensive & Comparable Alternative Splicing Events Database (ECCASED) based on the analyses of over 30 million Expressed Sequence Tag (ESTs) for 114 eukaryotic genomes, including protists (22), plants (20), fungi (23), metazoan (non-vertebrates, 29) and vertebrates (20). Using this database, I addressed two main questions: 1) How does alternative splicing relate to gene duplication (GD) as an alternative mechanism to increase transcript diversity? and 2) What is the contribution of alternative splicing to eukaryote transcript diversity? I found that the previous “interchangeable model” of AS and gene duplication is a by-product of an existing relation between gene expression breadth, AS and gene family size. I also show that alternative splicing has played a key role in the expansion of transcript diversity and that this expansion is the best predictor reported to date of organisms complexity assayed as number of cell types. In addition, by comparing alternative splicing patterns in cancer and normal transcript libraries I found that cancer derived transcript libraries have increased levels of “noisy splicing”.
|
198 |
Characterizing the Huntington's disease, Parkinson's disease, and pan-neurodegenerative gene expression signature with RNA sequencingLabadorf, Adam 12 August 2016 (has links)
Huntington's disease (HD) and Parkinson's disease (PD) are devastating neurodegenerative disorders that are characterized pathologically by degeneration of neurons in the brain and clinically by loss of motor function and cognitive decline in mid to late life. The cause of neuronal degeneration in these diseases is unclear, but both are histologically marked by aggregation of specific proteins in specific brain regions. In HD, fragments of a mutant Huntingtin protein aggregate and cause medium spiny interneurons of the striatum to degenerate. In contrast, PD brains exhibit aggregation of toxic fragments of the alpha synuclein protein throughout the central nervous system and trigger degeneration of dopaminergic neurons in the substantia nigra. Considering the commonalities and differences between these diseases, identifying common biological patterns across HD and PD as well as signatures unique to each may provide significant insight into the molecular mechanisms underlying neurodegeneration as a general process. State-of-the-art high-throughput sequencing technology allows for unbiased, whole genome quantification of RNA molecules within a biological sample that can be used to assess the level of activity, or expression, of thousands of genes simultaneously. In this thesis, I present three studies characterizing the RNA expression profiles of post-mortem HD and PD subjects using high-throughput mRNA sequencing data sets. The first study describes an analysis of differential expression between HD individuals and neurologically normal controls that indicates a widespread increase in immune, neuroinflammatory, and developmental gene expression. The second study expands upon the first study by making methodological improvements and extends the differential expression analysis to include PD subjects, with the goal of comparing and contrasting HD and PD gene expression profiles. This study was designed to identify common mechanisms underlying the neurodegenerative phenotype, transcending those of each unique disease, and has revealed specific biological processes, in particular those related to NFkB inflammation, common to HD and PD. The last study describes a novel methodology for combining mRNA and miRNA expression that seeks to identify associations between mRNA-miRNA modules and continuous clinical variables of interest, including CAG repeat length and clinical age of onset in HD.
|
199 |
Precision medicine in oncology: a complicated idea needs a simple solutionBenson, Adam 17 June 2016 (has links)
Cancer therapy has historically been determined by a tumor’s tissue of origin. Now, thanks to advances in genomics technology, scientists are looking further into one’s cancer; into the very genome that drives the tumor growth.
The growth of genomics in cancer research has been astronomical. In a little over ten years since the completion of the Human Genome Project, genomic profiling technologies have developed into an incredibly powerful, relatively cheap, and immensely underutilized tool for oncologists.
In the midst of the advances in cancer profiling, there has been reluctance from oncologists to incorporate genomic profiling into their treatment decisions. Saddled by outdated clinical trial designs, and cancer drug regulation programs, the true measure of the clinical utility of genomic profiling has yet to be seen. Cancer scientists will continue to profile cancers at a pace well beyond the limits of the field of oncology. Without coordinated efforts to update the oncology healthcare system, compendia of data will continue to be generated with limited ability to translate the information into personalized medicines.
There are significant barriers to overcome before genomic data can universally be incorporated into the daily practice of cancer medicine. In the meantime, resources are available for physicians to help begin the process of integrating a more personalized approach to cancer therapy. Third-party bioinformatics companies are in the best position to be the agents of this change. As cancer research continues to adopt a genomic approach, it is paramount that, for the sake of millions of cancer patients, the healthcare system adapts in a way to best utilize this new information.
|
200 |
The relative importance of human and animal sources of vancomycin-resistant Enterococcus faecium in immunocompromised patients in hospitalGouliouris, Theodore January 2019 (has links)
Enterococcus faecium is a leading cause of hospital-acquired infection, disproportionally affecting immunocompromised and critically ill patients. Despite infection control measures, rates of vancomycin-resistant E. faecium (VREfm) bacteraemias have failed to decline in the United Kingdom, and Cambridge University Hospitals (CUH) report the highest numbers nationally. The aims of my PhD were to use epidemiological and genomic surveillance data to establish risk factors for acquisition and infection with E. faecium in patients at CUH, and to use a One Health approach to consider possible sources for hospital patients by relating bloodstream-associated isolates with those cultured from livestock and the environment in the same geographic region. A retrospective matched nested case control study was performed to determine risk factors for VRE bacteraemia relating to antibiotic exposure. 235 cases were matched to 220 controls for length of admission, year, specialty and ward type. Multivariable analysis demonstrated that duration of exposure to parenteral vancomycin, fluoroquinolones and meropenem were independently associated with VRE bacteraemia. This provides evidence for the importance of antimicrobial stewardship targeting high-risk antibiotics in patients at risk of VRE bacteraemia. VREfm bacteraemia may be complicated by disease recurrence. Whole genome sequencing was used to distinguish between relapse and reinfection in 14 episodes of recurrent VREfm bacteraemia. This demonstrated that 10 (71%) episodes were due to reinfection with a new strain, with reinfection being more likely with increasing time between two positive cultures. This study also evaluated 9 patients with blood cultures positive for both VREfm and vancomycin-susceptible E. faecium (VSEfm), the majority (78%) of which were found to be unrelated strains. More than half of all study isolates from these two patient groups were closely related to another isolate causing bacteraemia at CUH, suggesting that hospital acquisition of VREfm is a driver for infection and recurrence. A cross-sectional study of E. faecium in raw and treated wastewater from 20 municipal water treatment plants across the East of England revealed widespread dissemination of healthcare-associated lineages of VREfm in all sampled locations including rural areas, and environmental release in treated wastewater in 17/20 locations. Wastewater isolates were genetically intermixed with isolates causing bacteraemia at CUH, including highly related isolates indicating recent transmission between the two reservoirs. These findings are consistent with widespread distribution of healthcare-associated VREfm in community populations. A One Health approach incorporating sampling from livestock (10 pork, 10 cattle, 9 poultry farms) detected no VREfm in animals whilst 2 independent meat surveys demonstrated VREfm in 1-2% of uncooked products. Genomic comparison of >1400 E. faecium isolates from livestock, meat, wastewater and almost 800 people with bloodstream infection demonstrated that livestock and human isolates were genetically distinct. Analysis of the accessory genome added further evidence for distinct gene content associated with niche adaptation. An analysis of mobile genes encoding antibiotic resistance revealed limited evidence of sharing between human and animal populations. A prospective longitudinal study in haematology patients at CUH over 6 months revealed high rates of VREfm carriage (63% of cases) and environmental contamination (49% of samples). Genomic analysis elucidated complex colonisation dynamics with frequent loss and acquisition of subtypes, including unsuspected acquisition of new VREfm subtypes in patients already colonised with VREfm, and multiple transmission chains involving patients and the environment, including some leading to bacteraemia. These findings highlight the shortcomings of infection control and environmental cleaning and provide the basis for revised interventions.
|
Page generated in 0.0459 seconds