Spelling suggestions: "subject:"nextgenrationsequencing"" "subject:"andgeneintronsequencing""
11 |
Heterogeneity in Ewing sarcomaBranford White, Harriet A. January 2014 (has links)
Ewing sarcoma, an aggressive primary bone and soft tissue tumour is characterised by the expression of the chimeric transcription factor EWS-FLI1 in 90% of patients. This alters expression of many genes including activation of the Insulin Growth Factor (IGF) pathway via IGFBP3 supression. Phase I/II trials with an IGF-1 inhibitor have demonstrated tumour regression in a modest number of Ewing sarcoma patients. The aim of this thesis was to identify mechanisms contributing to the heterogeneity of resistance in Ewing sarcoma following inhibition with OSI-906, a dual kinase inhibitor of IGF-1 (IGF-1R) and Insulin (IR) receptors. The hypothesis was that mechanisms of resistance relate to heterogeneity of responses to signalling pathway activation and inhibition. Through selection, disruption of the pathway would identify subpopulations of cells both sensitive and resistant in their response allowing for interrogation of resistance mechanisms. A genome wide approach was taken to model the resistance profile of cell lines. Through developing a method of unbiased quantification, a panel of validated Ewing sarcoma cell lines (EuroBoNet) were imaged and segmented to assess the responses of biomarkers on signalling pathway activation. Heterogeneity was confirmed between cell lines. The application to diagnostic biopsies led to the identification of prognostic classifiers and cellular subpopulations with clinical prognostic significance. The distribution of Ki67 was found to be predictive of survival and cells with lower levels of CD99 in the cytoplasm were most discriminative. Parallel sequencing strategies (RNA-seq, whole exome sequencing, and aCGH/ SNP array) for genome-wide screening was carried out for point mutations, copy number changes and rearrangements. Systematic detection was used to characterise genomic rearrangements and functional validation performed. Resistant clones, formed via ENU mutagenesis of cell lines, were sequenced in order to demonstrate the resistance profile of OSI-906. In summary heterogeneity of Ewing sarcoma at the genomic and proteomic level can influence the signalling dependency of tumours and response to inhibitors. Genomic and proteomic profiling of tumour cells may be relevant to future developments of novel therapies.
|
12 |
High-throughput DNA Sequencingin Microbial Ecology : Methods and ApplicationsHugerth, Luisa January 2016 (has links)
Microorganisms play central roles in planet Earth’s geochemical cycles, in food production, and in health and disease of humans and livestock. In spite of this, most microbial life forms remain unknown and unnamed, their ecological importance and potential technological applications beyond the realm of speculation. This is due both to the magnitude of microbial diversity and to technological limitations. Of the many advances that have enabled microbiology to reach new depth and breadth in the past decade, one of the most important is affordable high-throughput DNA sequencing. This technology plays a central role in each paper in this thesis. Papers I and II are focused on developing methods to survey microbial diversity based on marker gene amplification and sequencing. In Paper I we proposed a computational strategy to design primers with the highest coverage among a given set of sequences and applied it to drastically improve one of the most commonly used primer pairs for ecological surveys of prokaryotes. In Paper II this strategy was applied to an eukaryotic marker gene. Despite their importance in the food chain, eukaryotic microbes are much more seldom surveyed than bacteria. Paper II aimed at making this domain of life more amenable to high-throughput surveys. In Paper III, the primers designed in papers I and II were applied to water samples collected up to twice weekly from 2011 to 2013 at an offshore station in the Baltic proper, the Linnaeus Microbial Observatory. In addition to tracking microbial communities over these three years, we created predictive models for hundreds of microbial populations, based on their co-occurrence with other populations and environmental factors. In paper IV we explored the entire metagenomic diversity in the Linnaeus Microbial Observatory. We used computational tools developed in our group to construct draft genomes of abundant bacteria and archaea and described their phylogeny, seasonal dynamics and potential physiology. We were also able to establish that, rather than being a mixture of genomes from fresh and saline water, the Baltic Sea plankton community is composed of brackish specialists which diverged from other aquatic microorganisms thousands of years before the formation of the Baltic itself. / <p>QC 20150505</p>
|
13 |
Role of UCHL1 in regulating gene expression in prostate cancer cellsIlic, Aleksandar 28 August 2014 (has links)
Ubiquitin C-terminal hydrolase L1 (UCHL1) is a multifunctional protein primarily expressed in neuronal cells and involved in numerous cellular processes. UCHL1 has been linked with neurodegenerative diseases and a wide range of cancers but its specific role remains unknown. Previous UCHL1 knockdown studies have shown that UCHL1 controls the expression of pro- and anti-apoptotic genes as well as genes involved in cell cycle regulation but it is unknown how UCHL1 regulates these genes.
We have shown that UCHL1 is cross-linked to DNA in DU145 but not in LNCaP or PC3 prostate cancer cells. Therefore, we hypothesized that UCHL1 regulates the expression of pro- or anti-apoptotic genes as well as the genes involved in the cell cycle through its interaction with DNA. By utilizing ChIP and ChIP-seq analyses it is possible to determine the UCHL1 target sequences on the genomic DNA.
It was shown that UCHL1 is only expressed in DU145 but not in LNCaP, PC3 or C4-2 prostate cancer cell lines. Additionally, UCHL1 is expressed and cross-linked to DNA in HEK293T cells. It is believed that UCHL1 is silenced by upstream promoter methylation when it is not expressed. However, treatment with the epigenetic drugs 5-aza-2′-deoxycytidine and trichostatin A (TSA) did not result in induction of UCHL1 expression in LNCaP, PC3 or C4-2 prostate cancer cell lines.
UCHL1 is also associated with p53. However, ChIP assay results have shown that UCHL1 and p53 do not bind to genomic DNA of upstream promoter regions CDKN1A and BAX genes. Additionally, through UCHL1 ChIP-seq analyses in DU145 and HEK293T cells, we discovered that UCHL1 co-localizes to the DNA with the shelterin complex shedding light on a new role of UCHL1 that has never been described before. / October 2014
|
14 |
Next-generation nematode genomesKumar, Sujai January 2013 (has links)
The first metazoan to be sequenced was a nematode (Caenorhabditis elegans), and understanding the genome of this model organism has led to many insights about all animals. Although eleven nematode genomes have been published so far and approximately twenty more are under way, the vast majority of the genomes of this incredibly diverse phylum remain unexplored. Next-generation sequencing has made it possible to generate large amounts of genome sequence data in a few days at a fraction of the cost of traditional Sanger-sequencing. However, assembling and annotating these data into genomic resources remains a challenge because of the short reads, the quality issues in these kinds of data, and the presence of contaminants and co-bionts in uncultured samples. In this thesis, I describe the process of creating high quality draft genomes and annotation resources for four nematode species representing three of the five major nematode clades: Caenorhabditis sp. 5, Meloidogyne floridensis, Dirofilaria immitis, and Litomosoides sigmodontis. I describe the new approaches I developed for visualising contamination and co-bionts, and I present the details of the robust workflow I devised to deal with the problems of generating low-cost genomic resources from Illumina short-read sequencing. Results: The draft genome assemblies created using the workflow described in this thesis are comparable to the draft nematode genomes created using Sanger sequencing. Armed with these genomes, I was able to answer two evolutionary genomics questions at very different scales. The first question was whether any non-coding elements were deeply conserved at the level of the whole phylum. Such elements had previously been hypothesised to be responsible for the phylum body plan in vertebrates, insects, and nematodes. I used twenty nematode genomes in several whole-genome alignments and concluded that no such elements were conserved across the whole phylum. The second question addressed the origins of the highly destructive plant-parasitic root-knot nematode Meloidogyne incognita. Comparisons with the newly sequenced Meloidogyne floridensis genome revealed the complex hybrid origins of both species, undermining previous assumptions about the rarity of hybrid speciation in animals. Conclusions: This thesis demonstrates the role of next-generation sequencing in democratising genome sequencing projects. Using the sequencing strategies, workflows, and tools described here, one can rapidly create genomic resources at a very low cost, even for unculturable metazoans. These genomes can be used to understand the evolutionary history of a genus or a phylum, as shown.
|
15 |
A novel whole system integrated genomics approach to identify key genetic components which facilitate synthetic design of a genetically engineered strain of Escherichia coli K12 with enhanced isobutanol toleranceBasu, Piyali January 2016 (has links)
There has been an increased global interest in biofuels which provide a renewable and sustainable alternative to fossil fuels. Isobutanol is an attractive and superior alternative to the currently produced bioethanol possessing several key advantages. Previous work focuses on strategies for metabolic optimisation of carbon utilisation. However, existing solutions reach a stage where the amount of alcohol produced reaches toxic thresholds for bacteria. This inhibits growth and reduces carbohydrate consumption resulting in lower product yields rendering the biofuel production process uneconomical. In this project, a novel strategy has been adopted which uses a whole system integrated genomics approach consisting of expression profiling, selection to create isobutanol-adapted lineages, next generation sequencing, and comparative behavioural genomics to interrogate the system thoroughly and identify critical determinants of resistance to isobutanol. These were used in the highly-defined model species, E. coli K12 to deliver results of the adaptive mechanisms which take place across the entire genome. 41 gene candidates (4 previously identified in literature) were identified to play a role in isobutanol tolerance. These candidates belong to a range of functional groups such as carbohydrate metabolism, oxidative stress response, osmotic stress response; but also identified novel membrane-associated functions such as the Tol-Pal system, BAM complex and colanic acid production. The results also identify critical genes with unknown functions. The results support previous notions that central carbon metabolism shifts from aerobic to anaerobic metabolism in the presence of isobutanol, but also shows there is a transitionary phase where mixed acid fermentation pathways are utilised. This shift was previously thought to be mediated by the ArcA-ArcB two-component system. However, these results suggest the inactive 2Fe-2S core of the anaerobic-regulator Fnr is re-activated by Fe2+ to form the 4Fe-4S core transported by the FeoAB ferrous iron transport system. The strategy also identified the Tol-Pal system and show it is essential to grow in the presence of isobutanol, which is responsible for the maintaining the integrity of the cell envelope structure and increasing the rate of cell division. The BAM complex is responsible for folding and assembly of outer membrane proteins (OMP) and OMP membrane permeability- this system was found to be important for growth in isobutanol, and SurA, which is the primary OMP assembly pathway provided tolerance which was specific to isobutanol. Colanic acid, an extracellular polysaccharide is produced when the cell experiences stress, and provides protection by forming a physical barrier around the cell. The results show that the presence of colanic acid plays a large role in allowing E. coli to grow in presence of isobutanol, and its role becomes essential at critical concentrations. The results also show deletion of the negative regulator of the colanic acid gene cluster improves growth at critical and growth-inhibiting concentrations. When consolidated, these results facilitated knowledge-led based design and subsequently led to the identification of components for a synthetic design schedule, which lists the genetic manipulations proposed to exploit E. coli to enhance isobutanol tolerance.
|
16 |
Exploring next-generation sequencing in chronic lymphocytic leukemiaLjungström, Viktor January 2016 (has links)
Next-generation sequencing (NGS) techniques have led to major breakthroughs in the characterization of the chronic lymphocytic leukemia (CLL) genome with discovery of recurrent mutations of potential prognostic and/or predictive relevance. However, before NGS can be introduced into clinical practice, the precision of the techniques needs to be studied in better detail. Furthermore, much remains unknown about the genetic mechanisms leading to aggressive disease and resistance to treatment. Hence, in Paper I, the technical performance of a targeted deep sequencing panel including 9 genes was evaluated in 188 CLL patients. We were able to validate 143/155 (92%) selected mutations through Sanger sequencing and 77/82 mutations were concordant in a second targeted sequencing run, indicating that the technique can be introduced in clinical practice. In Paper II we screened 18 NF-κB pathway genes in 315 CLL patients through targeted deep sequencing which revealed a recurrent 4 base-pair deletion in the NFKBIE gene. Screening of NFKBIE in 377 additional cases identified the mutation in ~6% of all CLL patients. We demonstrate that the lesion lead to aberrant NF-κB signaling through impaired interaction with p65 and is associated with unfavorable clinical outcome. In Paper III we sought to delineate the genetic lesions that leads to relapse after fludarabine, cyclophosphamide, and rituximab treatment. Through whole-exome sequencing of pre-treatment and relapse samples from 41 cases we found evidence of frequent selection of subclones harboring driver mutations and subsequent clonal evolution following treatment. We also detected mutations in the ribosomal protein RPS15 in 8 cases (19.5%) and characterization of the mutations through functional assays point to impaired p53 regulation in cells with mutated RPS15. Paper IV aimed at characterizing 70 patients assigned to three major subsets (#1, #2, and #4) through whole-genome sequencing. Besides recurrent exonic driver mutations, we report non-coding regions significantly enriched for mutations in subset #1 and #2 that may facilitate future molecular studies. Collectively, this thesis supports the potential of targeted sequencing for mutational screening of CLL in clinical practice, provides novel insight into the pathobiology of aggressive CLL, and demonstrates the clinical outcome and cellular effects of NFKBIE and RPS15 mutations.
|
17 |
The prediction of HLA genotypes from next generation sequencing and genome scan dataFarrell, John J. 22 January 2016 (has links)
Genome-wide association studies have very successfully found highly significant disease associations with single nucleotide polymorphisms (SNP) in the Major Histocompatibility Complex for adverse drug reactions, autoimmune diseases and infectious diseases. However, the extensive linkage disequilibrium in the region has made it difficult to unravel the HLA alleles underlying these diseases. Here I present two methods to comprehensively predict 4-digit HLA types from the two types of experimental genome data widely available.
The Virtual SNP Imputation approach was developed for genome scan data and demonstrated a high precision and recall (96% and 97% respectively) for the prediction of HLA genotypes. A reanalysis of 6 genome-wide association studies using the HLA imputation method identified 18 significant HLA allele associations for 6 autoimmune diseases: 2 in ankylosing spondylitis, 2 in autoimmune thyroid disease, 2 in Crohn's disease, 3 in multiple sclerosis, 2 in psoriasis and 7 in rheumatoid arthritis. The EPIGEN consortium also used the Virtual SNP Imputation approach to detect a novel association of HLA-A*31:01 with adverse reactions to carbamazepine.
For the prediction of HLA genotypes from next generation sequencing data, I developed a novel approach using a naïve Bayes algorithm called HLA-Genotyper. The validation results covered whole genome, whole exome and RNA-Seq experimental designs in the European and Yoruba population samples available from the 1000 Genomes Project. The RNA-Seq data gave the best results with an overall precision and recall near 0.99 for Europeans and 0.98 for the Yoruba population. I then successfully used the method on targeted sequencing data to detect significant associations of idiopathic membranous nephropathy with HLA-DRB1*03:01 and HLA-DQA1*05:01 using the 1000 Genomes European subjects as controls.
Using the results reported here, researchers may now readily unravel the association of HLA alleles with many diseases from genome scans and next generation sequencing experiments without the expensive and laborious HLA typing of thousands of subjects. Both algorithms enable the analysis of diverse populations to help researchers pinpoint HLA loci with biological roles in infection, inflammation, autoimmunity, aging, mental illness and adverse drug reactions.
|
18 |
Multiple displacement amplification and whole genome sequencing for the diagnosis of infectious diseasesAnscombe, C. J. January 2016 (has links)
Next-generation sequencing technologies are revolutionising our ability to characterise and investigate infectious diseases. Utilising the power of high throughput sequencing, this study reports, the development of a sensitive, non-PCR based, unbiased amplification method, which allows the rapid and accurate sequencing of multiple microbial pathogens directly from clinical samples. The method employs Φ29 DNA polymerase, a highly efficient enzyme able to produce strand displacement during the polymerisation process with high fidelity. Problems with DNA secondary structure were overcome and the method optimised to produce sufficient DNA to sequence from a single bacterial cell in two hours. Evidence was also found that the enzyme requires at least six bases of single stranded DNA to initiate replication, and is not capable of amplification from nicks. Φ29 multiple displacement amplification was shown to be suitable for a range of GC contents and bacterial cell wall types as well as for viral pathogens. The method was shown to be able to provide relative quantification of mixed cells, and a method for quantification of viruses using a known standard was developed. To complement the novel molecular biology workflow, a data analysis pipeline was developed to allow pathogen identification and characterisation without prior knowledge of input. The use of de novo assemblies for annotation was shown to be equivalent to the use of polished reference genomes. Single cell Φ29 MDA samples had better assembly and annotation than non-amplification controls, a novel finding which, when combined with the very long DNA fragments produced, has interesting implications for a variety of analytical procedures. A sampling process was developed to allow isolation and amplification of pathogens directly from clinical samples, with good concordance shown between this method and traditional testing. The process was tested on a variety of modelled and real clinical samples showing good application to sterile site infections, particularly bacteraemia models. Within these samples multiple bacterial, viral and parasitic pathogens were identified, showing good application across multiple infection types. Emerging pathogens were identified including Onchocerca volvulus within a CSF sample, and Sneathia sanguinegens within an STI sample. Use of Φ29 MDA allows rapid and accurate amplification of whole pathogen genomes. When this is coupled with the sample processing developed here it is possible to detect the presence of pathogens in sterile sites with a sensitivity of a single genome copy.
|
19 |
Avaliação dos mecanismos adquiridos de resistência a antimicrobianos em enterobactérias produtoras de carbapenemases por sequenciamento de nova geraçãoNodari, Carolina Silva January 2016 (has links)
O objetivo deste trabalho foi caracterizar os mecanismos adquiridos de resistência de isolados de enterobactérias produtoras de carbapenemases utilizando a tecnologia de sequenciamento de nova geração. Foram incluídos no estudo quatro isolados – três Escherichia coli e uma Serratia marcescens – produtores de diferentes carbapenemases – OXA-370, KPC-2, NDM-1 e GES-5, respectivamente, obtidos a partir de um estudo de vigilância para detecção de carbapenemases. O DNA total dos isolados foi extraído utilizando kits comerciais e submetido à fragmentação enzimática para a obtenção de bibliotecas genômicas de aproximadamente 300 pares de bases. Após a preparação das bibliotecas, elas foram carregadas em chips 316 v2 para a plataforma Ion Torrent PGM e submetidas ao sequenciamento, utilizando um programa de 850 flows. Para cada genoma, foram obtidos aproximadamente um milhão de reads, os quais foram submetidos ao processo de montagem para a obtenção de genomas de aproximadamente 5Mb com uma cobertura de, em média, 175 vezes. As sequências obtidas foram submetidas à anotação utilizando o sistema RAST e a ferramenta online ResFinder. Sequências de inserção foram pesquisadas utilizando a ferramenta ISFinder. Além das carbapenemases, genes que codificam para outras β-lactamases (blaTEM-1, blaCTX-M-2 blaCTX-M-8, blaOXA-1, blaOXA-2) foram encontrados em todos os genomas. AMEs (aadA1, aph(3’)-la, aac(3)-IIa, strA, strB, aac(6’)Ib-cr, aac(6’)-Ib and aac(6`)-Ic), bem como genes que codificam resistência às sulfonamidas e ao trimetoprim também foram comuns aos isolados avaliados. Os seguintes ambientes genéticos foram observados: blaOXA-370 é flanqueada pela IS5075-like, blaKPC-2 está inserida no transposon Tn4401, e blaNDM-1, no transposon Tn3000. Cada isolado de E. coli pertenceu a um sequence type (ST) distinto: 1099F pertence à ST617, 1326F, à ST648, e 2610F, à ST707. Nossos resultados indicaram a diversidade de genes de resistência que podem ser encontrados em um isolado clínico, ressaltaram a variedade de contextos genéticos em que as carbapenemases podem estar inseridas e demonstraram que o sequenciamento de nova geração pode ser utilizado como uma ferramenta para a caracterização de isolados bacterianos multirresistentes, auxiliando, entre outros aspectos, na tipagem e na identificação de determinantes de resistência. / The aim of this study was to characterize the resistome of carbapenemase-producing Enterobacteriaceae using a next-generation sequencing platform. Four isolates were included in this study – three Escherichia coli and one Serratia marcescens – producing different carbapenemases – OXA-370, KPC-2, NDM-1 and GES-5, respectively, obtained from a surveillance study for carbapenemase detection. Total DNA was extracted using commercially available kits and submitted to enzymatic fragmentation to obtain libraries of around 300 base pairs of each isolate. After library preparation, they were loaded in 316 v2 chips for Ion Torrent PGM platform, and sequencing was performed using an 850 flows program. For each genome, approximately a million reads were obtained, and they were further assembled. The genome length for each isolate was of around 5Mb, with a mean coverage of 175x. The RAST system and the online tool ResFinder were used for annotation, as well as ISFinder. Besides the carbapenemases, genes encoding for other β-lactamases (blaTEM-1, blaCTX-M-2 blaCTX-M-8, blaOXA-1, blaOXA-2) were found in all genomes. AMEs (aadA1, aph(3’)-la, aac(3)-IIa, strA, strB, aac(6’)Ib-cr, aac(6’)-Ib and aac(6`)-Ic) were also detected in every isolate included in the study, as well as genes encoding for resistance determinants to sulfonamides and trimetoprim. The genetic environment of the carbapenemases was very similar to other isolates described in the literature – blaOXA-370 is flanked by IS5075-like, blaKPC-2 is inserted in transposon Tn4401, and blaNDM-1 was found in transposon Tn3000. Each isolate belonged to a distinct ST – 1099F is part of ST617, 1326F belonged to ST648 and 2610F, to ST707. Our results demonstrated the diversity of resistance determinants that can be found in a clinical isolate, highlighted the variability of genetic environments in which carbapenemases can be present and demonstrated that next-generation sequencing is a valuable tool for the characterization of multidrug-resistant isolates, and can provide information regarding the molecular typing and the identification of resistance determinants.
|
20 |
Computational strategies to investigate the genetic cause of human eye diseaseGoar, Wesley Andrew 01 May 2019 (has links)
It is estimated that 4000 genetic diseases/syndromes affect humans with one third of these diseases involving the eye. Many eye disorders, such as age-related macular degeneration that affects an estimated 170 million elderly adults worldwide, are associated with genetic variants. Since the conception of the human genome project we have learned a great deal about the genetic make-up of the human race and have identified over ~20,000 genes. Over 270 of these genes have been implicated in retinal diseases alone with many more genes involved in other forms of ocular disease. Though we have made a great deal of progress in understanding the genetics of eye disease, there remain many eye diseases with significant evidence of genetic components for which a disease-causing gene has not been identified.
In my thesis research, I utilized computational tools and strategies to analyze microarrays and whole-exome sequencing to investigate the genetic causes of three different eye diseases. First, I utilized a combination of familial analyses and whole-exome sequencing to study the genetic cause of Keratoconus, a progressive cornea abnormality that can lead to distorted vision and light sensitivity. Second, I analyzed three different cohorts of patients with Bardet-Biedl syndrome (BBS), a syndromic retinopathy leading to blindness, using whole-exome sequencing to identify both known and novel genetic causes of BBS. Finally, I performed the largest whole-exome sequencing study at the time for Pigment Dispersion Syndrome (PDS), a disorder associated with glaucoma, and identified variants within previously established candidate genes and a novel candidate gene that is now the subject of further scientific investigation.
By using computational tools and strategies in tandem with high-quality bench research performed by fellow lab members, we have identified both candidate and known eye disease-causing genes/mutations and furthered the goal to cure blindness.
|
Page generated in 0.1248 seconds