1 |
Technologies for Single Cell Genome AnalysisBorgström, Erik January 2016 (has links)
During the last decade high throughput DNA sequencing of single cells has evolved from an idea to one of the most high profile fields of research. Much of this development has been possible due to the dramatic reduction in costs for massively parallel sequencing. The four papers included in this thesis describe or evaluate technological advancements for high throughput DNA sequencing of single cells and single molecules. As the sequencing technologies improve, more samples are analyzed in parallel. In paper 1, an automated procedure for preparation of samples prior to massively parallel sequencing is presented. The method has been applied to several projects and further development by others has enabled even higher sample throughputs. Amplification of single cell genomes is a prerequisite for sequence analysis. Paper 2 evaluates four commercially available kits for whole genome amplification of single cells. The results show that coverage of the genome differs significantly among the protocols and as expected this has impact on the downstream analysis. In Paper 3, single cell genotyping by exome sequencing is used to confirm the presence of fat cells derived from donated bone marrow within the recipients’ fat tissue. Close to hundred single cells were exome sequenced and a subset was validated by whole genome sequencing. In the last paper, a new method for phasing (i.e. determining the physical connection of variant alleles) is presented. The method barcodes amplicons from single molecules in emulsion droplets. The barcodes can then be used to determine which variants were present on the same original DNA molecule. The method is applied to two variable regions in the bacterial 16S gene in a metagenomic sample. Thus, two of the papers (1 and 4) present development of new methods for increasing the throughput and information content of data from massively parallel sequencing. Paper 2 evaluates and compares currently available methods and in paper 3, a biological question is answered using some of these tools. / <p>QC 20160127</p>
|
2 |
Whole Genome Amplification von Plasma-DNA und Entwicklung eines Ausschlusskriteriums zur Verbesserung der Genotypisierungsqualität / Sample selection algorithm to improve quality of genotyping from plasma-derived DNA: to separate the wheat from the chaff.Schoenborn, Veit January 2008 (has links) (PDF)
Plasma- und Serumproben waren in früheren epidemiologischen Studien häufig das einzige biologische Material, das gesammelt und untersucht wurde. Diese Studien besitzen gerade durch ihren sehr langen Untersuchungszeitraum einen riesigen Informationsgehalt und wären ein unbezahlbarer Schatz für genetische Analysen. Oft ist aufgrund damals mangelnder Akquirierung jedoch keine genomische DNA verfügbar. Um die in Plasmaproben in geringer Menge vorkommende DNA verwenden zu können, extrahierten wir die DNA mit Hilfe von magnetischen Partikeln und setzten sie in eine Whole Genome Amplification (WGA) mittels Φ29-DNA-Polymerase ein. Wir stellten 88 Probenpärchen, bestehend aus einer WGA-Plasma-DNA und der korrespondierenden Vollblut-DNA derselben Person, zusammen und genotypisierten bei diesen neun hochpolymorphe Short Tandem Repeats (STR) und 25 SNPs. Die durchschnittliche innerhalb der Probenpaare auftretende Diskordanzrate betrug 3,8% für SNPs sowie 15,9% für STRs. Basierend auf den Ergebnissen der Hälfte der Probenpaare entwickelten wir einen Ausschlussalgorithmus und validierten diesen in der anderen Hälfte der Probenpaare. Mit diesem ist es möglich, zum Einen diejenigen Proben mit einer guten DNA-Qualität herauszufiltern, um Genotypisierungsfehler zu vermeiden, und zum Anderen jene Proben mit insuffizienter DNA-Qualität auszuschließen. Nachdem Proben, die fünf oder mehr homozygote Loci in dem 9-STR-Markerset aufwiesen, ausgeschlossen wurden, resultierte dies in einer Ausschlussrate von 22,7% und senkte die durchschnittliche Diskordanzrate auf 3,92% für STRs bzw. 0,63% für SNPs. Bei SNPs entspricht dieser Wert ungefähr der Fehlerquote, wie er auch bei Genotypisierungen mit Vollblut-DNA in vielen Laboratorien auftritt. Unsere Methode und das Ausschlusskriterium bieten damit neue Möglichkeiten, um zuverlässige DNA aus archivierten Plasmaproben wiederzugewinnen. Dieser Algorithmus ist auch besser geeignet, als nur die eingesetzte DNA-Menge in die WGA-Reaktion als Kriterium zu benützen. / Plasma and serum samples were often the only biological material collected for earlier epidemiological studies. These studies have a huge informative content, especially due to their long follow-up and would be an invaluable treasure for genetic investigations. However, often no banked DNA is available. To use the small amounts of DNA present in plasma, in a first step, we applied magnetic bead technology to extract this DNA, followed by a whole-genome amplification (WGA) using phi29-polymerase. We assembled 88 sample pairs, each consisting of WGA plasma DNA and the corresponding whole-blood DNA. We genotyped nine highly polymorphic short tandem repeats (STRs) and 23 SNPs in both DNA sources. The average within-pair discordance was 3.8% for SNPs and 15.9% for STR genotypes, respectively. We developed an algorithm based on one-half of the sample pairs and validated on the other one-half to identify the samples with high WGA plasma DNA quality to assure low genotyping error and to exclude plasma DNA samples with insufficient quality: excluding samples showing homozygosity at five or more of the nine STR loci yielded exclusion of 22.7% of all samples and decreased average discordance for STR and SNP markers to 3.92% and 0.63%, respectively. For SNPs, this is very close to the error observed for genomic DNA in many laboratories. Our workflow and sample selection algorithm offers new opportunities to recover reliable DNA from stored plasma material. This algorithm is superior to testing the amount of input DNA.
|
3 |
Massively parallel analysis of cells and nucleic acidsSandberg, Julia January 2011 (has links)
Recent proceedings in biotechnology have enabled completely new avenues in life science research to be explored. By allowing increased parallelization an ever-increasing complexity of cell samples or experiments can be investigated in shorter time and at a lower cost. This facilitates for example large-scale efforts to study cell heterogeneity at the single cell level, by analyzing cells in parallel that also can include global genomic analyses. The work presented in this thesis focuses on massively parallel analysis of cells or nucleic acid samples, demonstrating technology developments in the field as well as use of the technology in life sciences. In stem cell research issues such as cell morphology, cell differentiation and effects of reprogramming factors are frequently studied, and to obtain information on cell heterogeneity these experiments are preferably carried out on single cells. In paper I we used a high-density microwell device in silicon and glass for culturing and screening of stem cells. Maintained pluripotency in stem cells from human and mouse was demonstrated in a screening assay by antibody staining and the chip was furthermore used for studying neural differentiation. The chip format allows for low sample volumes and rapid high-throughput analysis of single cells, and is compatible with Fluorescence Activated Cell Sorting (FACS) for precise cell selection. Massively parallel DNA sequencing is revolutionizing genomics research throughout the life sciences by constantly producing increasing amounts of data from one sequencing run. However, the reagent costs and labor requirements in current massively parallel sequencing protocols are still substantial. In paper II-IV we have focused on flow-sorting techniques for improved sample preparation in bead-based massive sequencing platforms, with the aim of increasing the amount of quality data output, as demonstrated on the Roche/454 platform. In paper II we demonstrate a rapid alternative to the existing shotgun sample titration protocol and also use flow-sorting to enrich for beads that carry amplified template DNA after emulsion PCR, thus obtaining pure samples and with no downstream sacrifice of DNA sequencing quality. This should be seen in comparison to the standard 454-enrichment protocol, which gives rise to varying degrees of sample purity, thus affecting the sequence data output of the sequencing run. Massively parallel sequencing is also useful for deep sequencing of specific PCR-amplified targets in parallel. However, unspecific product formation is a common problem in amplicon sequencing and since these shorter products may be difficult to fully remove by standard procedures such as gel purification, and their presence inevitably reduces the number of target sequence reads that can be obtained in each sequencing run. In paper III a gene-specific fluorescent probe was used for target-specific FACS enrichment to specifically enrich for beads with an amplified target gene on the surface. Through this procedure a nearly three-fold increase in fraction of informative sequences was obtained and with no sequence bias introduced. Barcode labeling of different DNA libraries prior to pooling and emulsion PCR is standard procedure to maximize the number of experiments that can be run in one sequencing lane, while also decreasing the impact of technical noise. However, variation between libraries in quality and GC content affects amplification efficiency, which may result in biased fractions of the different libraries in the sequencing data. In paper IV barcode specific labeling and flow-sorting for normalization of beads with different barcodes on the surface was used in order to weigh the proportion of data obtained from different samples, while also removing mixed beads, and beads with no or poorly amplified product on the surface, hence also resulting in an increased sequence quality. In paper V, cell heterogeneity within a human being is being investigated by low-coverage whole genome sequencing of single cell material. By focusing on the most variable portion of the human genome, polyguanine nucleotide repeat regions, variability between different cells is investigated and highly variable polyguanine repeat loci are identified. By selectively amplifying and sequencing polyguanine nucleotide repeats from single cells for which the phylogenetic relationship is known, we demonstrate that massively parallel sequencing can be used to study cell-cell variation in length of these repeats, based on which a phylogenetic tree can be drawn. / QC 20111031
|
4 |
Somatic Mutations in Breast Cancer Genomes : Discovery and Validation of Breast Cancer GenesJiao, Xiang January 2012 (has links)
Breast cancer is the most common cancer in women worldwide. However, the genetic alterations that lead to breast cancer are not fully understood. This thesis aims to identify novel genes of potential mechanistic, diagnostic or therapeutic interest in breast cancers by mutational analysis and whole-genome sequencing. In paper I, sequencing of 36 previously identified candidate genes in 96 breast tumors with patient-matched normal DNA determined the somatic mutation prevalence of these candidate genes and identified additional mutations in Notch, NF-κB, PI3K, and Hedgehog pathways as well as in processes mediating DNA methylation, RNA processing and calcium signaling. In paper II, comparison of massively parallel mate-pair sequencing results of a human genome before and after phi29-mediated multiple displacement amplification (MDA) revealed that MDA introduces structural alteration artifacts, with an emphasis on false positive inversions, and impairs the sensitivity to detect true inversions. Therefore, MDA has limited value in sample preparation for whole-genome sequencing for structural alteration detection. In paper III, massively parallel paired-end sequencing identified gene rearrangements in 15 hormone receptor negative breast cancers. Forty validated rearrangements were predicted to directly affect 30 genes, involved in epigenetic regulation, cell mitosis, signalling transduction and glycolytic flux. RNA interference-based assays revealed the potential roles in cell growth of some affected genes, among which DDX10 was implicated to be involved in apoptosis. In paper IV, a method for statistical evaluation of putative translocations detected by massively parallel paired-end sequencing was proposed. In an application of this method to analyse translocations detected by cancer genome deep paired-end sequencing, 76 putative translocations were classified into four categories, with the majority likely to be caused by mismapping due to repetitive regions. Taken together, this thesis provides insights into genes and pathways mutated in sporadic breast cancer genomes, which broaden our understanding of the genetic basis of breast cancer and may ultimately facilitate the diagnosis and treatment of this disease.
|
5 |
Detecting and sequencing Mycobacterium tuberculosis aDNA from archaeological remainsForst, Jannine January 2015 (has links)
Tuberculosis has been an important disease throughout human history, shaping countless past populations. The archaeological study of the causative agents of tuberculosis, members of the Mycobacterium tuberculosis Complex (MTBC), is hindered by the non-diagnostic nature of tuberculosis-associated skeletal changes. As such, ancient DNA (aDNA) or palaeogenetic analyses have become an important tool for identifying tuberculosis in past populations. However, due to the age and variable preservation of aDNA, there are often issues with sporadic results and false negatives. The overall aim of the work presented here was to use different methods, including traditional target-specific PCR, to identify and detect tuberculosis aDNA in archaeological remains. The main objectives within this overarching aim were to first test a method called whole genome amplification (WGA), used to non-specifically amplify all the DNA within a sample, and its potential to improve the yield of aDNA from skeletal remains (Chapters 3 and 4). To determine the extent of its impact, WGA was used in a comparative context, where each archaeological sample analysed was separately subjected to two methods of MTBC detection - the traditional targeted PCR method and the same method assisted by the initial application of WGA. The results show that applying WGA before the traditional targeted PCR methodology to detect the presence of MTBC pathogens in skeletal remains is only useful and viable in some cases, likely depending on the age and preservation of the sample. The second objective was to use next generation sequencing to obtain more information on the aDNA composition of certain archaeological samples and answer questions beyond the scope of traditional target-specific PCR techniques (Chapter 5). Although most of the sequencing runs were variably unsuccessful, the composition of two samples, both known to probably contain tuberculosis aDNA, could be analysed. The samples both contained similar amounts of mycobacterial aDNA and varying amounts of both human and even potentially human intestinal flora DNA. Finally, the third objective was to determine if MTBC aDNA could be detected in a rib sample from Private William Braine of the lost Franklin Expedition using standard target-specific PCR (Chapter 6). In this case study, no evidence of tuberculosis ancient DNA was found. The work done through-out highlights the difficulties of ancient DNA research and, in Chapter 4, shows the importance of using more than a single sample to evaluate methods for application in palaeogenetic contexts.
|
6 |
Optimizing a Selective Whole Genome Amplification (SWGA) Strategy for Clinical Malaria InfectionsAlawi, Mariah 08 1900 (has links)
Plasmodium is a genus well known for causing malaria, a life-threatening infection for many people where malaria is endemic. The blood-borne disease is transmitted by the female Anopheles mosquito. Till date, eight parasite species have been reported to cause malaria in humans that include P. falciparum, P. vivax, P. malariae, P. ovale curtisi, P. ovale wallikeri, P. cynomolgi, P. knowlesi and more recently P. simium. Amongst them, the most genetically understood species is P. falciparum, causing most of the deaths in children from malaria.
Understanding genome variation at the population level of all malaria species is of utmost importance, including clinical cases with very low parasitemia. To achieve this purpose, we need sufficient amounts of parasite DNA material from the pool of host DNA, which always is overrepresented in clinical infections. We utilized a strategy of selective whole genome amplification (SWGA) technology on P. malariae and P. ovale curtisi (two neglected human infecting malaria parasites that often cause mild yet clinically relevant infections with low parasitemia) to efficiently enrich their genomic DNA for high-quality whole genome sequencing. Previous studies on SWGA applied on P. falciparum and P. vivax showed that SWGA could efficiently enrich the amount of starting DNA material from inadequate amounts of parasites directly from clinical samples without separating the host DNA using specifically designed primer sets.
We have successfully designed multiple sets of primers and tested the efficiency of five best primer sets using polymerase chain reaction to enrich the genomes of P. malariae and P. ovale curtisi. The efficiency of primers in enriching the genome was tested on two clinical samples for each of P. malariae and P. ovale curtisi. We were able to enrich the genome of P. malariae with an average of 19-fold (19X) enrichment across both samples. For P. ovale curtisi, we could achieve an enrichment of 3 folds only. Nevertheless, we still obtained a sufficient amount of gDNA to prepare Illumina sequencing libraries and call for SNPs and Indels in a biologically reproducible manner at genome-scale.
|
7 |
Degenerate oligonucleotide primed amplification of genomic DNA for combinatorial screening libraries and strain enrichmentFreedman, Benjamin Gordon 22 December 2014 (has links)
Combinatorial approaches in metabolic engineering can make use of randomized mutations and/or overexpression of randomized DNA fragments. When DNA fragments are obtained from a common genome or metagenome and packaged into the same expression vector, this is referred to as a DNA library. Generating quality DNA libraries that incorporate broad genetic diversity is challenging, despite the availability of published protocols. In response, a novel, efficient, and reproducible technique for creating DNA libraries was created in this research based on whole genome amplification using degenerate oligonucleotide primed PCR (DOP-PCR). The approach can produce DNA libraries from nanograms of a template genome or the metagenome of multiple microbial populations. The DOP-PCR primers contain random bases, and thermodynamics of hairpin formation was used to design primers capable of binding randomly to template DNA for amplification with minimal bias. Next-generation high-throughput sequencing was used to determine the design is capable of amplifying up to 98% of template genomic DNA and consistently out-performed other DOP-PCR primers.
Application of these new DOP-PCR amplified DNA libraries was demonstrated in multiple strain enrichments to isolate genetic library fragments capable of (i) increasing tolerance of E. coli ER2256 to toxic levels of 1-butanol by doubling the growth rate of the culture, (ii) redirecting metabolism to ethanol and pyruvate production (over 250% increase in yield) in Clostridium cellulolyticum when consuming cellobiose, and (iii) enhancing L-arginine production when used in conjunction with a new synthetic gene circuit. / Ph. D.
|
8 |
Investigation of Strategies for Improving STR Typing of Degraded and Low Copy DNA from Human Skeletal Remains and BloodstainsAmbers, Angie D. 08 1900 (has links)
Forensic STR analysis is limited by the quality and quantity of DNA. Significant damage or alteration to the molecular structure of DNA by depurination, crosslinking, base modification, and strand breakage can impact typing success. Two methods that could potentially improve STR typing of challenged samples were explored: an in vitro DNA repair assay (PreCR™ Repair Mix) and whole genome amplification. Results with the repair assay showed trends of improved performance of STR profiling of bleach-damaged DNA. However, the repair assay did not improve DNA profiles from environmentally-damaged bloodstains or bone, and in some cases resulted in lower RFU values for STR alleles. The extensive spectrum of DNA damage and myriad combinations of lesions that can be present in forensic samples appears to pose a challenge for the in vitro PreCR™ assay. The data suggest that the use of PreCR™ in casework should be considered with caution due to the assay’s varied results. As an alternative to repair, whole genome amplification (WGA) was pursued. The DOP-PCR method was selected for WGA because of initial primer design and greater efficacy for amplifying degraded samples. Several modifications of the original DOP-PCR primer were evaluated. These modifications allowed for an overall more robust amplification of damaged DNA from both contemporary and historical skeletal remains compared with that obtained by standard DNA typing and a previously described DOP-PCR method. These new DOP-PCR primers show promise for WGA of degraded DNA.
|
9 |
Analysis of Nucleotide Variations in Non-human PrimatesRönn, Ann-Charlotte January 2007 (has links)
<p>Many of our closest relatives, the primates, are endangered and could be extinct in a near future. To increase the knowledge of non-human primate genomes, and at the same time acquire information on our own genomic evolution, studies using high-throughput technologies are applied, which raises the demand for large amounts of high quality DNA.</p><p>In study I and II, we evaluated the multiple displacement amplification (MDA) technique, a whole genome amplification method, on a wide range of DNA sources, such as blood, hair and semen, by comparing MDA products to genomic DNA as templates for several commonly used genotyping methods. In general, the genotyping success rate from the MDA products was in concordance with the genomic DNA. The quality of sequences of the mitochondrial control region obtained from MDA products from blood and non-invasively collected semen samples was maintained. However, the readable sequence length was shorter for MDA products.</p><p>Few studies have focused on the genetic variation in the nuclear genes of non-human primates. In study III, we discovered 23 new single nucleotide polymorphisms (SNPs) in the Y-chromosome of the chimpanzee. We designed a tag-microarray minisequencing assay for genotyping the SNPs together with 19 SNPs from the literature and 45 SNPs in the mitochondrial DNA. Using the microarray, we were able to analyze the population structure of wild-living chimpanzees.</p><p>In study IV, we established 111 diagnostic nucleotide positions for primate genera determination. We used sequence alignments of the nuclear epsilon globin gene and apolipoprotein B gene to identify positions for determination on the infraorder and Catarrhini subfamily level, respectively, and sequence alignments of the mitochondrial 12S rRNA (MT-RNR1) to identify positions to distinguish between genera. We designed a microarray assay for immobilized minisequencing primers for genotyping these positions to aid in the forensic determination of an unknown sample.</p>
|
10 |
Analysis of Nucleotide Variations in Non-human PrimatesRönn, Ann-Charlotte January 2007 (has links)
Many of our closest relatives, the primates, are endangered and could be extinct in a near future. To increase the knowledge of non-human primate genomes, and at the same time acquire information on our own genomic evolution, studies using high-throughput technologies are applied, which raises the demand for large amounts of high quality DNA. In study I and II, we evaluated the multiple displacement amplification (MDA) technique, a whole genome amplification method, on a wide range of DNA sources, such as blood, hair and semen, by comparing MDA products to genomic DNA as templates for several commonly used genotyping methods. In general, the genotyping success rate from the MDA products was in concordance with the genomic DNA. The quality of sequences of the mitochondrial control region obtained from MDA products from blood and non-invasively collected semen samples was maintained. However, the readable sequence length was shorter for MDA products. Few studies have focused on the genetic variation in the nuclear genes of non-human primates. In study III, we discovered 23 new single nucleotide polymorphisms (SNPs) in the Y-chromosome of the chimpanzee. We designed a tag-microarray minisequencing assay for genotyping the SNPs together with 19 SNPs from the literature and 45 SNPs in the mitochondrial DNA. Using the microarray, we were able to analyze the population structure of wild-living chimpanzees. In study IV, we established 111 diagnostic nucleotide positions for primate genera determination. We used sequence alignments of the nuclear epsilon globin gene and apolipoprotein B gene to identify positions for determination on the infraorder and Catarrhini subfamily level, respectively, and sequence alignments of the mitochondrial 12S rRNA (MT-RNR1) to identify positions to distinguish between genera. We designed a microarray assay for immobilized minisequencing primers for genotyping these positions to aid in the forensic determination of an unknown sample.
|
Page generated in 0.1291 seconds