Return to search

A suite of computational tools to interrogate sequence data with local haplotype analysis within complex ​Plasmodium​ infections and other microbial mixtures

The rapid development of DNA sequencing technologies has opened up new avenues of research, including the investigation of population structure within infectious diseases (both within patient and between populations). In order to take advantage of these advances in technologies and the generation of new types of data, novel bioinformatics tools are needed that won’t succumb to artifacts introduced by the data generation, and thus provide accurate and precise results. To achieve this goal I have create several tools.
First, SeekDeep, a pipeline for analyzing targeted amplicon sequencing datasets from various technologies, is able to achieve 1-base resolution even at low frequencies and read depths allowing for accurate comparison between samples and the detection of important SNPs. Next, PathWeaver, a local haplotype assembler designed for complex infections and highly variable genomic regions with poor reference mapping. PathWeaver is able to create highly accurate haplotypes without generating chimeric assemblies. PathWeaver was used on the key protein in pregnancy-associated malaria Plasmodium falciparum VAR2CSA which revealed population sub-structuring within the key binding domain of the protein observed to be present globally along with confirming copy number variation. Finally, the program Carmen is able to utilize PathWeaver to augment the results from targeted amplicon approaches by reporting where and when local haplotypes have been found previously.
These rigorously tested tools allow the analysis of local haplotype data from various technologies and approaches to provide accurate, precise and easily accessible results.

Identiferoai:union.ndltd.org:umassmed.edu/oai:escholarship.umassmed.edu:gsbs_diss-1976
Date19 March 2018
CreatorsHathaway, Nicholas J.
PublishereScholarship@UMMS
Source SetsUniversity of Massachusetts Medical School
Detected LanguageEnglish
Typetext
Formatapplication/pdf
SourceGSBS Dissertations and Theses
RightsCopyright is held by the author, with all rights reserved.

Page generated in 0.0157 seconds