Identifying bacterial strains is important not only for human health research but also for various environmental and industrial applications. The plethora of available shotgun metagenomic datasets provide an unprecedented opportunity to discover novel bacterial strains in different environmental niches. In this thesis, we evaluated the existing novel-strain-based tools and showed that their performance is still suboptimal. Due to the difficulty in distinguishing strains of similar abundance by available tools, the SMS (strains in multiple samples) tool was developed to de novo identify bacterial strains in multiple shotgun metagenomic samples. We showed that SMS distinguishes strains of similar abundance well and outperforms other novel-strain-based tools on both simulated and experimental datasets. Applying the SMS tool to the Atopic Dermatitis (AD) samples, we discovered novel strains of Staphylococcus aureus and Staphylococcus epidermidis, and diversity that could not be observed when the analysis was based on current known strain databases. Compared with the previously identified known strains in the same samples, the predicted novel strains showed a better likelihood of being present in the samples than the known strains. Annotation and functional pathway analysis of these AD-related novel strains revealed their relation to coding sequences and functional pathways that have been known to contribute to the ability that bacteria become pathogenic and carry out infection, suggesting their potential roles in AD progression.
Identifer | oai:union.ndltd.org:ucf.edu/oai:stars.library.ucf.edu:etd2020-2686 |
Date | 01 January 2022 |
Creators | Ventolero, Minerva Fatimae |
Publisher | STARS |
Source Sets | University of Central Florida |
Language | English |
Detected Language | English |
Type | text |
Format | application/pdf |
Source | Electronic Theses and Dissertations, 2020- |
Page generated in 0.0012 seconds