Return to search

A topic model based approach to inferring episodic directional selection in protein coding sequences

Pathogens, such as HIV and influenza, evolve in response to the selective pressures of their host environments accumulating changes in their genomes that offer fitness benefits. This selective pressure is characterised by three properties: (1.) it is episodic, tracking changes in the adaptive immune response and drug therapy, (2.) it is directional in that only particular amino acid substitutions are favoured and (3.) it varies between genomic loci. Most previous models have ignored or inadequately addressed some of these phenomena. This work extends recent approaches to modelling episodic directional selection acting on protein-coding sequences. We use inference techniques within the topic model framework to identify loci evolving under natural selection. A notable example of such techniques are the variational Bayesian methods. We show that our approach performs well in terms of specificity and power, and demonstrate its utility by applying it to some real datasets of HIV sequences.

Identiferoai:union.ndltd.org:netd.ac.za/oai:union.ndltd.org:uct/oai:localhost:11427/20013
Date January 2015
CreatorsSadiq, Hassan Taiwo
ContributorsLacerda, Miguel
PublisherUniversity of Cape Town, Faculty of Science, Department of Statistical Sciences
Source SetsSouth African National ETD Portal
LanguageEnglish
Detected LanguageEnglish
TypeMaster Thesis, Masters, MSc
Formatapplication/pdf

Page generated in 0.0021 seconds