Return to search

Computerised methods for selecting a small number of single nucleotide polymorphisms that enable bacterial strain discrimination

The possibility of identifying single nucleotide polymorphisms (SNPs) that would be useful for rapid bacterial typing was investigated. Neisseria meningitidis was the organism chosen for modelling the approach since informative SNPs could be found amongst the sequence data available for multi-locus sequence typing (MLST) at http://www.mlst.net. The hypothesis tested was that a small number of SNPs located within the seven gene fragments sequenced for MLST provide information equivalent to MLST. Preliminary investigations revealed that a small number of SNPs could be utilised to highly discriminate sequence types (STs) of clinical interest. Laboratory procedures demonstrated that SNP fingerprinting of N. meningitidis isolates is achievable. Further tests showed that laboratory identification of a defining SNP in the genome of isolates was to be a practical method of obtaining relevant typing information. Identification of the most discriminating SNPs amongst the ever-increasing amount of MLST sequence data summoned the need for computer-based assistance. Two methods of SNP selection devised by the author of this thesis were translated into computer-based algorithms by contributing team members. Software for two computer programs was produced. The algorithms facilitate the optimal selection of SNPs useful for (1) distinguishing specific STs and (2) differentiating non-specific STs. Current input information can be obtained from the MLST database and consequently the programs can be applied to any bacterial species for which MLST data have been entered. The two algorithms for the selection of SNPs were designed to serve contrasting purposes. The first of these was to determine the ST identity of isolates from an outbreak of disease. In this case, isolates would be tested for their membership to any of the STs known to be associated with disease. It was shown that one SNP per ST could distinguish each of four hyperinvasive STs of N. meningitidis from between 92.5% and 97.5% of all other STs. With two SNPs per ST, between 96.7% and 99.0% discrimination is achieved. The SNPs were selected from MLST loci with the assistance of the first algorithm which scores SNPs according to the number of base mismatches in a sequence alignment between an allele of an ST of interest and alleles belonging to all other STs at a specified locus. The second purpose was to determine whether or not isolates from different sources belong to the same ST, regardless of their actual ST identity. It was shown that with seven SNPs, four sample STs of N. meningitidis could, on average, be discriminated from 97.1% of all other STs. The SNPs were selected with the aid of the second algorithm which scores SNPs at MLST loci for the relative frequency of each nucleotide base in a sequence alignment as a measure of the extent of their polymorphism. A third algorithm for selecting SNPs has been discussed. By altering the method of scoring SNPs, it is possible to overcome the limitations inherent in the two algorithms that were utilised for finding SNPs. In addition, the third approach caters for finding SNPs that distinguish members of a complex from non-members.

Identiferoai:union.ndltd.org:ADTP/265276
Date January 2006
CreatorsRobertson, Gail Alexandra
PublisherQueensland University of Technology
Source SetsAustraliasian Digital Theses Program
Detected LanguageEnglish
RightsCopyright Gail Alexandra Robertson

Page generated in 0.002 seconds