Return to search

SNP Haplotype Block Inference and Tag Selection Algorithm

SNP (single nucleotide polymorphisms, pronounce as snip) is one nucleotide position difference within human population.
These differences can be detected in human genome and the difference occurs once about every 1000 base pairs. There are only two possible nucleotides in each SNP position. As a genetic marker, SNP data can be used to capture human disease traits because of its abundance and low diversity.
In recent research results, it has been shown that there is a block-like structure in human genome, and only limited haplotype diversity can be observed. Consequently, we can use only a small fraction of SNPs to capture haplotype diversity in each block, and these SNPs are called tagSNPs.
We propose a fixed-diversity approach to capture the diversity of the entire data. After partitioning the haplotype blocks, we will provide an objective way for evaluating the result. We obtain that the diversity of chromosome 21 SNPs locates at 0.5 by using our algorithm. The partition result shows the concurrence property of the haplotype data downloaded from NCBI web site. Finally, we develop an algorithm for tagSNP selection within each block, and obtain the compression ratio 0.78.

Identiferoai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0727104-171533
Date27 July 2004
CreatorsSun, Chia-Ling
ContributorsBang-Ye Wu, You-Ling Shiue, Mau-Shiang Chang, Cheng-Wen Co, Chang-Biau Yang
PublisherNSYSU
Source SetsNSYSU Electronic Thesis and Dissertation Archive
LanguageEnglish
Detected LanguageEnglish
Typetext
Formatapplication/pdf
Sourcehttp://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0727104-171533
Rightsoff_campus_withheld, Copyright information available at source archive

Page generated in 0.0021 seconds