SNP (single nucleotide polymorphisms, pronounce as snip) is one nucleotide position difference within human population.
These differences can be detected in human genome and the difference occurs once about every 1000 base pairs. There are only two possible nucleotides in each SNP position. As a genetic marker, SNP data can be used to capture human disease traits because of its abundance and low diversity.
In recent research results, it has been shown that there is a block-like structure in human genome, and only limited haplotype diversity can be observed. Consequently, we can use only a small fraction of SNPs to capture haplotype diversity in each block, and these SNPs are called tagSNPs.
We propose a fixed-diversity approach to capture the diversity of the entire data. After partitioning the haplotype blocks, we will provide an objective way for evaluating the result. We obtain that the diversity of chromosome 21 SNPs locates at 0.5 by using our algorithm. The partition result shows the concurrence property of the haplotype data downloaded from NCBI web site. Finally, we develop an algorithm for tagSNP selection within each block, and obtain the compression ratio 0.78.
Identifer | oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0727104-171533 |
Date | 27 July 2004 |
Creators | Sun, Chia-Ling |
Contributors | Bang-Ye Wu, You-Ling Shiue, Mau-Shiang Chang, Cheng-Wen Co, Chang-Biau Yang |
Publisher | NSYSU |
Source Sets | NSYSU Electronic Thesis and Dissertation Archive |
Language | English |
Detected Language | English |
Type | text |
Format | application/pdf |
Source | http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0727104-171533 |
Rights | off_campus_withheld, Copyright information available at source archive |
Page generated in 0.0018 seconds