Return to search

Hidden Markov Model inference copy number change in array-CGH data / HMM inference copy number change in array-CGH data

Thesis (S.M.)--Harvard-MIT Division of Health Sciences and Technology, 2005. / Includes bibliographical references (p. 56-57). / Cancer development and progression typically features genomic instability frequently resulting in genomic changes involving DNA copy number gains or losses. Identifying the genomic location of these regional alterations provides important opportunities for the discovery of potential novel oncogenes and tumor suppressors. Recently, array based competitive genomic hybridization (array-CGH) has become available as a powerful approach for genome-wide detection of DNA copy number changes. Array-CGH assesses DNA copy number in tumor samples through competitive hybridization on microarrays containing probes for thousands of genes. The datasets generated are complex and require statistical methods to accurately define discrete and uniform copy number from the data and to identify transitions between genomic regions with altered copy number. Several approaches based on different statistical frameworks have been developed. However, a fundamental informatic issue in array-CGH analysis remains unsolved by these methods. In particular, sample-specific data compression, a result of tumor cells being commonly admixed with normal cells in many tumor types, must be accounted for in each sample analyzed. Additionally, in order to accurately assess deviations from normal copy number, the copy number readout must be shifted to faithfully represent the baseline copy number in each tumor sample. Failure to appropriately address these issues reduces the accuracy of the data in hard-threshold based high-level analysis. / (cont.) By using the natural framework Hidden Markov Models (HMM) to model the distribution of array-CGH signals, a method infer the absolute copy number and identify change points has been developed to address the above problems. This method has been validated on independent dataset and its utility in inference on array-CGH data is demonstrated here. / by Yunyu Zhang. / S.M.

Identiferoai:union.ndltd.org:MIT/oai:dspace.mit.edu:1721.1/33086
Date January 2005
CreatorsZhang, Yunyu
ContributorsLynda Chin, Cheng Li and Cameron W. Brennan., Harvard University--MIT Division of Health Sciences and Technology., Harvard University--MIT Division of Health Sciences and Technology.
PublisherMassachusetts Institute of Technology
Source SetsM.I.T. Theses and Dissertation
LanguageEnglish
Detected LanguageEnglish
TypeThesis
Format57 p., 3055383 bytes, 3056742 bytes, application/pdf, application/pdf, application/pdf
RightsM.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission., http://dspace.mit.edu/handle/1721.1/7582

Page generated in 0.0466 seconds