In the absence of clinical markers for Chronic Fatigue Syndrome (CFS), research
to find a biological basis for it is still open. Many data-mining techniques have
been widely employed to analyze biomedical data describing different aspects of CFS.
However, the inconsistency of the results of these studies reflect the uncertainty in
regards to the real basis of this disease. In this thesis, we show that CFS has a
biological basis that is detectable in gene expression data better than blood profile
and Single Nucleotide Polymorphism (SNP) data. Using random forests, the analysis
of gene expression data achieves a prediction accuracy of approximately 89%. We also
identify sets of differentially expressed candidate genes that might contribute to CFS.
We show that the integration of data spanning multiple levels of the biological scale
might reveal further insights into the understanding of CFS. Using integrated data,
we achieve a prediction accuracy of approximately 91%. We find that Singular Value
Decomposition (SVD) is a useful technique to visualize the performance of random
forests. / Thesis (Master, Computing) -- Queen's University, 2007-12-11 12:15:40.096
Identifer | oai:union.ndltd.org:LACETR/oai:collectionscanada.gc.ca:OKQ.1974/940 |
Date | 18 December 2007 |
Creators | Abou-Gouda, Samar A. |
Contributors | Queen's University (Kingston, Ont.). Theses (Queen's University (Kingston, Ont.)) |
Source Sets | Library and Archives Canada ETDs Repository / Centre d'archives des thèses électroniques de Bibliothèque et Archives Canada |
Language | English, English |
Detected Language | English |
Type | Thesis |
Format | 841075 bytes, application/pdf |
Rights | This publication is made available by the authority of the copyright owner solely for the purpose of private study and research and may not be copied or reproduced except as permitted by the copyright laws without written authority from the copyright owner. |
Relation | Canadian theses |
Page generated in 0.0018 seconds