Global ETD Search

231	Data Mining the Genetics of Leukemia Morton, Geoffrey 13 January 2010 (has links) Acute Lymphoblastic Leukemia (ALL) is the most common cancer in children under the age of 15. At present, diagnosis, prognosis and treatment decisions are made based upon blood and bone marrow laboratory testing. With advances in microarray technology it is becoming more feasible to perform genetic assessment of individual patients as well. We used Singular Value Decomposition (SVD) on Illumina SNP, Affymetrix and cDNA gene-expression data and performed aggressive attribute se- lection using random forests to reduce the number of attributes to a manageable size. We then explored clustering and prediction of patient-specific properties such as disease sub-classification, and especially clinical outcome. We determined that integrating multiple types of data can provide more meaningful information than individual datasets, if combined properly. This method is able to capture the cor- relation between the attributes. The most striking result is an apparent connection between genetic background and patient mortality under existing treatment regimes. We find that we can cluster well using the mortality label of the patients. Also, using a Support Vector Machine (SVM) we can predict clinical outcome with high accu-racy. This thesis will discuss the data-mining methods used and their application to biomedical research, as well as our results and how this will affect the diagnosis and treatment of ALL in the future. / Thesis (Master, Computing) -- Queen's University, 2010-01-12 18:40:44.2 Data Mining Acute Lymphoblastic Leukemia
232	Mining frequent sequences in one database scan using distributed computers Brajczuk, Dale A. 01 September 2011 (has links) Existing frequent-sequence mining algorithms perform multiple scans of a database, or a structure that captures the database. In this M.Sc. thesis, I propose a frequent-sequence mining algorithm that mines each database row as it reads it, so that it can potentially complete mining in the time it takes to read the database once. I achieve this by having my algorithm enumerate all sub-sequences from each row as it reads it. Since sub-sequence enumeration is a time-consuming process, I create a method to distribute the work over multiple computers, processors, and thread units, while balancing the load between all resources, and limiting the amount of communication so that my algorithm scales well in regards to the number of computers used. Experimental results show that my algorithm is effective, and can potentially complete the mining process in near the time it takes to perform one scan of the input database. data mining databases distributed computing
233	Mining frequent sequences in one database scan using distributed computers Brajczuk, Dale A. 01 September 2011 (has links) Existing frequent-sequence mining algorithms perform multiple scans of a database, or a structure that captures the database. In this M.Sc. thesis, I propose a frequent-sequence mining algorithm that mines each database row as it reads it, so that it can potentially complete mining in the time it takes to read the database once. I achieve this by having my algorithm enumerate all sub-sequences from each row as it reads it. Since sub-sequence enumeration is a time-consuming process, I create a method to distribute the work over multiple computers, processors, and thread units, while balancing the load between all resources, and limiting the amount of communication so that my algorithm scales well in regards to the number of computers used. Experimental results show that my algorithm is effective, and can potentially complete the mining process in near the time it takes to perform one scan of the input database. data mining databases distributed computing
234	Data preparation for biomedical knowledge domain visualization : a probabilistic record linkage and information fusion approach to citation data / Synnestvedt, Marie B. Lin, Xia. January 2007 (has links) Thesis (Ph.D.)--Drexel University, 2007. / Includes abstract and vita. Includes bibliographical references (leaves 98-102).
235	Semantic web Einführung, wirtschaftliche Bedeutung, Perspektiven Tusek, Jasna January 2006 (has links) Zugl.: Wien, Wirtschaftsuniv., Diplomarb.
236	Latent semantic analysis and classification modeling in applications for social movement theory / Spomer, Judith E., January 2008 (has links) (PDF) Thesis (M.S.) -- Central Connecticut State University, 2008. / Thesis advisor: Roger Bilisoly. "... in partial fulfillment of the requirements for the degree of Master of Science in Data Mining." Includes bibliographical references (leaves 122-127). Also available via the World Wide Web. Data mining. Latent semantic indexing.
237	Segmentierungs- und Klassifikationsmethoden der Statistik und des Data Mining Einsatzmöglichkeiten und Früherkennungspotenziale im Vertriebscontrolling eines Direktvertriebs Pilger, Jörg January 2007 (has links) Zugl.: Passau, Univ., Diss., 2007
238	Efficient decision tree building algorithms for uncertain data Tsang, Pui-kwan, Smith. January 2008 (has links) Thesis (M. Phil.)--University of Hong Kong, 2009. / Includes bibliographical references (leaves 84-88) Also available in print. Decision trees. Data mining. Algorithms.
239	Meta-learning strategies, implementations, and evaluations for algorithm selection / Köpf, Christian Rudolf. January 1900 (has links) Thesis (doctorate)--Universität Ulm, 2005. / Includes bibliographical references (p. 227-248).
240	Finding And Evaluating Patterns In Wes Repository Using Database Technology And Data Mining Algorithms/ Özakar, Belgin. Püskülcü, Halis January 2002 (has links) (PDF) Thesis (Master)--İzmir Institute of Technology, İzmir, 2002. / Includes bibliographical references (59-61). Data mining Web usage mining

Search results