Clustering is a fundamental data mining tool that aims to divide data into groups of similar items. Intuition about clustering reflects the ideal case -- exact data sets endowed with flawless dissimilarity between individual instances. In practice however, these cases are in the minority, and clustering applications are typically characterized by noisy data sets with approximate pairwise dissimilarities. As such, the efficacy of clustering methods necessitates robustness to perturbations. In this paper, we address foundational questions on perturbation robustness, studying to what extent can clustering techniques exhibit this desirable characteristic. Our results also demonstrate the type of cluster structures required for robustness of popular clustering paradigms. / A Thesis submitted to the Department of Computer Science in partial fulfillment of the requirements for the degree of Master of Science. / Summer Semester 2017. / May 4, 2017. / Includes bibliographical references. / Margareta Ackerman, Professor Co-Directing Thesis; Gary Tyson, Professor Co-Directing Thesis; Sonia Haiduc, Committee Member; Peixiang Zhao, Committee Member.
Identifer | oai:union.ndltd.org:fsu.edu/oai:fsu.digital.flvc.org:fsu_552106 |
Contributors | Moore, Jarrod (authoraut), Ackerman, Margareta (professor co-directing thesis), Tyson, Gary Scott (professor co-directing thesis), Haiduc, Sonia (committee member), Zhao, Peixiang (committee member), Florida State University (degree granting institution), College of Arts and Sciences (degree granting college), Department of Scientific Computing (degree granting departmentdgg) |
Publisher | Florida State University |
Source Sets | Florida State University |
Language | English, English |
Detected Language | English |
Type | Text, text, master thesis |
Format | 1 online resource (31 pages), computer, application/pdf |
Page generated in 0.0018 seconds