Global ETD Search

Return to search

Cost-Sensitive Learning-based Methods for Imbalanced Classification Problems with Applications

Analysis and predictive modeling of massive datasets is an extremely significant problem that arises in many practical applications. The task of predictive modeling becomes even more challenging when data are imperfect or uncertain. The real data are frequently affected by outliers, uncertain labels, and uneven distribution of classes (imbalanced data). Such uncertainties create bias and make predictive modeling an even more difficult task. In the present work, we introduce a cost-sensitive learning method (CSL) to deal with the classification of imperfect data. Typically, most traditional approaches for classification demonstrate poor performance in an environment with imperfect data. We propose the use of CSL with Support Vector Machine, which is a well-known data mining algorithm. The results reveal that the proposed algorithm produces more accurate classifiers and is more robust with respect to imperfect data. Furthermore, we explore the best performance measures to tackle imperfect data along with addressing real problems in quality control and business analytics.

Classification

imbalanced data

cost sensitive learning

outliers

weighted support vector machine

relaxed support vector machines

control chart pattern recognition

Engineering

Industrial Engineering

Identifer	oai:union.ndltd.org:ucf.edu/oai:stars.library.ucf.edu:etd-5574
Date	01 January 2014
Creators	Razzaghi, Talayeh
Publisher	STARS
Source Sets	University of Central Florida
Language	English
Detected Language	English
Type	text
Format	application/pdf
Source	Electronic Theses and Dissertations

Page generated in 0.0018 seconds

Cost-Sensitive Learning-based Methods for Imbalanced Classification Problems with Applications

Description

Links & Downloads

Tags

Additional Fields