Global ETD Search

Return to search

Probabilistic Diagnostic Model for Handling Classifier Degradation in Machine Learning

Several studies point out different causes of performance degradation in supervised machine learning. Problems such as class imbalance, overlapping, small-disjuncts, noisy labels, and sparseness limit accuracy in classification algorithms. Even though a number of approaches either in the form of a methodology or an algorithm try to minimize performance degradation, they have been isolated efforts with limited scope. This research consists of three main parts: In the first part, a novel probabilistic diagnostic model based on identifying signs and symptoms of each problem is presented. Secondly, the behavior and performance of several supervised algorithms are studied when training sets have such problems. Therefore, prediction of success for treatments can be estimated across classifiers. Finally, a probabilistic sampling technique based on training set diagnosis for avoiding classifier degradation is proposed<br>

10.25394/pgs.11312474.v1

Statistics

Pattern Recognition and Data Mining

Gaussian Mixture Models

Separation index

Classifier degradation

Bayesian Information Criterion (BIC)

Identifer	oai:union.ndltd.org:purdue.edu/oai:figshare.com:article/11312474
Date	04 December 2019
Creators	Gustavo A. Valencia-Zapata (8082655)
Source Sets	Purdue University
Detected Language	English
Type	Text, Thesis
Rights	CC BY 4.0
Relation	https://figshare.com/articles/Probabilistic_Diagnostic_Model_for_Handling_Classifier_Degradation_in_Machine_Learning/11312474

Page generated in 0.0016 seconds

Probabilistic Diagnostic Model for Handling Classifier Degradation in Machine Learning

Description

Links & Downloads

Tags

Additional Fields