Global ETD Search

Genetic Algorithm Representation Selection Impact on Binary Classification Problems

In this thesis, we explore how problem representation affects the ability of a genetic algorithm (GA) to evolve a binary classification model that predicts whether a physical therapist is paid above or below the median amount from Medicare. We compare three problem representations: the vector GA (VGA), the binary GA (BGA), and the proportional GA (PGA). We find that all three representations can produce models with high accuracy and low loss that outperform Scikit-Learn's logistic regression model, and that all three select the same features; however, the PGA tends to produce lower weights than the VGA and BGA. We also find that, at higher mutation rates, the gap in accuracy between the individual with the best fitness (lowest binary cross-entropy loss) and the most accurate solution is larger. We then investigate whether biases in the PGA mapping functions may encourage the lower values. We find that the PGA has biases in the values it can encode, depending on the mapping function; however, because we do not find a bias towards lower values for all tested mapping functions, a more likely explanation is that the PGA has more difficulty encoding extreme values, given that crossover tends to have an averaging effect on the PGA chromosome.
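
The distinction the abstract draws between the individual with the best fitness and the most accurate solution can be illustrated with a minimal sketch. It assumes each GA individual decodes to the weights and bias of a logistic model, which the abstract does not state; the function names and the toy data are hypothetical and are not taken from the thesis.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def predict_proba(weights, bias, row):
    # Assumed model form: a logistic model over a weighted sum of features.
    return sigmoid(bias + sum(w * x for w, x in zip(weights, row)))

def bce_loss(weights, bias, X, y, eps=1e-12):
    # Mean binary cross-entropy; lower is better (used here as GA fitness).
    total = 0.0
    for row, label in zip(X, y):
        p = min(max(predict_proba(weights, bias, row), eps), 1.0 - eps)
        total -= label * math.log(p) + (1 - label) * math.log(1.0 - p)
    return total / len(y)

def accuracy(weights, bias, X, y):
    # Fraction of rows whose thresholded prediction matches the label.
    hits = sum(int(predict_proba(weights, bias, row) >= 0.5) == label
               for row, label in zip(X, y))
    return hits / len(y)

# Hypothetical toy data: one feature, four examples.
X = [[-3.0], [-3.0], [3.0], [-0.1]]
y = [0, 0, 1, 1]

# Two hypothetical individuals: one confident, one cautious.
confident = ([2.0], 0.0)
cautious = ([0.1], 0.02)

for name, (w, b) in [("confident", confident), ("cautious", cautious)]:
    print(name, round(bce_loss(w, b, X, y), 3), accuracy(w, b, X, y))
```

On this toy data the confident individual has the lower loss but misclassifies one example, while the cautious individual classifies every example correctly at a higher loss, so ranking by loss and ranking by accuracy can pick different individuals.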

Genetic Algorithms

Binary Classification

Identifier	oai:union.ndltd.org:ucf.edu/oai:stars.library.ucf.edu:honorstheses-2373
Date	01 January 2022
Creators	Maldonado, Stephen V
Publisher	STARS
Source Sets	University of Central Florida
Language	English
Detected Language	English
Type	text
Format	application/pdf
Source	Honors Undergraduate Theses
