Return to search

Predicting Satisfaction in Customer Support Chat : Opinion Mining as a Binary Classification Problem

The study explores binary classification with Support Vector Machines as means to predict a satisfaction score based on customer surveys in the customer supportdomain. Standard feature selection methods and their impact on results are evaluated and a feature scoring metric Log Odds Ratio is implemented for addressingasymmetrical class distributions. Results show that the feature selection andscoring methods implemented improve performance significantly. Results alsoshow that it is possible to get decent predictive values on test data based onlimited amount of training observations. However mixed results are presentedin a real-world application example as a there is a significant error rate fordiscriminating the minority class. We also show the negative effects of usingcommon metrics such as accuracy and f-measure for optimizing models whendealing with high-skew data in a classification context.

Identiferoai:union.ndltd.org:UPSALLA1/oai:DiVA.org:uu-300165
Date January 2016
CreatorsHedlund, Henrik
PublisherUppsala universitet, Institutionen för lingvistik och filologi
Source SetsDiVA Archive at Upsalla University
LanguageEnglish
Detected LanguageEnglish
TypeStudent thesis, info:eu-repo/semantics/bachelorThesis, text
Formatapplication/pdf
Rightsinfo:eu-repo/semantics/openAccess

Page generated in 0.0015 seconds