The elections in South Africa are contested by multiple political parties appealing to a
diverse population that comes from a variety of socioeconomic backgrounds. As a result,
a rich source of discourse is created to inform voters about election-related content. Two
common sources of information to help voters with their decision are news articles and
tweets, this study aims to understand the discourse in these two sources using natural
language processing. Topic modelling techniques, Latent Dirichlet Allocation and Non-
negative Matrix Factorization, are applied to digest the breadth of information collected
about the elections into topics. The topics produced are subjected to further analysis
that uncovers similarities between topics, links topics to dates and events and provides a
summary of the discourse that existed prior to the South African general elections. The
primary focus is on the 2019 elections, however election-related articles from 2014 and
2019 were also compared to understand how the discourse has changed. / Mini Dissertation (MIT (Big Data Science))--University of Pretoria, 2019. / Computer Science / MIT (Big Data Science) / Unrestricted
Identifer | oai:union.ndltd.org:netd.ac.za/oai:union.ndltd.org:up/oai:repository.up.ac.za:2263/82552 |
Date | January 2019 |
Creators | Moodley, Avashlin |
Contributors | Marivate, Vukosi, avashlin@gmail.com |
Publisher | University of Pretoria |
Source Sets | South African National ETD Portal |
Language | English |
Detected Language | English |
Type | Mini Dissertation |
Rights | © 2021 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria. |
Page generated in 0.0021 seconds