This thesis examines the discussions on Reddit surrounding the launch of ChatGPT, from late November 2022 until the end of March 2023. The objective of the study is to analyze the discussions concerning ChatGPT and how different topics have changed over time. Additionally, the thesis identifies significant events that have had an impact on the topics and examines how topics vary across different subreddits. To retrieve the data for the analysis, the PushShift API was used to gather almost half a million posts concerning ChatGPT. Topic modeling was then applied using BERTopic to identify common topics discussed on Reddit and its individual subreddits. The results show several distinct topics, encompassing the technology behind ChatGPT, its societal implications, and its potential for creative use. Furthermore, the thesis presents a clear correlation between significant news concerning ChatGPT and the frequency of posts on Reddit. Specifically, Microsoft’s investment in OpenAI and the incorporation of the GPT engine in Bing proved to have a great influence on both the topics and the frequency of posts. We also found some differences in how subreddits discuss topics, most notably that more general topics tend to be spread out more, both across subreddits and over time, and appear more sporadically, while specific topics tend to be driven more by the occurrence of significant events relevant to the topic.
Identifier | oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:liu-199508 |
Date | January 2023 |
Creators | Nordell, Erik, Mogren, Max |
Publisher | Linköpings universitet, Institutionen för datavetenskap |
Source Sets | DiVA Archive at Uppsala University |
Language | English |
Detected Language | English |
Type | Student thesis, info:eu-repo/semantics/bachelorThesis, text |
Format | application/pdf |
Rights | info:eu-repo/semantics/openAccess |