Approaches to Natural Language Processing

This paper explores topic modeling using the text of Alice in Wonderland as an example. It examines both singular value decomposition (SVD) and non-negative matrix factorization (NMF) as methods for feature extraction. The paper then explores partially supervised topic modeling by introducing themes. A large portion of the paper focuses on implementing these techniques in Python and on visualizing the results using a combination of Python, HTML, and JavaScript with the D3 framework. The paper concludes by presenting a mixture of SVD, NMF, and partially supervised NMF as a possible way to improve topic modeling.

Identifier: oai:union.ndltd.org:CLAREMONT/oai:scholarship.claremont.edu:cmc_theses-2795
Date: 01 January 2018
Creators: Smith, Sydney
Publisher: Scholarship @ Claremont
Source Sets: Claremont Colleges
Detected Language: English
Type: text
Format: application/pdf
Source: CMC Senior Theses
Rights: © 2017 Sydney Smith, default