Doctor of Philosophy / Computing and Information Sciences / William H. Hsu / Although the Hierarchical Dirichlet Process (HDP) has recently been widely applied to topic modeling tasks, most existing hybrid models for concurrent inference of topics and other factors are not based on HDP.
In this dissertation, we present two new models that extend an HDP topic modeling framework to incorporate other learning factors. The first model injects Latent Dirichlet Allocation (LDA) based sentiment learning into HDP. This model preserves the benefits of nonparametric Bayesian models for topic learning while simultaneously learning latent sentiment aspects: it automatically learns a separate word distribution for each sentiment polarity within each generated topic.
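The generative idea behind per-topic, per-sentiment word distributions can be sketched as follows. This is a minimal illustrative sample of the sampling step only, not the dissertation's inference procedure; all names (`n_topics`, `n_sentiments`, `phi`, `pi`, `theta`) and the fixed, finite topic count are assumptions for the sketch (a true HDP would draw the topic set nonparametrically).

```python
import numpy as np

# Illustrative sketch (hypothetical names): each (topic, sentiment) pair
# gets its own word distribution, as in joint topic-sentiment models.
rng = np.random.default_rng(0)

n_topics, n_sentiments, vocab_size = 3, 2, 10

# theta: per-document topic distribution (Dirichlet-distributed, as in LDA)
theta = rng.dirichlet(np.ones(n_topics))
# pi[k]: sentiment-polarity distribution within topic k
pi = rng.dirichlet(np.ones(n_sentiments), size=n_topics)
# phi[k, s]: separate word distribution for topic k under sentiment s
phi = rng.dirichlet(np.ones(vocab_size), size=(n_topics, n_sentiments))

def generate_word():
    """Draw one word: topic z, then sentiment s within z, then word w from phi[z, s]."""
    z = rng.choice(n_topics, p=theta)
    s = rng.choice(n_sentiments, p=pi[z])
    w = rng.choice(vocab_size, p=phi[z, s])
    return z, s, w

z, s, w = generate_word()
```

Because `phi` is indexed by both topic and sentiment, the same topic can surface different vocabulary under positive versus negative polarity, which is the property the model exploits.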
The second model combines an existing HDP framework for learning topics from free text with latent authorship learning within a generative model that uses author list information. This model adds one more layer to the current hierarchy of HDPs to represent topic groups shared by authors, and each document's topic distribution is represented as a mixture of the topic distributions of its authors. In addition to topics, the model automatically learns the author contribution partition for each document.
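The mixture construction above can be sketched in a few lines. This is a hedged illustration of the mixing step only, with hypothetical names (`authors`, `doc_topic_distribution`, `weights`); the actual contribution partition in the dissertation is learned by inference, whereas here it is supplied directly.

```python
import numpy as np

rng = np.random.default_rng(1)
n_topics = 4

# Hypothetical per-author topic distributions (each a point on the simplex).
authors = {
    "author_a": rng.dirichlet(np.ones(n_topics)),
    "author_b": rng.dirichlet(np.ones(n_topics)),
}

def doc_topic_distribution(author_list, weights=None):
    """Document topic distribution as a convex mixture of its authors' distributions.

    `weights` plays the role of the author contribution partition; defaults
    to a uniform partition when no contribution information is given.
    """
    mats = np.stack([authors[a] for a in author_list])
    if weights is None:
        weights = np.full(len(author_list), 1.0 / len(author_list))
    weights = np.asarray(weights, dtype=float)
    return weights @ mats

theta_doc = doc_topic_distribution(["author_a", "author_b"])
```

Since a convex combination of points on the probability simplex stays on the simplex, the resulting document distribution is always a valid topic distribution.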
Identifier | oai:union.ndltd.org:KSU/oai:krex.k-state.edu:2097/20598 |
Date | January 1900 |
Creators | Yang, Ming |
Publisher | Kansas State University |
Source Sets | K-State Research Exchange |
Language | en_US |
Detected Language | English |
Type | Dissertation |