Return to search

Improving Topic Tracking with Domain Chaining

Topic Detection and Tracking (TDT) research has produced some successful statistical tracking systems. While lexical chaining, a non-statistical approach, has also been applied to the task of tracking by Carthy and Stokes for the 2001 TDT evaluation, an efficient tracking system based on this technology has yet to be developed. In thesis we investigate two new techniques which can improve Carthy's original design. First, at the core of our system is a semantic domain chainer. This chainer relies not only on the WordNet database for semantic relationships but also on Magnini's semantic domain database, which is an extension of WordNet. The domain-chaining algorithm is a linear algorithm. Second, to handle proper nouns, we gather all of the ones that occur in a news story together in a chain reserved for proper nouns. In this thesis we also discuss the linguistic limitations of lexical chainers to represent textual meaning.

Identiferoai:union.ndltd.org:unt.edu/info:ark/67531/metadc4274
Date08 1900
CreatorsYang, Li
ContributorsMontler, Timothy, Ross, John Robert, 1938-, Mihalcea, Rada, 1974-
PublisherUniversity of North Texas
Source SetsUniversity of North Texas
LanguageEnglish
Detected LanguageEnglish
TypeThesis or Dissertation
FormatText
RightsPublic, Copyright, Yang, Li, Copyright is held by the author, unless otherwise noted. All rights reserved.

Page generated in 0.0023 seconds