Version-controlled documents, such as Wikipedia or program codes in Subversion, demands a novel methodology to be analyzed efficiently. The documents are continually edited by one or more authors in contrast of the case of static documents. These collaborative processses make traditional methodologies to be ineffective, yet needs for efficient methodologies are rapidly developing. This paper proposes two new models based on Local Space-time Smoothing (LSS) which captures important revision patterns while Cumulative Revision Map (CRM) tracks word insertions and deletions in particular positions of a document. These two methods enable us to understand and visualize the revision patterns intuitively and efficiently. Synthetic data and real-world data are used to demonstrate its applicability.
Identifer | oai:union.ndltd.org:GATECH/oai:smartech.gatech.edu:1853/39603 |
Date | 05 April 2011 |
Creators | Kim, Seungyeon |
Publisher | Georgia Institute of Technology |
Source Sets | Georgia Tech Electronic Thesis and Dissertation Archive |
Detected Language | English |
Type | Thesis |
Page generated in 0.0055 seconds