Global ETD Search

Return to search

Table Understanding for Information Retrieval

This thesis proposes a novel approach for finding tables in text files containing a mixture of unstructured and structured text. Tables may be arbitrarily complex because the data in the tables may themselves be tables and because the grouping of data elements displayed in a table may be very complex. Although investigators have proposed competence models to explain the structure of tables, there are no computationally feasible performance models for detecting and parsing general structures in real data. Our emphasis is placed on the investigation of a new statistical procedure for detecting basic tables in plain text documents. The main task here is defining and testing this theory in the context of the Odessa Digital Library. / Master of Science

Information retrieval

Statistical crosscorrelation

Odessa digital library

detection heuristics

Table detection

Identifer	oai:union.ndltd.org:VTETD/oai:vtechworks.lib.vt.edu:10919/34820
Date	03 September 2002
Creators	Pande, Ashwini K.
Contributors	Computer Science, Ehrich, Roger W., Fox, Edward A., North, Christopher L.
Publisher	Virginia Tech
Source Sets	Virginia Tech Theses and Dissertation
Detected Language	English
Type	Thesis
Format	application/pdf
Rights	In Copyright, http://rightsstatements.org/vocab/InC/1.0/
Relation	AshwiniPandeTableIR.pdf

Page generated in 0.0027 seconds

Table Understanding for Information Retrieval

Description

Links & Downloads

Tags

Additional Fields