The emergence of information extraction (IE) oriented pattern engines has been observed during the last decade. Most of them exploit heavily finite-state devices. This paper introduces ExPRESS – a new extraction pattern engine, whose rules are regular expressions over flat feature structures. The underlying pattern language is a blend of two previously introduced IE oriented pattern formalisms, namely, JAPE, used in the widely known GATE system, and the unificationbased XTDL formalism used in SProUT. A brief and technical overview of ExPRESS, its pattern language and the pool of its native linguistic components is given. Furthermore, the implementation of the grammar interpreter is addressed too.
Identifer | oai:union.ndltd.org:Potsdam/oai:kobv.de-opus-ubp:2722 |
Date | January 2008 |
Creators | Piskorski, Jakub |
Publisher | Universität Potsdam, Extern. Extern |
Source Sets | Potsdam University |
Language | English |
Detected Language | English |
Type | InProceedings |
Format | application/pdf |
Rights | http://opus.kobv.de/ubp/doku/urheberrecht.php |
Page generated in 0.0015 seconds