Made available in DSpace on 2015-04-14T14:50:02Z (GMT). No. of bitstreams: 1
448850.pdf: 2763654 bytes, checksum: 430bc12ce50b74821fe6a8aa37994260 (MD5)
Previous issue date: 2013-03-26 / This work studies Word Sense Disambiguation (WSD) in the Biomedicine domain for English language, using external knowledge sources. Among the existing proposals for the selection of a sense for an ambiguous word, there is the graph-based approach. This approach uses a metric in the evaluation of graphs containing candidates to the correct sense for the ambiguous word. In this research, a set of metrics is analyzed individually, and, based on this evaluation, we propose a hybrid model for the selection of the metrics in order to determine the most adequate metric to be employed. The model makes use of a set of features and heuristics that determine a semi-supervised solution for WSD. The results obtained with experiments show an improvement in performance and reveal new perspectives of research. The proposed model raises the hit rate to 68,48%, increasing significantly in 3,52% the rate reported in literature / Este trabalho estuda o Word Sense Disambiguation no dom?nio da Biomedicina, para a l?ngua inglesa, com uso de fontes externas de conhecimento. Dentre as propostas existentes para a sele??o de um sentido para uma palavra amb?gua, est? a abordagem baseadas em grafos. Essa abordagem emprega uma m?trica na avalia??o de grafos que cont?m candidatos ao sentido correto da palavra amb?gua. Nesta pesquisa um conjunto de m?tricas ? analisado individualmente e, com base nas avalia??es, prop?e-se um modelo h?brido de sele??o de m?tricas com o objetivo de determinar a m?trica mais adequada a ser empregada. O modelo faz uso de um conjunto de features e heur?sticas que determinam uma solu??o semi-supervisionada para o WSD. Os resultados obtidos com experimentos apontam melhoria na performance e revelam novas perspectivas de pesquisa. O modelo proposto eleva a taxa de acerto a 68,48%, aumentando significativamente em 3,52% a taxa reportada na literatura
Identifer | oai:union.ndltd.org:IBICT/oai:tede2.pucrs.br:tede/5221 |
Date | 26 March 2013 |
Creators | Goulart, Rodrigo Rafael Vilarreal |
Contributors | Lima, Vera L?cia Strube de |
Publisher | Pontif?cia Universidade Cat?lica do Rio Grande do Sul, Programa de P?s-Gradua??o em Ci?ncia da Computa??o, PUCRS, BR, Faculdade de Inform?ca |
Source Sets | IBICT Brazilian ETDs |
Language | Portuguese |
Detected Language | Portuguese |
Type | info:eu-repo/semantics/publishedVersion, info:eu-repo/semantics/doctoralThesis |
Format | application/pdf |
Source | reponame:Biblioteca Digital de Teses e Dissertações da PUC_RS, instname:Pontifícia Universidade Católica do Rio Grande do Sul, instacron:PUC_RS |
Rights | info:eu-repo/semantics/openAccess |
Relation | 1974996533081274470, 500, 600, 1946639708616176246 |
Page generated in 0.0017 seconds