Unsupervised Method for Disease Named Entity Recognition

Diseases play a central role in biomedical research, and many studies aim to improve access to disease information by designing named entity recognition models that make use of the available information. Disease recognition has been tackled by various approaches, the best known being lexical and supervised approaches. However, these approaches have drawbacks, as their performance depends on the amount of human-annotated data available. Moreover, lexical approaches cannot distinguish between real mentions of diseases and mentions of other entities that share the same name or acronym. The challenge of this project is to find a model that combines the strengths of the lexical and supervised approaches in a single named entity recognizer. We demonstrate that our model can accurately identify disease name mentions in text by using word embeddings to capture the context of each mention, which enables the model to decide whether or not it is a real disease mention. We evaluate our model on a gold standard data set, on which it achieves a precision of 84% and an accuracy of 96%. Finally, we compare the performance of our model to different statistical named entity recognition models, and the results show that our model outperforms the unsupervised lexical approaches.
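The idea described in the abstract, a lexicon match that is then confirmed or rejected from its context embedding, can be illustrated with a minimal sketch. This is not the thesis's implementation: the lexicon, the two-dimensional toy embeddings, the `DISEASE_CONTEXT` reference vector, the window size, and the threshold are all hypothetical values chosen only to make the example run.

```python
import math

# Hypothetical toy data. A real system would use a curated disease lexicon
# and embeddings trained on biomedical text.
LEXICON = {"als", "asthma"}
EMBEDDINGS = {
    "patients": [0.9, 0.1],
    "diagnosed": [0.8, 0.2],
    "symptoms": [0.9, 0.0],
    "with": [0.5, 0.5],
    "function": [0.1, 0.9],
    "returns": [0.0, 1.0],
    "value": [0.1, 0.8],
}
# Assumed reference vector standing in for "typical disease context".
DISEASE_CONTEXT = [1.0, 0.0]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def context_vector(tokens, i, window=2):
    """Average the embeddings of the tokens around position i."""
    neighbours = tokens[max(0, i - window):i] + tokens[i + 1:i + 1 + window]
    vectors = [EMBEDDINGS[t] for t in neighbours if t in EMBEDDINGS]
    if not vectors:
        return None
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def find_disease_mentions(text, threshold=0.8):
    """Return (position, token) pairs for lexicon hits whose context looks disease-like."""
    tokens = text.lower().split()
    mentions = []
    for i, tok in enumerate(tokens):
        if tok not in LEXICON:
            continue  # lexical step: only lexicon entries are candidates
        ctx = context_vector(tokens, i)
        # Context step: keep the candidate only if its surroundings
        # are close to the assumed disease-context vector.
        if ctx is not None and cosine(ctx, DISEASE_CONTEXT) >= threshold:
            mentions.append((i, tok))
    return mentions
```

With this toy data, the acronym "als" is kept in "patients diagnosed with als symptoms" and rejected in "function als returns value", which is the kind of same-name ambiguity that a purely lexical matcher cannot resolve.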

Text Mining

NER

Name Entity Recognition

Disease Name

Identifier	oai:union.ndltd.org:kaust.edu.sa/oai:repository.kaust.edu.sa:10754/659966
Date	06 November 2019
Creators	Almutairi, Abeer N.
Contributors	Hoehndorf, Robert, Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Moshkov, Mikhail, Laleg-Kirati, Taous-Meriem
Source Sets	King Abdullah University of Science and Technology
Language	English
Detected Language	English
Type	Thesis
