Global ETD Search

Return to search

Context matters : Classifying Swedish texts using BERT's deep bidirectional word embeddings

When classifying texts using a linear classifier, the texts are commonly represented as feature vectors. Previous methods to represent features as vectors have been unable to capture the context of individual words in the texts, in theory leading to a poor representation of natural language. Bidirectional Encoder Representations from Transformers (BERT), uses a multi-headed self-attention mechanism to create deep bidirectional feature representations, able to model the whole context of all words in a sequence. A BERT model uses a transfer learning approach, where it is pre-trained on a large amount of data and can be further fine-tuned for several down-stream tasks. This thesis uses one multilingual, and two dedicated Swedish BERT models, for the task of classifying Swedish texts as of either easy-to-read or standard complexity in their respective domains. The performance on the text classification task using the different models is then compared both with feature representation methods used in earlier studies, as well as with the other BERT models. The results show that all models performed better on the classification task than the previous methods of feature representation. Furthermore, the dedicated Swedish models show better performance than the multilingual model, with the Swedish model pre-trained on more diverse data outperforming the other.

http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-166304

NLP

text classification

BERT

feature representation

pre-trained language models

transformer networks

fine-tuning

Identifer	oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:liu-166304
Date	January 2020
Creators	Holmer, Daniel
Publisher	Linköpings universitet, Institutionen för datavetenskap
Source Sets	DiVA Archive at Upsalla University
Language	English
Detected Language	English
Type	Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format	application/pdf
Rights	info:eu-repo/semantics/openAccess

Page generated in 0.0022 seconds

Context matters : Classifying Swedish texts using BERT's deep bidirectional word embeddings

Description

Links & Downloads

Tags

Additional Fields