Global ETD Search

Return to search

Extractive Text Summarization of Norwegian News Articles Using BERT

Extractive text summarization has over the years been an important research area in Natural Language Processing. Numerous methods have been proposed for extracting information from text documents. Recent works have shown great success for English summarization tasks by fine-tuning the language model BERT using large summarization datasets. However, less research has been made for low-resource languages. This work contributes by investigating how BERT can be used for Norwegian text summarization. Two models are developed by applying a modified BERT architecture, called BERTSum, on pre-trained Norwegian and Multilingual BERT. The results are models able to predict key sentences from articles to generate bullet-point summaries. These models are evaluated with the automatic metric ROUGE and in this evaluation, the Multilingual BERT model outperforms the Norwegian model. The multilingual model is further evaluated in a human evaluation by journalists, revealing that the generated summaries are not entirely satisfactory in some aspects. With some improvements, the model shows to be a valuable tool for journalists to edit and rewrite generated summaries, saving time and workload. / <p>Examensarbetet är utfört vid Institutionen för teknik och naturvetenskap (ITN) vid Tekniska fakulteten, Linköpings universitet</p>

http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-176598

extractive text summarization

Datavetenskap (datalogi)

Identifer	oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:liu-176598
Date	January 2021
Creators	Biniam, Thomas Indrias, Morén, Adam
Publisher	Linköpings universitet, Medie- och Informationsteknik, Linköpings universitet, Tekniska fakulteten, Linköpings universitet, Medie- och Informationsteknik, Linköpings universitet, Tekniska fakulteten
Source Sets	DiVA Archive at Upsalla University
Language	English
Detected Language	English
Type	Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format	application/pdf
Rights	info:eu-repo/semantics/openAccess

Page generated in 0.0023 seconds

Extractive Text Summarization of Norwegian News Articles Using BERT

Description

Links & Downloads

Tags

Additional Fields