
Multimodalita ve strojovém překladu / Multimodality in Machine Translation

Abstract:
Traditionally, most natural language processing tasks are solved within the language itself, relying on distributional properties of words. The representation-learning abilities of deep learning have recently made it possible to use an additional source of information by grounding the representations in the visual modality. One of the tasks that attempt to exploit visual information is multimodal machine translation: translating image captions with access to the original image. The thesis summarizes joint processing of language and real-world images using deep learning. It gives an overview of the state of the art in multimodal machine translation and describes our original contributions to solving this task. We introduce methods of combining multiple inputs, possibly of different modalities, in recurrent and self-attentive sequence-to-sequence models, and we show results on multimodal machine translation and other tasks related to machine translation. Finally, we analyze how multimodality influences the semantic properties of the sentence representations learned by the networks and how this relates to translation quality.
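To make the multi-source combination mentioned in the abstract concrete, below is a minimal sketch, in PyTorch, of hierarchical attention combination over a textual and a visual encoder in a recurrent sequence-to-sequence decoder: each modality gets its own additive attention, and a second attention then weighs the resulting per-modality context vectors. This is a generic illustration of the strategy, not the thesis's actual code; all names and dimensions (HierarchicalAttention, additive_attention, dec_dim, etc.) are illustrative assumptions.

```python
import torch
import torch.nn as nn

def additive_attention(query, keys, w_q, w_k, v):
    """Bahdanau-style additive attention.
    query: (B, d_q); keys: (B, T, d_k) -> context (B, d_k), weights (B, T)."""
    energies = v(torch.tanh(w_q(query).unsqueeze(1) + w_k(keys))).squeeze(-1)
    weights = torch.softmax(energies, dim=-1)
    context = torch.bmm(weights.unsqueeze(1), keys).squeeze(1)
    return context, weights

class HierarchicalAttention(nn.Module):
    """Attend over each modality separately, then attend over the
    resulting per-modality context vectors (hypothetical sketch)."""

    def __init__(self, dec_dim, text_dim, img_dim, attn_dim):
        super().__init__()
        # First level: one additive attention per modality.
        self.txt_q = nn.Linear(dec_dim, attn_dim)
        self.txt_k = nn.Linear(text_dim, attn_dim)
        self.txt_v = nn.Linear(attn_dim, 1, bias=False)
        self.img_q = nn.Linear(dec_dim, attn_dim)
        self.img_k = nn.Linear(img_dim, attn_dim)
        self.img_v = nn.Linear(attn_dim, 1, bias=False)
        # Project both contexts to a shared size for the second level.
        self.txt_proj = nn.Linear(text_dim, attn_dim)
        self.img_proj = nn.Linear(img_dim, attn_dim)
        # Second level: attention over the two per-modality contexts.
        self.mod_q = nn.Linear(dec_dim, attn_dim)
        self.mod_k = nn.Linear(attn_dim, attn_dim)
        self.mod_v = nn.Linear(attn_dim, 1, bias=False)

    def forward(self, dec_state, text_states, img_states):
        # dec_state: (B, dec_dim); text_states: (B, T, text_dim);
        # img_states: (B, R, img_dim), e.g. R spatial regions of a CNN map.
        c_txt, _ = additive_attention(
            dec_state, text_states, self.txt_q, self.txt_k, self.txt_v)
        c_img, _ = additive_attention(
            dec_state, img_states, self.img_q, self.img_k, self.img_v)
        # Stack projected contexts and let a second attention weigh them.
        contexts = torch.stack(
            [self.txt_proj(c_txt), self.img_proj(c_img)], dim=1)  # (B, 2, attn_dim)
        return additive_attention(
            dec_state, contexts, self.mod_q, self.mod_k, self.mod_v)

if __name__ == "__main__":
    attn = HierarchicalAttention(dec_dim=512, text_dim=512, img_dim=2048, attn_dim=256)
    s = torch.randn(4, 512)          # decoder state
    txt = torch.randn(4, 20, 512)    # 20 source-token encoder states
    img = torch.randn(4, 196, 2048)  # 14x14 CNN feature-map regions
    ctx, w = attn(s, txt, img)
    print(ctx.shape, w.shape)        # (4, 256) and (4, 2)
```

The design point this sketch illustrates: the second-level attention gives the decoder an explicit, learned choice of when to rely on the image versus the source text, instead of concatenating the two contexts with fixed weight.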

Identifier: oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:408143
Date: January 2019
Creators: Libovický, Jindřich
Contributors: Pecina, Pavel, Specia, Lucia, Čech, Jan
Source Sets: Czech ETDs
Language: English
Detected Language: English
Type: info:eu-repo/semantics/doctoralThesis
Rights: info:eu-repo/semantics/restrictedAccess
