Global ETD Search

11	Using a Character-Based Language Model for Caption Generation / Användning av teckenbaserad språkmodell för generering av bildtext
Keisala, Simon, January 2019
Using AI to automatically describe images is a challenging task. This study compares character-based language models with im2txt, one of the current state-of-the-art token-based language models, for generating image captions, with a focus on morphological correctness. Previous work has shown that character-based language models can outperform token-based language models in morphologically rich languages. Other studies show that simple multi-layered LSTM blocks can learn to replicate the syntax of their training data. To study the usability of character-based language models, an alternative model based on TensorFlow im2txt was created. The model changes the token-generation architecture to handle character-sized tokens instead of word-sized tokens. The results suggest that a character-based language model could outperform current token-based language models, although time and computing-power constraints prevent this study from drawing a clear conclusion. A problem with one of the methods, subsampling, is discussed: applied unchanged to character-sized tokens, it removes individual characters (including special characters) instead of full words. To solve this, a two-phase approach is suggested in which the training data is first split into word-sized tokens and subsampled, and the remaining tokens are then split into character-sized tokens. Future work that applies the modified subsampling and fine-tunes the hyperparameters is suggested to reach a clearer conclusion about the performance of character-based language models.
Keywords: Natural Language Processing; NLP; Machine Learning; ML; Neural Network; Caption Generation; Deep Learning; Recurrent Neural Network; Long Short-Term Memory; LSTM; word2vec; Language Model
12	The Invention of Access: Speech-to-Text Writing and the Emergent Methodologies of Disability Service Transcription
Iwertz, Chad Everett, 02 October 2019
No description available.
Keywords: Composition; Technology; Technical Communication; Rhetoric; Multicultural Education; composition studies; transcription studies; disability studies; rhetorical invention; grounded theory; qualitative analysis; quantitative analysis; caption studies; speech-to-text writing; methodologies of transcription
13	Exploring Attention Based Model for Captioning Images
Xu, Kelvin, 12 1900
No description available.
Keywords: Reseaux de Neurones; Generation de Description; Apprentissage Profond; Apprentissage de Representations; Apprentissage Supervise; Inference Variationelle; Apprentissage par Renforcement; Attention; Modelisation de Donnees Sequentielles; Neural Networks; Caption Generation; Deep Learning; Representation Learning; Supervised Learning; Variational Inference; Reinforcement Learning; Attention; Sequence Modelling
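
Note on entry 11: the two-phase subsampling suggested in that abstract (subsample at word level first, then split the surviving words into characters) could be sketched roughly as below. This is an illustrative sketch, not the thesis's implementation. It assumes the word2vec-style rule of keeping a word with probability sqrt(t / f(w)); the abstract does not state which rule or threshold was used. The function names, the `<space>` marker, and the default threshold are all hypothetical.

```python
import math
import random
from collections import Counter


def word_frequencies(captions):
    """Relative frequency of each whitespace-separated word in a corpus."""
    counts = Counter(w for caption in captions for w in caption.split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def subsample_words(words, freqs, threshold=1e-3, rng=None):
    """Phase 1: drop whole words at random, frequent words more often.

    Assumed rule (word2vec-style): keep a word with probability
    sqrt(threshold / f(w)), capped at 1. Unseen words are always kept.
    """
    if rng is None:
        rng = random.Random(0)
    kept = []
    for w in words:
        f = freqs.get(w, 0.0)
        p_keep = 1.0 if f <= threshold else math.sqrt(threshold / f)
        if rng.random() < p_keep:
            kept.append(w)
    return kept


def to_char_tokens(words, space="<space>"):
    """Phase 2: split surviving words into character-sized tokens."""
    tokens = []
    for i, w in enumerate(words):
        if i > 0:
            tokens.append(space)  # hypothetical word-boundary marker
        tokens.extend(w)
    return tokens


def two_phase_tokens(caption, freqs, threshold=1e-3, rng=None):
    """Subsample at word level, then emit character-sized tokens."""
    words = subsample_words(caption.split(), freqs, threshold, rng)
    return to_char_tokens(words)
```

Because words are removed before the character split, any word that survives keeps all of its characters, which is the property the abstract says is lost when subsampling is applied directly to character-sized tokens.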
