Global ETD Search

1	Low-Resource Natural Language Understanding in Task-Oriented Dialogue Louvan, Samuel 11 March 2022 (has links) Task-oriented dialogue (ToD) systems need to interpret the user's input to understand the user's needs (intent) and corresponding relevant information (slots). This process is performed by a Natural Language Understanding (NLU) component, which maps the text utterance into a semantic frame representation, involving two subtasks: intent classification (text classification) and slot filling (sequence tagging). Typically, new domains and languages are regularly added to the system to support more functionalities. Collecting domain-specific data and performing fine-grained annotation of large amounts of data every time a new domain and language is introduced can be expensive. Thus, developing an NLU model that generalizes well across domains and languages with less labeled data (low-resource) is crucial and remains challenging. This thesis focuses on investigating transfer learning and data augmentation methods for low-resource NLU in ToD. Our first contribution is a study of the potential of non-conversational text as a source for transfer. Most transfer learning approaches assume labeled conversational data as the source task and adapt the NLU model to the target task. We show that leveraging similar tasks from non-conversational text improves performance on target slot filling tasks through multi-task learning in low-resource settings. Second, we propose a set of lightweight augmentation methods that apply data transformation on token and sentence levels through slot value substitution and syntactic manipulation. Despite its simplicity, the performance is comparable to deep learning-based augmentation models, and it is effective on six languages on NLU tasks. Third, we investigate the effectiveness of domain adaptive pre-training for zero-shot cross-lingual NLU. In terms of overall performance, continued pre-training in English is effective across languages. This result indicates that the domain knowledge learned in English is transferable to other languages. In addition to that, domain similarity is essential. We show that intermediate pre-training data that is more similar – in terms of data distribution – to the target dataset yields better performance.
2	Crowdsourcing av data för Hybrid Code Networks Linné, Christoffer, Olausson, Pontus January 2020 (has links) Task-oriented dialogue systems are a popular way for organisations to generate extra value both internally and for customers. Modern approaches for these dialogue systems that use neural networks to enable training directly on written dialogues are very data hungry, which complicates their implementation. Crowdsourcing is an attractive solution for generating this type of training data, but the method also comes with several difficulties. We introduce a new method for generating training data based on parallel crowdsourcing of dialogues, as well as crowdsourced quality review. We use this method to collect a small dataset that takes place within the domain bus driver-traveler. We believe that this method offers an efficient way to collect new, high-quality datasets. Hybrid Code Networks is a model for dialogue systems that combines a neural network with domain-specific knowledge, and thus requires a significantly smaller amount of training data than other similar dialogue systems to achieve comparable performance. By combining Hybrid Code Networks with our new method for generating training data, we believe that the threshold for implementing task-oriented dialogue systems on domains with insufficient training data can be lowered. We implement Hybrid Code Networks and train the implementation on the collected dataset and achieve good results. / Uppgiftsorienterade dialogsystem är ett populärt sätt för företag att generera extra värde både internt och för kunder. Moderna modeller för dessa dialogsystem som använder neurala nätverk för att möjliggöra träning direkt på skriftliga dialoger är väldigt datahungriga, vilket försvårar implementationen av dessa. Crowdsourcing är en attraktiv lösning för att generera denna typ av träningsdata, men metoden kommer även med flera svårigheter. Vi introducerar en ny metod för generering av träningsdata som bygger på parallell crowdsourcing av dialoger, samt crowdsourcad kvalitetsgranskning. Vi använder denna metod för att samla in ett litet dataset som utspelar sig inom domänen busschaufför-resenär. Vi menar att denna metod erbjuder ett effektivt sätt att samla in nya, högkvalitativa dataset. Hybrid Code Networks är en modell för dialogsystem som kombinerar ett neuralt nätverk med domänspecifik kunskap, och som på så sätt kräver en betydligt mindre mängd träningsdata än andra liknande dialogsystem för att uppnå jämförbar prestanda. Genom att kombinera Hybrid Code Networks med vår nya metod för generering av träningsdata menar vi att man kan sänka tröskeln för att implementera uppgiftsorienterade dialogsystem på domäner med otillräcklig träningsdata. Vi implementerar Hybrid Code Networks och tränar implementationen på det insamlade datasetet, och uppnår goda resultat. Crowdsourcing Hybrid Code Networks Natural Dialogue Generation Task-Oriented Dialogue Systems Computer and Information Sciences Data- och informationsvetenskap
3	On iterated learning for task-oriented dialogue Singhal, Soumye 01 1900 (has links) Dans le traitement de langue et des système de dialogue, il est courant de pré-entraîner des modèles de langue sur corpus humain avant de les affiner par le biais d'un simulateur et de résolution de tâches. Malheuresement, ce type d'entrainement tend aussi à induire un phénomène connu sous le nom de dérive du langage. Concrétement, les propriétés syntaxiques et sémantiques de la langue intiallement apprise se détériorent: les agents se concentrent uniquement sur la résolution de la tâche, et non plus sur la préservation de la langue. En s'inspirant des travaux en sciences cognitives, et notamment l'apprentigssage itératif Kirby and Griffiths (2014), nous proposons ici une approche générique pour contrer cette dérive du langage. Nous avons appelé cette méthode Seeded iterated learning (SIL), ou apprentissage itératif capitalisé. Ce travail a été publié sous le titre (Lu et al., 2020b) et est présenté au chapitre 2. Afin d'émuler la transmission de la langue entre chaque génération d'agents, un agent étudiant est d'abord pré-entrainé avant d'être affiné de manière itérative, et ceci, en imitant des données échantillonnées à partir d'un agent enseignant nouvellement formé. À chaque génération, l'enseignant est créé en copiant l'agent étudiant, avant d'être de nouveau affiné en maximisant le taux de réussite de la tâche sous-jacente. Dans un second temps, nous présentons Supervised Seeded iterated learning (SSIL) dans le chapitre 3, où apprentissage itératif capitalisé avec supervision, qui a été publié sous le titre (Lu et al., 2020b). SSIL s'appuie sur SIL en le combinant avec une autre méthode populaire appelée Supervised SelfPlay (S2P) (Gupta et al., 2019), où apprentissage supervisé par auto-jeu. SSIL est capable d'atténuer les problèmes de S2P et de SIL, i.e. la dérive du langage dans les dernier stades de l'entrainement tout en préservant une plus grande diversité linguistique. Tout d'abord, nous évaluons nos méthodes dans sous la forme d'une preuve de concept à traver le Jeu de Lewis avec du langage synthetique. Dans un second temps, nous l'étendons à un jeu de traduction se utilisant du langage naturel. Dans les deux cas, nous soulignons l'efficacité de nos méthodes par rapport aux autres méthodes de la litterature. Dans le chapitre 1, nous discutons des concepts de base nécessaires à la compréhension des articles présentés dans les chapitres 2 et 3. Nous décrivons le problème spécifique du dialogue orienté tâche, y compris les approches actuelles et les défis auxquels ils sont confrontés : en particulier, la dérive linguistique. Nous donnons également un aperçu du cadre d'apprentissage itéré. Certaines sections du chapitre 1 sont empruntées aux articles pour des raisons de cohérence et de facilité de compréhension. Le chapitre 2 comprend les travaux publiés sous le nom de (Lu et al., 2020b) et le chapitre 3 comprend les travaux publiés sous le nom de (Lu et al., 2020a), avant de conclure au chapitre 4. / In task-oriented dialogue, pretraining on human corpus followed by finetuning in a simulator using selfplay suffers from a phenomenon called language drift. The syntactic and semantic properties of the learned language deteriorates as the agents only focuses on solving the task. Inspired by the iterative learning framework in cognitive science Kirby and Griffiths (2014), we propose a generic approach to counter language drift called Seeded iterated learning (SIL). This work was published as (Lu et al., 2020b) and is presented in Chapter 2. In an attempt to emulate transmission of language between generations, a pretrained student agent is iteratively refined by imitating data sampled from a newly trained teacher agent. At each generation, the teacher is created by copying the student agent, before being finetuned to maximize task completion.We further introduce Supervised Seeded iterated learning (SSIL) in Chapter 3, work which was published as (Lu et al., 2020a). SSIL builds upon SIL by combining it with the other popular method called Supervised SelfPlay (S2P) (Gupta et al., 2019). SSIL is able to mitigate the problems of both S2P and SIL namely late-stage training collapse and low language diversity. We evaluate our methods in a toy setting of Lewis Game, and then scale it up to the translation game with natural language. In both settings, we highlight the efficacy of our methods compared to the baselines. In Chapter 1, we talk about the core concepts required for understanding the papers presented in Chapters 2 and 3. We describe the specific problem of task-oriented dialogue including current approaches and the challenges they face: particularly, the challenge of language drift. We also give an overview of the iterated learning framework. Some sections in Chapter 1 are borrowed from the papers for coherence and ease of understanding. Chapter 2 comprises of the work published as (Lu et al., 2020b) and Chapter 3 comprises of the work published as (Lu et al., 2020a). Chapter 4 gives a conclusion on the work. deep-learning multi-agent learning task-oriented dialogue iterated learning multi-task learning language drift apprentissage en profondeur apprentissage multi-agents dialogue orienté tâche apprentissage itératif apprentissage multi-tâches dérive du langage natural language processing traitement du langage naturel

1

Page generated in 0.075 seconds