1

Slim Embedding Layers for Recurrent Neural Language Models

Li, Zhongliang 02 August 2018 (has links)
No description available.
2

Spectral Probabilistic Modeling and Applications to Natural Language Processing

Parikh, Ankur 01 August 2015 (has links)
Probabilistic modeling with latent variables is a powerful paradigm that has led to key advances in many applications such as natural language processing, text mining, and computational biology. Unfortunately, while introducing latent variables substantially increases representation power, learning and modeling can become considerably more complicated. Most existing solutions largely ignore non-identifiability issues in modeling and formulate learning as a nonconvex optimization problem, where convergence to the optimal solution is not guaranteed due to local minima. In this thesis, we propose to tackle these problems through the lens of linear/multi-linear algebra. Viewing latent variable models from this perspective allows us to approach key problems such as structure learning and parameter learning using tools such as matrix/tensor decompositions, inversion, and additive metrics. These new tools enable us to develop novel solutions to learning in latent variable models with theoretical and practical advantages. For example, our spectral parameter learning methods for latent trees and junction trees are provably consistent, local-optima-free, and 1-2 orders of magnitude faster than EM for large sample sizes. In addition, we focus on applications in Natural Language Processing, using our insights not only to devise new algorithms, but also to propose new models. Our method for unsupervised parsing is the first algorithm that both has theoretical guarantees and is practical, performing favorably against the CCM method of Klein and Manning. We also developed power low-rank ensembles, a framework for language modeling that generalizes existing n-gram techniques to non-integer n. It consistently outperforms state-of-the-art Kneser-Ney baselines and can train on billion-word datasets in a few hours.
3

Generating Vocabulary Sets for Implicit Language Learning using Masked Language Modeling

January 2020 (has links)
Globalization is driving a rapid increase in motivation for learning new languages, with online and mobile language learning applications being an extremely popular method of doing so. Many language learning applications focus almost exclusively on aiding students in acquiring vocabulary, one of the most important elements in achieving fluency in a language. A well-balanced language curriculum must include both explicit vocabulary instruction and implicit vocabulary learning through interaction with authentic language materials. However, most language learning applications focus only on explicit instruction, providing little support for implicit learning. Students require support with implicit vocabulary learning because they need enough context to guess and acquire new words. Traditional techniques aim to teach students enough vocabulary to comprehend the text, thus enabling them to acquire new words. Despite the wide variety of support for vocabulary learning offered by learning applications today, few offer guidance on how to select an optimal vocabulary study set. This thesis proposes a novel method of student modeling which uses pre-trained masked language models to model a student's reading comprehension abilities and detect words which are required for comprehension of a text. It explores the efficacy of using pre-trained masked language models to model human reading comprehension and presents a vocabulary study set generation pipeline using this method. This pipeline creates vocabulary study sets for explicit language learning that enable comprehension while still leaving some words to be acquired implicitly. Promising results show that masked language modeling can be used to model human comprehension and that the pipeline produces reasonably sized vocabulary study sets. / Dissertation/Thesis / Masters Thesis Software Engineering 2020
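The pipeline's core idea (mask each word and keep for explicit study only the words the model cannot predict from context) can be sketched in a few lines. This toy uses corpus frequency as a stand-in for a pretrained masked language model's score; the `predictability` function and the threshold below are illustrative assumptions, not the thesis's actual method or values.

```python
# Toy sketch of study-set generation: words the "model" finds too
# unpredictable in context go into the explicit study set; the rest are
# left to be acquired implicitly.  A real system would mask each token
# and query a pretrained masked language model (e.g. via a fill-mask
# pipeline); corpus frequency stands in for that score here.

def predictability(word, corpus_counts, total):
    """Toy stand-in for an MLM score: relative corpus frequency."""
    return corpus_counts.get(word, 0) / total

def study_set(text, corpus_counts, total, threshold=0.01):
    """Words below the predictability threshold need explicit study."""
    return sorted({w for w in text.split()
                   if predictability(w, corpus_counts, total) < threshold})

# Build a tiny "background corpus" frequency table.
corpus = "the cat sat on the mat the dog sat on the rug".split()
counts = {}
for w in corpus:
    counts[w] = counts.get(w, 0) + 1

print(study_set("the cat sat on the ottoman", counts, len(corpus)))
# -> ['ottoman']  (the only word unseen in the background corpus)
```

Swapping the frequency heuristic for a real masked-LM probability changes only `predictability`; the set-building step stays the same.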
4

A Language for Designing Process Maps

Malinova, Monika 13 June 2016 (has links) (PDF)
Business Process Management (BPM) is often adopted by organizations as a method to increase awareness and knowledge of their business processes. Business process modeling is used as a method to represent business processes in the form of business process models. The number of organizations adopting BPM is quickly increasing, and so too is the number of business process models produced as a result of BPM initiatives. Within a single organization the number of business process models often ranges from hundreds to even thousands. In order to handle such a large amount of business process models, organizations structure them with the help of a process architecture. It includes a process map, which is considered the top-most view of the process architecture, where the organization's business processes and the relations between them are visually and abstractly depicted. The details of each business process shown on the process map are stored in the lower levels of the corresponding process architecture. The purpose of a process map is to provide an overview of how an organization operates as a whole without necessarily going into the process details. Therefore, the design of a process map is vital not only for the understanding of the company's processes, but also for the subsequent detailed process modeling. This is primarily because a process map is often the result of the process identification phase of the BPM lifecycle and is used as a foundation for the subsequent phases, where the detailed process modeling and process improvement take place. Despite their importance, the design of process maps is still more art than science, essentially because there is no standardized modeling language available for process map design. As a result, we are faced with a high heterogeneity of process map designs in practice, although they all serve a similar purpose. This has accordingly been our main motivation for pursuing the research presented in this thesis. 
The research question for this thesis is the following: How to effectively model processes on an abstract level? In this thesis, we document the development of a language for designing process maps. In particular, we provide the following contributions. First, we present a holistic reference BPM framework. It is a consolidation of procedural frameworks introduced by prominent BPM researchers. The framework includes eleven BPM elements, each holding activities organizations need to consider when adopting BPM. Second, we provide a method for assessing the cognitive effectiveness of process maps used in practice. For this, we follow the nine principles for cognitively effective visual notations introduced by Moody (2012). In addition, we employ cognitive fit theory to check whether the design of process maps has an effect on BPM success in the respective organization. Third, we conduct a systematic literature review on the quality of modeling languages and models. We use the quality requirements we found as a basis for developing the language for designing process maps. Fourth, we define the abstract syntax, semantics, and concrete syntax of the language for process maps. We follow an explorative method, hence we rely on empirical data for the language development. Accordingly, we reuse symbols in our language which have already been used in practice as part of process maps. We follow this approach in order to ensure the language consists of elements already familiar to organizations. We evaluate the language by means of an experiment, in which we assess the effectiveness and efficiency of process maps designed using elements from our language against process maps that were not designed using our language. Last, this thesis provides a method for testing the suitability of existing languages for specific purposes. (author's abstract)
5

Lecture transcription systems in resource-scarce environments / Pieter Theunis de Villiers

De Villiers, Pieter Theunis January 2014 (has links)
Classroom note taking is a fundamental task performed by learners on a daily basis. These notes provide learners with valuable offline study material, especially in the case of more difficult subjects. The use of class notes has been found not only to provide students with a better learning experience, but also to lead to higher overall academic performance. In a previous study, an increase of 10.5% in student grades was observed after these students had been provided with multimedia class notes. This is not surprising, as other studies have found that the rate of successful transfer of information to humans increases when both visual and audio information are provided. Note taking might seem like an easy task; however, students with hearing impairments, visual impairments, physical impairments, or learning disabilities, and even non-native listeners, find this task very difficult, if not impossible. It has also been reported that even non-disabled students find note taking time-consuming and that it requires a great deal of mental effort while also trying to pay full attention to the lecturer. This is illustrated by a study which found that college students were only able to record ~40% of the data presented by the lecturer. It is thus reasonable to expect an automatic way of generating class notes to be beneficial to all learners. Lecture transcription (LT) systems are used in educational environments to assist learners by providing them with real-time in-class transcriptions, or with recordings and transcriptions for offline use. Such systems have already been successfully implemented in the developed world, where all required resources are easily obtained. These systems are typically trained on hundreds to thousands of hours of speech, while their language models are trained on millions or even hundreds of millions of words. Such amounts of data are generally not available in the developing world. 
In this dissertation, a number of approaches toward the development of LT systems in resource-scarce environments are investigated. We focus on different approaches to obtaining sufficient amounts of well-transcribed data for building acoustic models, using corpora with few transcriptions and of variable quality. One approach investigates the use of a dynamic programming phone string alignment procedure to harvest as much usable data as possible from approximately transcribed speech data. We find that target-language acoustic models are optimal for this purpose, but encouraging results are also found when using models from another language for alignment. Another approach entails unsupervised training methods, where an initial low-accuracy recognizer is used to transcribe a set of untranscribed data. From this poorly transcribed data, correctly recognized portions are extracted based on a word confidence threshold. The initial system is then retrained along with the newly recognized data in order to increase its overall accuracy. The initial acoustic models are trained using as little as 11 minutes of transcribed speech. After several iterations of unsupervised training, a noticeable increase in accuracy was observed (47.79% WER to 33.44% WER). Similar results were, however, found (35.97% WER) after using a large speaker-independent corpus to train the initial system. Usable LMs were also created using as few as 17,955 words from transcribed lectures; however, this resulted in large out-of-vocabulary rates. This problem was solved by means of LM interpolation, which was found to be very beneficial in cases where subject-specific data (such as lecture slides and books) was available. We also introduce our NWU LT system, which was developed for use in learning environments and was designed using a client/server architecture. 
Based on the results found in this study we are confident that usable models for use in LT systems can be developed in resource-scarce environments. / MSc (Computer Science), North-West University, Vaal Triangle Campus, 2014
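The confidence-filtered self-training loop described in the abstract above can be sketched as follows. The recognizer here is a stub lexicon lookup, and the 0.9 threshold and confidence values are illustrative placeholders for a real ASR system's word posteriors, not values from the dissertation.

```python
# Toy sketch of one iteration of confidence-filtered unsupervised
# training: transcribe untranscribed utterances with the current
# system, keep only hypotheses above a word-confidence threshold, and
# hand those back as pseudo-transcriptions for retraining.

def recognize(utterance, known_words):
    """Stub recognizer: high confidence for in-lexicon words, low otherwise."""
    if utterance in known_words:
        return utterance, 0.95
    return "<unk>", 0.30

def self_train_step(untranscribed, known_words, threshold=0.9):
    """Split data into confidently recognized (usable) and rejected parts."""
    accepted, remaining = [], []
    for utt in untranscribed:
        hyp, conf = recognize(utt, known_words)
        if conf >= threshold:
            accepted.append(hyp)   # usable pseudo-transcription
        else:
            remaining.append(utt)  # left for a later iteration
    return accepted, remaining

accepted, remaining = self_train_step(["hello", "world", "zzz"],
                                      {"hello", "world"})
print(accepted, remaining)  # -> ['hello', 'world'] ['zzz']
```

In a real system the "retrain" step would re-estimate the acoustic models on the accepted audio, so each iteration's recognizer is stronger than the last; this toy only shows the filtering contract.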
6

Head Start Teacher Professional Development on Language Modeling and Children's Language Development: A Sequential Mixed Methods Design

Terrell, LaTrenda 01 December 2017 (has links)
Poverty is known to affect many areas of life for poor children, particularly young children’s language development. To address language development issues as well as other educational needs, the Head Start Program was created. The purpose of this sequential mixed-methods study was to describe the professional development experiences of Head Start teachers on language modeling. In addition, this study sought to explore teachers’ views on language modeling and the activities they find most effective to support student learning. Analysis of the data revealed that teachers wanted more training and workshops, to be paired with a mentor/coach, pay raises for achieving higher education, strategies for working with children, and encouragement from administration to effectively achieve their professional development plans and goals. Additionally, teachers demonstrated an understanding of the importance of language modeling for children to build vocabulary, to improve school readiness goals, and to communicate and express their needs. Finally, teachers felt very strongly that they use frequent conversations, wait for student responses during conversations, use back and forth conversations, encourage peer conversations, use more than one word as well as a variety of words to support children’s language development. Findings from this study may be utilized to provide the necessary support teachers need to improve their language modeling skills and to help programs in their planning and evaluation of an ongoing professional development model. This study adds to the literature on bridging the gap between learning about practices and using them in the classroom to improve children’s language development by including teacher voices into their professional development and how to effectively implement coaching practices to promote teacher knowledge and skills.
7

Textual information retrieval : An approach based on language modeling and neural networks

Georgakis, Apostolos A. January 2004 (has links)
This thesis covers topics relevant to information organization and retrieval. The main objective of the work is to provide algorithms that can elevate the recall-precision performance of retrieval tasks in a wide range of applications, ranging from document organization and retrieval to web-document pre-fetching and, finally, clustering of documents based on novel encoding techniques.
The first part of the thesis deals with document organization and retrieval using unsupervised neural networks, namely the self-organizing map, together with statistical encoding methods for representing the available documents as numerical vectors. The objective of this part is to introduce a set of novel variants of the self-organizing map algorithm that address certain shortcomings of the original algorithm.
In the second part of the thesis, the latencies perceived by users browsing the Internet are shortened by means of a novel transparent and speculative pre-fetching algorithm. The proposed algorithm relies on a model of the user's browsing behaviour to predict their future actions, drawing on the contextual statistics of the web pages the user has visited.
Finally, the last chapter of the thesis provides preliminary theoretical results along with a general framework for current and future scientific work. The chapter describes the use of the Zipf distribution for document organization and the use of the AdaBoost algorithm to improve the performance of pre-fetching algorithms.
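The self-organizing map at the heart of the first part can be sketched compactly. This is a minimal one-dimensional toy in plain Python, not the thesis's variants: the scalar inputs, unit count, learning rate, and Gaussian neighborhood are illustrative assumptions (a real system trains on high-dimensional document vectors).

```python
# Minimal 1-D self-organizing map: for each input, find the
# best-matching unit (BMU) and pull it and its neighbors toward the
# input, with the pull decaying with grid distance from the BMU.
import math
import random

def train_som(data, n_units=4, epochs=50, lr=0.5, radius=1.0, seed=0):
    rng = random.Random(seed)
    weights = [rng.random() for _ in range(n_units)]
    for _ in range(epochs):
        for x in data:
            # best-matching unit: the weight closest to the input
            bmu = min(range(n_units), key=lambda i: abs(weights[i] - x))
            for i in range(n_units):
                # Gaussian neighborhood around the BMU on the 1-D grid
                h = math.exp(-((i - bmu) ** 2) / (2 * radius ** 2))
                weights[i] += lr * h * (x - weights[i])
        lr *= 0.95  # decay the learning rate over epochs
    return weights

# Two clusters of inputs: the trained map should spread its units so
# that some land near each cluster.
w = train_som([0.1, 0.12, 0.9, 0.88])
```

Document clustering then assigns each document vector to its BMU; the map's grid topology is what gives the "organized" layout the abstract refers to.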
10

Refinements in hierarchical phrase-based translation systems

Pino, Juan Miguel January 2015 (has links)
The relatively recently proposed hierarchical phrase-based translation model for statistical machine translation (SMT) has achieved state-of-the-art performance in numerous recent translation evaluations. Hierarchical phrase-based systems comprise a pipeline of modules with complex interactions. In this thesis, we propose refinements to the hierarchical phrase-based model as well as improvements and analyses in various modules of hierarchical phrase-based systems. We take advantage of increasing amounts of available training data for machine translation, as well as existing frameworks for distributed computing, in order to build better infrastructure for the extraction, estimation, and retrieval of hierarchical phrase-based grammars. We design and implement grammar extraction as a series of Hadoop MapReduce jobs. We store the resulting grammar using the HFile format, which offers competitive trade-offs in terms of efficiency and simplicity. We demonstrate improvements over two alternative solutions used in machine translation. The modular nature of the SMT pipeline, while allowing individual improvements, has the disadvantage that errors committed by one module are propagated to the next. This thesis alleviates this issue between the word alignment module and the grammar extraction and estimation module by considering richer statistics from word alignment models in extraction. We use alignment link and alignment phrase pair posterior probabilities for grammar extraction and estimation, and demonstrate translation improvements in Chinese-to-English translation. This thesis also proposes refinements in grammar and language modelling, both in the context of domain adaptation and in the context of the interaction between first-pass decoding and lattice rescoring. We analyse alternative strategies for grammar and language model cross-domain adaptation. We also study interactions between the first-pass and second-pass language models in terms of size and n-gram order. 
Finally, we analyse two smoothing methods for large 5-gram language model rescoring. The last two chapters are devoted to the application of phrase-based grammars to the string regeneration task, which we consider as a means to study the fluency of machine translation output. We design and implement a monolingual phrase-based decoder for string regeneration and achieve state-of-the-art performance on this task. By applying our decoder to the output of a hierarchical phrase-based translation system, we are able to recover the same level of translation quality as the translation system.
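The map/reduce flavor of the grammar-extraction pipeline described above can be sketched as a mapper that emits counted phrase pairs from aligned sentence pairs and a reducer that sums the counts. Running this at scale on Hadoop and storing the result in HFiles, as the thesis does, is beyond a toy example; the single-word "phrases" and the alignment format below are illustrative assumptions.

```python
# Map/reduce-style phrase-pair counting: the mapper emits
# ((src_phrase, tgt_phrase), 1) records from word-aligned sentence
# pairs; the reducer sums counts per key.  On Hadoop these two
# functions would run as separate distributed jobs.
from collections import defaultdict

def map_phrases(sentence_pairs):
    """Emit counted pairs; here each 1-1 alignment link yields a pair."""
    for src, tgt, links in sentence_pairs:
        for i, j in links:
            yield (src[i], tgt[j]), 1

def reduce_counts(pairs):
    """Sum the emitted counts per phrase pair."""
    counts = defaultdict(int)
    for key, n in pairs:
        counts[key] += n
    return dict(counts)

data = [
    (["la", "maison"], ["the", "house"], [(0, 0), (1, 1)]),
    (["la", "porte"], ["the", "door"], [(0, 0), (1, 1)]),
]
counts = reduce_counts(map_phrases(data))
print(counts[("la", "the")])  # "la" aligns to "the" in both sentences
```

The real extractor emits hierarchical rules with gaps rather than single words, but the mapper-emits/reducer-sums contract is the same, which is what makes the job easy to distribute.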
