Global ETD Search

1	Study of Pretraining Bias and Frequencies Taware, Rutuja Murlidhar 10 July 2023 (has links) Usage of language models in an in-context learning environment has been adapted for a wide range of tasks. Recent works have showcased the impact of pretraining data on the in-context performance of language models. In this work, we experiment with numbers having high and low frequencies in the pretraining data to understand the impact of term frequencies on the model's performance. We also experiment with random and adversarial demonstrations to understand the pretraining bias present in the model. Through these experiments, we showcase the importance of pretraining frequencies of the numbers present in the demonstrations and explain how highly frequent terms can be used in the demonstrations to achieve better task performance. Moreover, we also show the impact of pretraining bias on the model's performance and explain how the model overcomes this bias with more demonstrations. / Master of Science / Recent works focus on understanding and improving the arithmetic capabilities of the state-of-the-art (SOTA) systems in the domain of Natural Language Processing (NLP). This work focuses on designing and performing novel experiments to analyze the impact of training data on the performance of such systems. Through these experiments, this work showcases interesting properties of the SOTA systems which will promote future research to understand them better as well as help in creating better downstream applications. In-Context Learning Pretraining Frequency Bias Language Model
2	Design, implementation and evaluation of an in-context learning support program for first year education students and its impact on educational outcomes. de la Harpe, Barbara I. January 1998 (has links) This research was concerned with furthering theoretical and practical understanding of student learning at university through a longitudinal, cross-sectional, in-depth study of first year students in a specific learning context, namely Educational Psychology. The main aim of the study was to investigate ways of assisting students to be effective learners. The particular role that affect played in learning and the relationship between learning behaviour and learning outcomes, was explored. A Conceptual Model of student learning incorporating student cognition, metacognition, motivation, affect and academic performance in a specific social and cultural context, underpinned the study. The study documented the design, implementation and evaluation - from both the students' and teacher's perspectives - of an in-context learning support program for first year students, using both quantitative and qualitative methodologies.The program was based on a theoretical framework which integrated cognitive, behavioural and social learning perspectives and focussed on increasing students' repertoire of learning strategies, promoting their higher level thinking and understanding, developing their metacognitive skills and managing their affect. It included an emphasis on student goal setting and time management, reading and writing strategies, learning for tests and exams, self-management, reflecting on and evaluating learning, and dealing with test anxiety.The main findings of the study were that providing in-context learning support was associated with positive changes in students' learning strategy use, motivational orientations, and affective reactions. Students valued teacher support and instructional strategies that promoted active learning. The instructor found that providing learning support was more challenging and rewarding than teaching content alone. The role of context - ++ / in particular, assessment tasks - in learning, was highlighted. The implications for teaching and learning were examined and the Conceptual Model was further refined. The research resulted in a more holistic and integrated perspective on learning support provision and on the role of cognitive, metacognitive, motivational and affective factors, and academic performance, in student learning.
3	The vocabulary learning behavior of Romanian high school students in a digital context Cojocnean, Diana Maria January 2015 (has links) This thesis investigates the vocabulary learning behavior of Romanian high school students in a digital context. The research identifies the vocabulary learning strategies used by EFL high school students and focuses on how the choice of vocabulary learning strategies varies across four independent variables: students' age, gender, academic profile (math-ICT, humanities, science and economic-technical) and language program (intensive English, bilingual, normal). These variables are hypothesized to influence learners' vocabulary behavior. Furthermore, the study examines the technology enhanced tools (computer and mobile assisted language learning tools) used by these students in their vocabulary learning as well as their attitudes towards using technology in vocabulary learning. Likewise, the study analyzes how students' choice of technology enhanced tools and their attitudes towards them vary across the four independent variables. The study is a mixed methods investigation with 1,239 participants (60% female, 40% male, aged 14-19 years old) learning English as a foreign language in nine Romanian secondary schools. Of the 1,239 participants who filled in the self-reported questionnaire, 43 also participated in focus group discussions prior to the administration of the questionnaire. The quantitative data were analyzed using descriptive and inferential statistics procedures whereas the qualitative data were analyzed thematically. The results from both phases were integrated in the results chapter. The main findings indicated that Romanian high school students prefer social strategies, followed by determination, metacognitive, cognitive and memory strategies. However, the usage of the strategies in these categories is medium towards low. As for individual vocabulary learning strategies, the participants reported that the impact of a new word, English media, guessing from context, associating the word with a picture and using cognates are frequently used strategies. The results also indicated that students' use of vocabulary learning strategies varies across the four independent variables. As far as the use of digital tools for vocabulary learning, the findings indicated that the students in this particular cultural context use few available digital tools with a preference for online dictionaries, games and social networking web sites. The results showed that overall Romanian students are not very familiar with computer and mobile assisted language learning tools, their attitudes towards the use of digital tools for vocabulary learning are neutral and they mostly associate the use of personal devices with their personal space, suggesting that they may not want to embed learning in their everyday activities. The results enrich existing knowledge of vocabulary learning strategies in a Romanian cultural context and they also give us an insight into how high school students use computer and mobile assisted language tools in their vocabulary learning. Implications for theory and practice are also discussed. 428.0071
4	Curriculum design in higher education using a learning outcome-led model : its influence on how students perceive learning Allan, Joanna January 1997 (has links) This thesis examines the potential of a learning outcome-led model of curriculum design to influence how students perceive learning in education studies within a modular context of a new university. It identifies and compares the conceptions of learning held by students and lecturers on traditional and outcome-led modules, and it explores and specifies the design factors which shape these conceptions. The issue is located within the interpretivist paradigm for the research seeks understanding which derives from the perceptions, attitudes and beliefs that students and their lecturers hold about learning in a given context. But the methodology employed is not wholly consistent with this paradigm, for a qualitative approach is complemented by the use of factor analysis techniques to facilitate the identification of the design features which influence how students perceive learning. The approach is thus eclectic drawing on quantitative methods to examine what is essentially qualitative data. An innovative model of learning outcome-led design is proposed, implemented and modified as a result of the research. The learner is placed at the centre of the learning experience which is defined as incorporating three domains: the teaching context; the assessment régime; and the directed learning undertaken by students outside of taught sessions. The model incorporates a trichotomy of outcomes which define the subject -specific, the transferable skills and the generic academic outcomes which influence directly both the content and process of learning, and which successful students are expected to achieve on completion of a module. The findings show that five design features influence how students perceive learning: the clarity of expectations; congruence between the content and process of each domain of the learning experience; direction in respect to the learning activities which should be undertaken in each domain to achieve the outcomes; and the content and process of the teaching context. The data suggest that a much higher profile should be given to metacognitive skills in curriculum development in HE because how students perceive both the process and the content of learning profoundly influences their conception of learning and, consistent with the underpinning theory, how they approach learning and therefore ultimately the kind of outcomes they achieve. The research leads to recommendations for the modification of the three models of learning in context; Ramsden (1988), Biggs (1990b) and Prosser (1995), which are presented and analysed in the thesis. The findings suggest that the learning experience should be redefined to specify the three domains - the teaching context, assessment régime and directed learning - and that clarity of expectations, metacognitive skills and congruence between the content and process of learning in each of the domains should be articulated as directly influencing students' conceptions of learning. The models should also seek to indicate that learning outcomes influence how students perceive learning, and that therefore they feature both at the starting point and as the end product of a contextualised learning process. The findings relating to students' conceptions of learning show that the study of outcome-led modules has resulted in a much greater degree of congruence between how lecturers and students perceive learning in a given module and that fewer students studying outcome-led modules hold a quantitative conception of learning. This suggests that the outcome-led model does have the potential to improve teaching and learning and consequently that there is an educational rationale for curriculum development premised on this model. 370
5	GENERATING SQL FROM NATURAL LANGUAGE IN FEW-SHOT AND ZERO-SHOT SCENARIOS Asplund, Liam January 2024 (has links) Making information stored in databases more accessible to users inexperienced in structured query language (SQL) by converting natural language to SQL queries has long been a prominent research area in both the database and natural language processing (NLP) communities. There have been numerous approaches proposed for this task, such as encoder-decoder frameworks, semantic grammars, and more recently with the use of large language models (LLMs). When training LLMs to successfully generate SQL queries from natural language questions there are three notable methods used, pretraining, transfer learning and in-context learning (ICL). ICL is particularly advantageous in scenarios where the hardware at hand is limited, time is of concern and large amounts of task specific labled data is nonexistent. This study seeks to evaluate two strategies in ICL, namely zero-shot and few-shot scenarios using the Mistral-7B-Instruct LLM. Evaluation of the few-shot scenarios was conducted using two techniques, random selection and Jaccard Similarity. The zero-shot scenarios served as a baseline for the few-shot scenarios to overcome, which ended as anticipated, with the few-shot scenarios using Jaccard similarity outperforming the other two methods, followed by few-shot scenarios using random selection coming in at second best, and the zero-shot scenarios performing the worst. Evaluation results acquired based on execution accuracy and exact matching accuracy confirm that leveraging similarity in demonstrating examples when prompting the LLM will enhance the models knowledge about the database schema and table names which is used during the inference phase leadning to more accurately generated SQL queries than leveraging diversity in demonstrating examples. In-context learning Few-shot scenarios Zero-shot scenarios Large language model Prompt engineering Jaccard Similarity Computer Sciences Datavetenskap (datalogi)
6	Learning objects model and context for recognition and localisation / Apprentissage de modèles et contextes d'objets pour la reconnaissance et la localisation Manfredi, Guido 18 September 2015 (has links) Cette thèse traite des problèmes de modélisation, reconnaissance, localisation et utilisation du contexte pour la manipulation d'objets par un robot. Le processus de modélisation se divise en quatre composantes : le système réel, les données capteurs, les propriétés à reproduire et le modèle. En spécifiant chacune des ces composantes, il est possible de définir un processus de modélisation adapté au problème présent, la manipulation d'objets par un robot. Cette analyse mène à l'adoption des descripteurs de texture locaux pour la modélisation. La modélisation basée sur des descripteurs de texture locaux a été abordé dans de nombreux travaux traitant de structure par le mouvement (SfM) ou de cartographie et localisation simultanée (SLAM). Les méthodes existantes incluent Bundler, Roboearth et 123DCatch. Pourtant, aucune de ces méthodes n'a recueilli le consensus. En effet, l'implémentation d'une approche similaire montre que ces outils sont difficiles d'utilisation même pour des utilisateurs experts et qu'ils produisent des modèles d'une haute complexité. Cette complexité est utile pour fournir un modèle robuste aux variations de point de vue. Il existe deux façons pour un modèle d'être robuste : avec le paradigme des vues multiple ou celui des descripteurs forts. Dans le paradigme des vues multiples, le modèle est construit à partir d'un grand nombre de points de vue de l'objet. Le paradigme des descripteurs forts compte sur des descripteurs résistants aux changements de points de vue. Les expériences réalisées montrent que des descripteurs forts permettent d'utiliser un faible nombre de vues, ce qui résulte en un modèle simple. Ces modèles simples n'incluent pas tout les point de vus existants mais les angles morts peuvent être compensés par le fait que le robot est mobile et peut adopter plusieurs points de vue. En se basant sur des modèles simples, il est possible de définir des méthodes de modélisation basées sur des images seules, qui peuvent être récupérées depuis Internet. A titre d'illustration, à partir d'un nom de produit, il est possible de récupérer des manières totalement automatiques des images depuis des magasins en ligne et de modéliser puis localiser les objets désirés. Même avec une modélisation plus simple, dans des cas réel ou de nombreux objets doivent être pris en compte, il se pose des problèmes de stockage et traitement d'une telle masse de données. Cela se décompose en un problème de complexité, il faut traiter de nombreux modèles rapidement, et un problème d'ambiguïté, des modèles peuvent se ressembler. L'impact de ces deux problèmes peut être réduit en utilisant l'information contextuelle. Le contexte est toute information non issue des l'objet lui même et qui aide a la reconnaissance. Ici deux types de contexte sont abordés : le lieu et les objets environnants. Certains objets se trouvent dans certains endroits particuliers. En connaissant ces liens lieu/objet, il est possible de réduire la liste des objets candidats pouvant apparaître dans un lieu donné. Par ailleurs l'apprentissage du lien lieu/objet peut être fait automatiquement par un robot en modélisant puis explorant un environnement. L'information appris peut alors être fusionnée avec l'information visuelle courante pour améliorer la reconnaissance. Dans les cas des objets environnants, un objet peut souvent apparaître au cotés d'autres objets, par exemple une souris et un clavier. En connaissant la fréquence d'apparition d'un objet avec d'autres objets, il est possible de réduire la liste des candidats lors de la reconnaissance. L'utilisation d'un Réseau de Markov Logique est particulièrement adaptée à la fusion de ce type de données. Cette thèse montre la synergie de la robotique et du contexte pour la modélisation, reconnaissance et localisation d'objets. / This Thesis addresses the modeling, recognition, localization and use of context for objects manipulation by a robot. We start by presenting the modeling process and its components: the real system, the sensors' data, the properties to reproduce and the model. We show how, by specifying each of them, one can define a modeling process adapted to the problem at hand, namely object manipulation by a robot. This analysis leads us to the adoption of local textured descriptors for object modeling. Modeling with local textured descriptors is not a new concept, it is the subject of many Structure from Motion (SfM) or Simultaneous Localization and Mapping (SLAM) works. Existing methods include bundler, roboearth modeler and 123DCatch. Still, no method has gained widespread adoption. By implementing a similar approach, we show that they are hard to use even for expert users and produce highly complex models. Such complex techniques are necessary to guaranty the robustness of the model to view point change. There are two ways to handle the problem: the multiple views paradigm and the robust features paradigm. The multiple views paradigm advocate in favor of using a large number of views of the object. The robust feature paradigm relies on robust features able to resist large view point changes. We present a set of experiments to provide an insight into the right balance between both. By varying the number of views and using different features we show that small and fast models can provide robustness to view point changes up to bounded blind spots which can be handled by robotic means. We propose four different methods to build simple models from images only, with as little a priori information as possible. The first one applies to planar or piecewise planar objects and relies on homographies for localization. The second approach is applicable to objects with simple geometry, such as cylinders or spheres, but requires many measures on the object. The third method requires the use of a calibrated 3D sensor but no additional information. The fourth technique doesn't need a priori information at all. We apply this last method to autonomous grocery objects modeling. From images automatically retrieved from a grocery store website, we build a model which allows recognition and localization for tracking. Even using light models, real situations ask for numerous object models to be stored and processed. This poses the problems of complexity, processing multiple models quickly, and ambiguity, distinguishing similar objects. We propose to solve both problems by using contextual information. Contextual information is any information helping the recognition which is not directly provided by sensors. We focus on two contextual cues: the place and the surrounding objects. Some objects are mainly found in some particular places. By knowing the current place, one can restrict the number of possible identities for a given object. We propose a method to autonomously explore a previously labeled environment and establish a correspondence between objects and places. Then this information can be used in a cascade combining simple visual descriptors and context. This experiment shows that, for some objects, recognition can be achieved with as few as two simple features and the location as context. The objects surrounding a given object can also be used as context. Objects like a keyboard, a mouse and a monitor are often close together. We use qualitative spatial descriptors to describe the position of objects with respect to their neighbors. Using a Markov Logic Network, we learn patterns in objects disposition. This information can then be used to recognize an object when surrounding objects are already identified. This Thesis stresses the good match between robotics, context and objects recognition. Modélisation Reconnaissance Localisation Contexte Cooccurrence d'objets Réseaux logiques de Markov Structure par le mouvement SLAM Structure from motion Object modeling Object recognition Object localization Context learning Robotics
7	Découverte de contexte pour une adaptation automatique de services en intelligence ambiante / Context discovery for the automatic adaptation of services in ambient intelligence Benazzouz, Yazid 26 August 2011 (has links) Cette thèse s’intéresse à la problématique de l’adaptation automatique de services dans ledomaine de l’intelligence ambiante. L’étude de la littérature montre que la sensibilité aucontexte est devenue un élément central pour la conception et la mise en place de servicesadaptatifs. Cependant, sa prise en compte se limite généralement à des descriptionsélémentaires de situations ou à des modèles prédéfinis. Afin de permettre une adaptation auxchangements d’habitudes des utilisateurs, à la dynamique de l’environnement et àl’hétérogénéité des sources de perception, nous proposons des mécanismes de découverte decontexte et de situations déclencheurs d’adaptation. Ces mécanismes s’appuient sur destechniques de fouille de données et sont intégrés au sein d’une architecture d’adaptationautomatique de services. Ces travaux ont été réalisés et appliqués à des projets d’intelligenceambiante pour de l’assistance à des personnes et plus particulièrement dans le cadre du projetITEA- MIDAS. / This thesis addresses the problem of dynamic adaptation of services in the context of ambientintelligence applications. Literature study shows how context-awareness plays a central rolein the design and implementation of adaptive services. However, its use is still limited toelementary descriptions and predefined situational models. Dynamic adaptation should becapable of following user habits to yield dynamic answers to environmental change, and tosupport heterogeneous sources of context. To this end, we propose mechanisms to discovercontexts and situations that trigger adaptation. These mechanisms rely on data miningtechniques, and are integrated within an architecture for dynamic adaptation of services. Thiswork was carried out and applied to ambient intelligence projects for the elderly, providingsupport and assistance in their daily lives, particularly in the context of the ITEA-MIDASproject. Intelligence ambiante Sensibilité au contexte Service adaptatif Adaptation dynamique de services Apprentissage du contexte Maintien à domicile Environnement intelligent Ambient intelligence, Context-awareness Adaptive services Dynamic adaptation of services Context learning Assisted living Intelligent environment
8	ONLINE STATISTICAL INFERENCE FOR LOW-RANK REINFORCEMENT LEARNING Qiyu Han (18284758) 01 April 2024 (has links) <p dir="ltr">We propose a fully online procedure to conduct statistical inference with adaptively collected data. The low-rank structure of the model parameter and the adaptivity nature of the data collection process make this task challenging: standard low-rank estimators are biased and cannot be obtained in a sequential manner while existing inference approaches in sequential decision-making algorithms fail to account for the low-rankness and are also biased. To tackle the challenges previously outlined, we first develop an online low-rank estimation process employing Stochastic Gradient Descent with noisy observations. Subsequently, to facilitate statistical inference using the online low-rank estimator, we introduced a novel online debiasing technique designed to address both sources of bias simultaneously. This method yields an unbiased estimator suitable for parameter inference. Finally, we developed an inferential framework capable of establishing an online estimator for performing inference on the optimal policy value. In theory, we establish the asymptotic normality of the proposed online debiased estimators and prove the validity of the constructed confidence intervals for both inference tasks. Our inference results are built upon a newly developed low-rank stochastic gradient descent estimator and its non-asymptotic convergence result, which is also of independent interest.</p> Context learning Reinforcement learning Statistical data science Online Decision-Making Low-rank matrix estimation Stochastic Gradient Descent (SGD) Reinforcement Learning Online inference
9	Towards Novelty-Resilient AI: Learning in the Open World Trevor A Bonjour (18423153) 22 April 2024 (has links) <p dir="ltr">Current artificial intelligence (AI) systems are proficient at tasks in a closed-world setting where the rules are often rigid. However, in real-world applications, the environment is usually open and dynamic. In this work, we investigate the effects of such dynamic environments on AI systems and develop ways to mitigate those effects. Central to our exploration is the concept of \textit{novelties}. Novelties encompass structural changes, unanticipated events, and environmental shifts that can confound traditional AI systems. We categorize novelties based on their representation, anticipation, and impact on agents, laying the groundwork for systematic detection and adaptation strategies. We explore novelties in the context of stochastic games. Decision-making in stochastic games exercises many aspects of the same reasoning capabilities needed by AI agents acting in the real world. A multi-agent stochastic game allows for infinitely many ways to introduce novelty. We propose an extension of the deep reinforcement learning (DRL) paradigm to develop agents that can detect and adapt to novelties in these environments. To address the sample efficiency challenge in DRL, we introduce a hybrid approach that combines fixed-policy methods with traditional DRL techniques, offering enhanced performance in complex decision-making tasks. We present a novel method for detecting anticipated novelties in multi-agent games, leveraging information theory to discern patterns indicative of collusion among players. Finally, we introduce DABLER, a pioneering deep reinforcement learning architecture that dynamically adapts to changing environmental conditions through broad learning approaches and environment recognition. Our findings underscore the importance of developing AI systems equipped to navigate the uncertainties of the open world, offering promising pathways for advancing AI research and application in real-world settings.</p> Autonomous agents and multiagent systems Knowledge representation and reasoning Modelling and simulation Planning and decision making Context learning Deep learning Neural networks Reinforcement learning Novelty Context MDP CMDP Reinforcement Learning Deep Reinforcement Learning Collusion Novelty Detection Detection Adaptation
10	Multimodal Data Management in Open-world Environment K M A Solaiman (16678431) 02 August 2023 (has links) <p>The availability of abundant multimodal data, including textual, visual, and sensor-based information, holds the potential to improve decision-making in diverse domains. Extracting data-driven decision-making information from heterogeneous and changing datasets in real-world data-centric applications requires achieving complementary functionalities of multimodal data integration, knowledge extraction and mining, situationally-aware data recommendation to different users, and uncertainty management in the open-world setting. To achieve a system that encompasses all of these functionalities, several challenges need to be effectively addressed: (1) How to represent and analyze heterogeneous source contents and application context for multimodal data recommendation? (2) How to predict and fulfill current and future needs as new information streams in without user intervention? (3) How to integrate disconnected data sources and learn relevant information to specific mission needs? (4) How to scale from processing petabytes of data to exabytes? (5) How to deal with uncertainties in open-world that stem from changes in data sources and user requirements?</p> <p><br></p> <p>This dissertation tackles these challenges by proposing novel frameworks, learning-based data integration and retrieval models, and algorithms to empower decision-makers to extract valuable insights from diverse multimodal data sources. The contributions of this dissertation can be summarized as follows: (1) We developed SKOD, a novel multimodal knowledge querying framework that overcomes the data representation, scalability, and data completeness issues while utilizing streaming brokers and RDBMS capabilities with entity-centric semantic features as an effective representation of content and context. Additionally, as part of the framework, a novel text attribute recognition model called HART was developed, which leveraged language models and syntactic properties of large unstructured texts. (2) In the SKOD framework, we incrementally proposed three different approaches for data integration of the disconnected sources from their semantic features to build a common knowledge base with the user information need: (i) EARS: A mediator approach using schema mapping of the semantic features and SQL joins was proposed to address scalability challenges in data integration; (ii) FemmIR: A data integration approach for more susceptible and flexible applications, that utilizes neural network-based graph matching techniques to learn coordinated graph representations of the data. It introduces a novel graph creation approach from the features and a novel similarity metric among data sources; (iii) WeSJem: This approach allows zero-shot similarity matching and data discovery by using contrastive learning<br> to embed data samples and query examples in a high-dimensional space using features as a novel source of supervision instead of relevance labels. (3) Finally, to manage uncertainties in multimodal data management for open-world environments, we characterized novelties in multimodal information retrieval based on data drift. Moreover, we proposed a novelty detection and adaptation technique as an augmentation to WeSJem.<br> </p> <p>The effectiveness of the proposed frameworks, models, and algorithms was demonstrated<br> through real-world system prototypes that solved open problems requiring large-scale human<br> endeavors and computational resources. Specifically, these prototypes assisted law enforcement officers in automating investigations and finding missing persons.<br> </p> Knowledge representation and reasoning Natural language processing Data mining and knowledge discovery Information extraction and fusion Recommender systems Collaborative and social computing Knowledge and information management Context learning Semi- and unsupervised learning Multimodal Information Retrieval Data Integration Text Attribute Extraction Missing Persons Situational Knowledge Extraction Representation Learning

Search results