1 |
Key Concepts, Potentials and Obstacles for the Implementation of Large Language Models in Product Development
Kretzschmar, Maximilian; Dammann, Maximilian Peter; Schwoch, Sebastian; Berger, Elias; Saske, Bernhard; Paetzold-Byhain, Kristin. 09 October 2024
In the realm of Artificial Intelligence, Large Language Models (LLMs) have recently emerged as a new technology, rapidly gaining prominence across various domains due to their impressive capabilities. This paper investigates key concepts, potentials and obstacles associated with integrating LLMs into the product development process. The initial focus lies on clarifying the underlying mechanisms and capabilities of LLMs to provide a clear and practical understanding. Building on this foundation, the exploration shifts to potential applications of LLMs in product development. An assessment matrix evaluates the capabilities of LLMs with regard to engineering challenges, highlighting how these models could improve key aspects of the development process. Finally, the obstacles associated with implementation in a product development context are addressed.
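For illustration only: such an assessment matrix can be represented as a simple mapping from engineering challenges to rated LLM capabilities. The challenges, capability names and scores below are hypothetical placeholders, not values taken from the paper.

```python
# Hypothetical sketch of an LLM capability assessment matrix.
# Challenges, capabilities and ratings are illustrative placeholders,
# not data from the paper.
assessment_matrix = {
    "requirements elicitation": {"text generation": 3, "summarisation": 2, "code generation": 1},
    "concept documentation":    {"text generation": 3, "summarisation": 3, "code generation": 1},
    "simulation scripting":     {"text generation": 1, "summarisation": 1, "code generation": 3},
}

def best_capability(challenge: str) -> str:
    """Return the highest-rated LLM capability for a given engineering challenge."""
    ratings = assessment_matrix[challenge]
    return max(ratings, key=ratings.get)

print(best_capability("simulation scripting"))  # -> "code generation"
```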
2 |
Clustering of Distributed Word Representations and its Applicability for Enterprise Search
Korger, Christina. 18 August 2016
Machine learning of distributed word representations with neural embeddings is a state-of-the-art approach to modelling semantic relationships hidden in natural language. The thesis “Clustering of Distributed Word Representations and its Applicability for Enterprise Search” covers different aspects of how such a model can be applied to knowledge management in enterprises. A review of distributed word representations and related language modelling techniques, combined with an overview of applicable clustering algorithms, constitutes the basis for practical studies. The latter have two goals: firstly, they examine the quality of German embedding models trained with gensim under a selection of parameter configurations. Secondly, clusterings conducted on the resulting word representations are evaluated against the objective of retrieving immediate semantic relations for a given term. The application of the final results to company-wide knowledge management is subsequently outlined by the example of the platform intergator and conceptual extensions. (A minimal sketch of the training and clustering pipeline follows the table of contents below.)

Table of contents:
1 Introduction
1.1 Motivation
1.2 Thesis Structure
2 Related Work
3 Distributed Word Representations
3.1 History
3.2 Parallels to Biological Neurons
3.3 Feedforward and Recurrent Neural Networks
3.4 Learning Representations via Backpropagation and Stochastic Gradient Descent
3.5 Word2Vec
3.5.1 Neural Network Architectures and Update Frequency
3.5.2 Hierarchical Softmax
3.5.3 Negative Sampling
3.5.4 Parallelisation
3.5.5 Exploration of Linguistic Regularities
4 Clustering Techniques
4.1 Categorisation
4.2 The Curse of Dimensionality
5 Training and Evaluation of Neural Embedding Models
5.1 Technical Setup
5.2 Model Training
5.2.1 Corpus
5.2.2 Data Segmentation and Ordering
5.2.3 Stopword Removal
5.2.4 Morphological Reduction
5.2.5 Extraction of Multi-Word Concepts
5.2.6 Parameter Selection
5.3 Evaluation Datasets
5.3.1 Measurement Quality Concerns
5.3.2 Semantic Similarities
5.3.3 Regularities Expressed by Analogies
5.3.4 Construction of a Representative Test Set for Evaluation of Paradigmatic Relations
5.3.5 Metrics
5.4 Discussion
6 Evaluation of Semantic Clustering on Word Embeddings
6.1 Qualitative Evaluation
6.2 Discussion
6.3 Summary
7 Conceptual Integration with an Enterprise Search Platform
7.1 The intergator Search Platform
7.2 Deployment Concepts of Distributed Word Representations
7.2.1 Improved Document Retrieval
7.2.2 Improved Query Suggestions
7.2.3 Additional Support in Explorative Search
8 Conclusion
8.1 Summary
8.2 Further Work
Bibliography
List of Figures
List of Tables
Appendix
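As a rough illustration of the pipeline described in the abstract, the following sketch trains a German word2vec model with gensim and clusters the resulting vectors. It assumes gensim 4.x and scikit-learn; the toy corpus and all parameter values are placeholders, not the configurations evaluated in the thesis.

```python
# Minimal sketch: train word2vec embeddings with gensim, then cluster them.
# Assumes gensim >= 4.0 and scikit-learn; all parameters are placeholders.
from gensim.models import Word2Vec
from sklearn.cluster import KMeans
from sklearn.preprocessing import normalize

# Toy corpus: in practice, a large tokenised German text collection.
sentences = [
    ["das", "haus", "steht", "am", "fluss"],
    ["die", "wohnung", "liegt", "im", "zentrum"],
    ["der", "fluss", "fliesst", "durch", "die", "stadt"],
] * 100  # repeated so min_count is satisfied in this toy example

model = Word2Vec(
    sentences,
    vector_size=100,   # embedding dimensionality
    window=5,          # context window size
    min_count=5,       # ignore rare words
    sg=1,              # skip-gram architecture
    negative=10,       # negative sampling
    workers=4,
)

# Cluster the learned vectors; cosine similarity is approximated by
# running k-means on length-normalised vectors.
vectors = normalize(model.wv.vectors)
labels = KMeans(n_clusters=5, n_init=10, random_state=0).fit_predict(vectors)

# Group words by cluster to inspect candidate semantic relations.
clusters = {}
for word, label in zip(model.wv.index_to_key, labels):
    clusters.setdefault(label, []).append(word)
for label, words in sorted(clusters.items()):
    print(label, words[:10])
```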
3 |
Representation Learning for Biomedical Text Mining
Sänger, Mario. 10 January 2025
With the rapid growth of biomedical literature, obtaining comprehensive information regarding particular biomedical entities and relations by reading alone is becoming increasingly difficult. Text mining approaches seek to facilitate processing these vast amounts of text using machine learning. This makes effective and efficient encoding of all relevant information regarding specific entities a central challenge in these approaches. In this thesis, we contribute to this research by developing machine learning methods for learning entity and text representations based on large-scale publication repositories and diverse information from in-domain knowledge bases. First, we propose two novel relation extraction approaches that use representation learning techniques to create comprehensive models of entities or entity pairs. These models learn low-dimensional embeddings by considering all publications from PubMed mentioning a specific entity or pair of entities. We use these embeddings as input for a neural network to classify relations globally, i.e., predictions are based on the entire corpus, not on single sentences or articles as in prior art. In our second contribution, we investigate the impact of multi-modal entity information on biomedical link prediction using knowledge graph embedding methods (KGEM). Our study enhances existing KGEMs by augmenting biomedical knowledge graphs with multi-modal entity information from in-domain databases. We propose a general framework for integrating this information into the KGEM entity representation learning process. In our third contribution, we augment pre-trained language models (PLM) with additional context information to identify interactions described in scientific texts. We perform an extensive benchmark that assesses the performance of such models across a wide range of biomedical relation scenarios, providing a comprehensive, but so far missing, evaluation of knowledge-augmented PLM-based extraction models.
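As a hedged illustration of the first contribution's global classification idea (not the author's actual architecture; pooling strategy, dimensions and layer sizes are assumptions), one can pool the embeddings of all corpus mentions of an entity pair and feed the result to a small feed-forward classifier, so that a single prediction is made per pair rather than per sentence:

```python
# Illustrative sketch of corpus-level ("global") relation classification:
# pool embeddings of all mentions of an entity pair, then classify once.
# Architecture and dimensions are assumptions, not the thesis's exact model.
import torch
import torch.nn as nn

EMB_DIM, HIDDEN, N_RELATIONS = 256, 128, 4

class GlobalRelationClassifier(nn.Module):
    def __init__(self):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(2 * EMB_DIM, HIDDEN),
            nn.ReLU(),
            nn.Linear(HIDDEN, N_RELATIONS),
        )

    def forward(self, head_mentions, tail_mentions):
        # head_mentions/tail_mentions: (num_mentions, EMB_DIM) tensors with
        # one embedding per corpus mention of the respective entity.
        head = head_mentions.mean(dim=0)   # pool over the whole corpus
        tail = tail_mentions.mean(dim=0)
        pair = torch.cat([head, tail], dim=-1)
        return self.mlp(pair)              # one prediction per entity pair

# Usage with random stand-in embeddings:
model = GlobalRelationClassifier()
head = torch.randn(37, EMB_DIM)   # e.g. 37 PubMed mentions of entity A
tail = torch.randn(52, EMB_DIM)   # e.g. 52 PubMed mentions of entity B
logits = model(head, tail)
print(logits.softmax(dim=-1))     # distribution over relation types
```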