11 |
Programming by Voice: Efficiency in using ChatGPT and Speech OptimizationEliasson, Albin, Kroik Herkules, Martin January 2024 (has links)
Programming by voice can be a viable option for programmers who suffer from physical disabilities such as Repetitive Stress Injury (RSI), but study shows there are still challenges and limitations in the area, from the requirement of syntax-specific language to inaccuracy of speech recognition. We aim to address the potential benefit and performance of gradually integrating ChatGPT with its versions 3.5/4.0 and how to best communicate (speech optimization) to see if the area of programming by voice can be improved in terms of reduced vocal and cognitive load. We present a tool named ChatGPT VCG which contains two approaches, a traditional keyword interpretation approach using the Serenade tool as a code generator and a direct code generation approach using ChatGPT to generate code directly. Using test tasks written in Java, and the Serenade tool as a base measurement, the two approaches are compared and measurements such as the number of words and code characters generated are collected and analyzed. The result indicates that integrating ChatGPT allows the user to circumvent the required syntax-specific language. The direct code generation using ChatGPT 4.0 performed the best with a 4.2 times reduction in the required number of words compared to standard Serenade, while the keyword-based approach at worst shows a 14% increase. Speech optimization shows performance can be further increased by reducing or removing superfluous grammar and instead only providing the relevant information in commands. The study concludes that integration of ChatGPT can improve performance, and the speech can be optimized to reduce the number of words, but often at the cost of a conversational speech pattern.
|
12 |
Exploring the impact of varying prompts on the accuracy of database querying with an LLMLövlund, Pontus January 2024 (has links)
Large Language Models (LLM) and their abilities of text-to-SQL are today a very relevant topic, as utilizing an LLM as a database interface would facilitate easy access to the data in the database without any prior knowledge of SQL. What is being studied in this thesis, is how to best structure a prompt to increase the accuracy of an LLM on a text-to-SQL task. The methods of experimentation used in the study were experimentation with 5 different prompts, and a total of 22 questions asked about the database with the questions being of difficulties varying from easy to extra hard. The results showed that a simpler, less descriptive prompt performed better on the easy and medium questions, while a more descriptive prompt performed better on the hard and extra hard questions. The f indings did not fully align with the hypothesis that more descriptive prompts would have the most correct outputs. In conclusion, it seemed that prompts that contained less ”clutter” and were more straightforward were more effective on easy questions, while on harder questions a prompt with a better description and examples had a better impact.
|
13 |
Improving Article Summarizationby Fine-tuning GPT-3.5Gillgren, Fredrik January 2024 (has links)
This thesis project aims to improve text summarization in the financial application by fine-tuningGenerative Pre-trained Transformer 3.5 (GPT-3.5) . Through meticulous training andoptimization, the model was adeptly configured to accurately and efficiently condense complexfinancial reports into concise, informative summaries, specifically designed to support decision-making in professional business environments. Notable improvements were demonstrated inthe model's capacity to retain essential financial details while enhancing the readability andcontextual relevance of the text, as evidenced by superior ROUGE and BLEU scores whencompared to the baseline GPT-3.5 Turbo model. This fine-tuning approach not only underscoresGPT-3.5’s remarkable adaptability to domain-specific challenges but also marks a significantadvancement in the field of automated text summarization within the financial sector. Thefindings from this research highlight the transformative potential of bespoke NLP solutions,offering data-driven industries the tools to rapidly generate precise and actionable businessinsights, thus facilitating more informed decision-making processes
|
14 |
Secure Interactions with Large Language Models in Financial Services : A Study on Implementing Safeguards for Large Language ModelsFredrikson, Gustav January 2024 (has links)
This thesis examines the use of Large Language Models (LLMs) in the financial sector, highlighting the risks and necessary safety measures for their application in financial services. As these models become more common in various financial tools, they bring both new opportunities and significant challenges, such as potential errors in financial advice and privacy issues. This work introduces a detailed safeguarded framework designed to improve the reliability, security, and ethical use of LLMs in financial applications. The framework includes specific safety features like checking user inputs, detecting incorrect information, and preventing security breaches to tackle these challenges effectively. Using quantitative testing benchmarks and case studies with a financial chatbot, this thesis shows that this framework helps reduce operational risks and increases trust among users. The results show that while LLMs already have some built-in safety features, adding tailored security measures greatly strengthens these systems against complex threats. This study advances the discussion on AI safety in financial settings and provides a practical guide for implementing strong safety measures that ensure reliable and ethical financial services.
|
15 |
Root Cause Prediction from Log Data using Large Language ModelsMandakath Gopinath, Aswath January 2024 (has links)
In manufacturing, uptime and system reliability are paramount, placing high demands on automation technologies such as robotic systems. Failures in these systems cause considerable disruptions and incur significant costs. Traditional troubleshooting methods require extensive manual analysis by experts of log files, system data, application information, and problem descriptions. This process is labor-intensive and time-consuming, often resulting in prolonged downtimes and increased customer dissatisfaction, leading to heavy financial losses for companies. This research explores the application of Large Language Models (LLMs) like MistralLite and Mixtral-8*7B to automate root cause prediction from log data. We employed various fine-tuning methods, including full fine-tuning, Low-Rank Adaptation (LoRA), and Quantized Low Rank Adaptation (QLoRA), on these decoder-only models. Beyond using perplexity as an evaluation metric, the study incorporates GPT-4 as-a-judge to assess model performance. Additionally, the research uses complex prompting techniques to aid in the extraction of root causes from problem descriptions using GPT-4 and utilizes vector embeddings to analyze the importance of features in root cause prediction. The findings demonstrate that LLMs, when fine-tuned, can assist in identifying root causes from log data, with the smaller MistralLite model showing superior performance compared to the larger Mixtral model, challenging the notion that larger models are inherently better. The results also indicate that different training adaptations yield varied effectiveness, with QLoRA adaptation performing best for MistralLite and full fine-tuning proving most effective for Mixtral. This suggests that a tailored approach to model adaptation is necessary for optimal performance. Additionally, employing GPT-4 with Chain of Thought (CoT) prompting has demonstrated the capability to extract reasonable root causes from solved issues using this technique. The analysis of feature vector embeddings provides insights into the significant features, enhancing our understanding of the underlying patterns and relationships in the data.
|
16 |
Utveckling av en anonymiseringsprototyp för säker interaktion med chatbotarHanna, John Nabil, Berjlund, William January 2024 (has links)
I denna studie presenteras en prototyp för anonymisering av känslig information itextdokument, med syfte att möjliggöra säker interaktion med stora språkmodeller(LLM:er), såsom ChatGPT. Prototypen erbjuder en plattform där användare kanladda upp dokument för att anonymisera specifika känsliga ord. Efter anonymiseringkan användare ställa frågor till ChatGPT baserat på det anonymiserade innehållet.Prototypen återställer de anonymiserade delarna i svaren från ChatGPT innan de visas för användaren, vilket säkerställer att känslig information förblir skyddad underhela interaktionen.I studien används metoden Design Science Research in Information Systems (DSRIS). Prototypen utvecklas i Java och testas med påhittade dokument, medan enkätsvar samlasin för att utvärdera användarupplevelsen.Resultaten visar att prototypens funktioner fungerar väl och skyddar känslig information vid interaktionen med ChatGPT. Prototypen har utvärderats med hjälp av svarfrån enkäten som dessutom tar upp förbättringsmöjligheter.Avslutningsvis visar studien att det är möjligt att anonymisera textdokument effektivt och samtidigt få korrekt och användbar feedback från ChatGPT. Trots vissa begränsningar i användargränssnittet på grund av tidsramen visar studien på potentialför säker datahantering med ChatGPT. / This study presents a prototype for anonymizing sensitive information in text documents, with the aim of enabling secure interactions with large language models(LLMs) such as ChatGPT. The prototype offers a platform where users can uploaddocuments to anonymize specific sensitive words. After anonymization, users canpose questions to ChatGPT based on the anonymized content. The prototype restores the anonymized parts in the responses from ChatGPT before they are displayed to the user, ensuring that sensitive information remains protected throughoutthe entire interaction.The study uses the Design Science Research in Information Systems (DSRIS)method. The prototype is developed in Java and tested with fabricated documents,while survey responses were collected to evaluate the user experience.The results show that the prototype's functionalities work well and protect sensitiveinformation during interaction with ChatGPT. The prototype has been evaluated using survey responses that also highlight opportunities for improvement.In conclusion, the study demonstrates that it is possible to effectively anonymizetext documents while obtaining accurate and useful feedback from ChatGPT. Despite some limitations in the user interface due to the timeframe, the study showspotential for secure data handling with ChatGPT.
|
17 |
Prompting for progression : How well can GenAI create a sense of progression in a set of multiple-choice questions? / Prompt för progression : Hur bra kan GenAI skapa progression i en uppsättning flervalsfrågor?Jönsson, August January 2024 (has links)
Programming education is on the rise, leading to an increase in learning resources needed for universities and online courses. Questions are crucial for promoting good learning, and providing students with ample practice opportunities. Learning a subject relies heavily on a structured progression of topics and complexity. Yet, creating numerous questions has been proven to be a time-consuming task. Recently the technology world has been introduced to Generative AI (GenAI) systems using Large Language Models (LLMs) capable of generating large amounts of text and performing other text-related tasks. How can GenAI be used to solve problems related to creating learning materials while ensuring good quality? This study aims to investigate how well GenAI can create a sense of progression in a set of programming questions based on different prompt strategies. The method involves three question-generation cases using Chat-GPT API. Then, a qualitative evaluation of questions complexity, order, and quality is conducted. The first case aims to be the most simple way of asking Chat-GPT to generate 10 MCQs about a specific topic. The second case introduces defined complexity levels and desires of logical order and progression in complexity. The final case is the more advanced prompt building upon the second case along with a skill map as inspiration to the LLM. The skill map is a structured outline that highlights key points when learning a topic. According to the results, providing more instructions along with a skill map had a better impact on the progression of questions generated compared to a simpler prompt. The first case prompt still resulted in questions with good order but lacking in increasing complexity. The results indicate that while GenAI is capable of creating questions with a good progression that could be used in a real teaching context, it still requires quality control of the content to find outliers. Further research should be done to investigate optimal prompts and what constitutes a good skill map. / Programmeringsutbildningar blir allt fler, vilket leder till en ökning av behovet för lärresurser för universtitet och onlinekurser. Frågor är avgörande för att främja bra lärande och ge eleverna övningsmöjligheter. Att lära sig ett ämne är starkt beroende av en strukturerad progression av ämnen och komplexitet. Men att skapa många frågor har visat sig vara en tidskrävande uppgift. Nyligen har teknikvärlden introducerats till Generativa AI (GenAI)-system som använder Stora språkmodeller (LLM) som kan generera stora mängder text och utföra andra textrelaterade uppgifter. Hur kan GenAI användas för att lösa problem relaterade till att skapa läromedel samtidigt som man säkerställer en god kvalitet? Denna studie syftar till att undersöka hur väl GenAI kan skapa en känsla av progression i en uppsättning programmeringsfrågor baserade på olika prompt strategier. Metoden använder tre olika sätt att generera frågor med hjälp av Chat-GPTs API. Därefter genomförs en kvalitativ utvärdering av frågornas komplexitet, ordning och kvalité. Det första sättet syftar till att vara det enklaste sättet att be Chat-GPT att generera 10 flervalsfrågor om ett specifikt ämne. Det andra fallet introducerar definierade komplexitetsnivåer och önskemål om logisk ordning och progression i komplexitet. Det sista fallet är den mer avancerade prompten som bygger på det andra fallet tillsammans med en färdighetskarta som inspiration. Färdighetskartan är en strukturerad disposition av ett ämne som lyfter fram nyckelpunkter när man lär sig ett ämne. Resultaten visade att tillhandahålla fler instruktioner tillsammans med en färdighetskarta hade en bättre inverkan på progressionen av de genererade frågorna jämfört med det första sättet. Den första prompten resulterade fortfarande i frågor med god ordning men som saknade stegrande komplexitet. Resultaten indikerar att även om GenAI kan skapa frågor med god progression som skulle kunna användas i ett verkligt undervisningssammanhang, så krävs fortfarande en kvalitetskontroll av innehållet för att hitta felaktigheter. Ytterligare forskning bör göras för att undersöka optimala prompt och hur en bra färdighetskarta bör se ut.
|
18 |
A Mixed-Methods Study on the Realism of GPT-Generated Texts on Anxiety : From “My Misguided Drunken Bravado” to “Racked with Embarrassment and a Gnawing Sense of Doom”Schalker, Merel, Onn, Andrea January 2024 (has links)
The popularity of Large Language Models (LLMs), such as ChatGPT-4 (GPT), is increasing rapidly. Simultaneously, anxiety is contributing to the global burden of disease. To date, little research is performed on how LLMs portray anxiety. This study aimed to investigate the quality of anxiety portrayals generated by LLMs by “How well do large language models produce realistic anxiety depictions?”, using a mixed-methods approach. Realism was defined as perceived realism and alignment with scientific literature. The quantitative analysis involved 42 participants (age M = 30) randomly assigned to a questionnaire representing one of four anxiety levels. Participants rated perceived level and perceived realism of anxiety descriptions in human-written and GPT-modified texts. Results revealed: GPT-generated anxiety descriptions were perceived significantly more realistic compared to human-written texts; regardless of anxiety level; participants were less accurate in rating anxiety levels of GPTgenerated stories when the levels differed largely from that of the human-written story. The qualitative narrative analysis provided deeper insights into how realistic GPT and a GPT based on the cognitive framework of anxiety (CGPT) depicted anxiety by assessing how well depictions aligned with scientific literature. GPT and CGPT effectively included general features of anxiety in depictions. CGPT focused more on cognitive thought patterns, but neither fully depicted distinctions between different levels of anxiety. Overall, findings suggest LLMs do well in producing realistic representations of anxiety, but fail to fully depict various levels of anxiety. The study contributes to understanding potential applications of LLMs in psychological contexts, including management training, therapeutic settings, and literature. Keywords: LLM, ChatGPT, anxiety, realism, cognitive framework / Språkmodeller (LLMs) som ChatGPT-4 (GPT) blir allt mer populära. Samtidigt påverkar ångest folkhälsan. Hittills har enbart ett fåtal studier utrett hur LLMs skildrar ångest. Syftet med denna studie var att undersöka kvaliteten på ångestskildringar genererade av LLMs med en blandad metodansats genom frågan “Hur väl producerar LLMs realistiska ångestskildringar?”. Realism definierades som upplevd realism och överensstämmelse med vetenskaplig litteratur. I den kvantitativa analysen tilldelades 42 deltagare (ålder M = 30) en enkät som representerade en av fyra ångestnivåer. Deltagarna bedömde upplevd ångestnivå och upplevd realism av ångestbeskrivningar i berättelser skrivna av människor och GPTgenererade berättelser. Resultaten visade att deltagarna uppfattade GPT-genererade ångestbeskrivningar som signifikant mer realistiska, oavsett ångestnivå, men var mindre korrekta i att bedöma ångestnivå i GPT-genererade berättelser när nivån skiljde sig mycket från den människoskrivna versionen. Den kvalitativa narrativa analysen gav djupare insikter i hur realistiskt GPT och en GPT baserad på det kognitiva perspektivet av ångest (CGPT) skildrade ångest genom att jämföra skildringarna med vetenskaplig litteratur. Skildringarna av GPT och CGPT överensstämde i stort sett med litteraturen. CGPT fokuserade mer på kognitiva tankemönster, men ingen modell lyckades fullt ut skildra skillnaderna mellan olika ångestnivåer. Sammantaget tyder resultaten på att LLMs är bra på att skapa realistiska representationer av ångest, men misslyckas med att fullt ut skildra olika ångestnivåer. Studien bidrar till kunskapen om potentiella tillämpningar av LLMs i psykologiska kontexter, såsom ledarskapsutbildning, terapi och inom litteratur. Nyckelord: LLM, ChatGPT, ångest, realism, kognitiv ångest
|
19 |
Automating Software Development Processes Through Multi-Agent Systems : A Study in LLM-based Software Engineering / Automatisering av Mjukvaruutvecklingsprocesser genom användning av Multi-Agent System : En studie inom LLM-baserad mjukvaruutvecklingPeltomaa Åström, Samuel, Winoy, Simon January 2024 (has links)
In the ever-evolving landscape of Software Development, the demand for more efficient, scalable, and automated processes is paramount. The advancement of Generative AI has unveiled new avenues for innovative approaches to address this demand. This thesis explores one such avenue through the use of Multi-Agent Systems combined with Large Language Models (LLMs) to automate tasks within the development lifecycle. The thesis presents a structure for designing and developing an LLM-based multi-agent application by encompassing agent design principles, strategies for facilitating multi-agent collaboration, and providing valuable insights into the selection of an appropriate agent framework. Furthermore, the thesis showcases the developed application in its problem-solving capabilities with quantitative benchmarking results. Additionally, the study demonstrates practical implementations through examples of real-world applications. This study demonstrates the potential of utilizing LLM-based multi-agent systems in enhancing software development efficiency, offering companies a promising and powerful tool for streamlining Software Engineering workflows. / I den ständigt föränderliga världen av mjukvaruutveckling är behovet av mer effektiva, skalbara, och automatiserade metoder av stor betydelse. Framstegen inom generativ AI har öppnat nya möjligheter för utveckling av metoder för detta ändamål. Denna studie undersöker en sådan möjlighet genom användning av multi-agent system i samband med stora språkmodeller (Large Language Models, LLM) för automatisering av uppgifter inom utvecklingslivscykeln. Studien presenterar en struktur för design och utveckling av en LLM-baserad multi-agent applikation genom att bearbeta agentdesign och strategier för att underlätta samarbete mellan flera agenter och ge värdefulla insikter i valet av ett lämpligt agent-ramverk. Vidare demonstrerar studien den utvecklade applikationens problemlösningsförmåga med kvantitativa benchmark-resultat. Utöver detta inkluderar studien även exempel på genererade applikationer för att presentera konkreta exempel på implementeringar. Denna studie visar potentialen av att använda LLM-baserade multi-agent system för att förbättra effektiviteten inom mjukvaruutveckling, och erbjuder företag ett lovande och kraftfullt verktyg för effektivisering av arbetsflöden inom mjukvaruteknik.
|
20 |
Разработка чат-бота для второй линии поддержки : магистерская диссертация / Development of a chatbot for the second technical support lineГулич, А. С., Gulich, A. S. January 2024 (has links)
Целью магистерской диссертации является разработка и программная реализация чат-бота для второй линии технической поддержки в компании Nordic IT на базе языковой модели GPT-3.5-turbo. В работе приведен подробный обзор современных средств разработки чат-ботов, проведен обзор проектирования, классификации, преимуществ и недостатков, а также практического применения чат-ботов. Проведен анализ примененных технологий разработки, обоснован выбор оптимальных методов для обучения языковых моделей. Приведено подробное описание структуры программных модулей, методов обучения, оценки качества ответов, интеграции бота с Microsoft Teams и произведен расчет экономической эффективности проекта. Практическая значимость данной работы обусловлена в возможности и целесообразности использования разработанного чат-бота для улучшения производительности и качества обслуживания в компании Nordic IT. / The aim of the master thesis is the development and program implementation of a chatbot for the second line of technical support in Nordic IT based on the GPT-3.5-turbo language model. The work provides a detailed overview of modern chatbot development tools, a review of the design, classification, advantages and disadvantages, and practical application of chatbots. The applied development technologies are analyzed, the choice of optimal methods for training language models is justified. A detailed description of the structure of program modules, training methods, evaluation of the quality of answers, integration of the bot with Microsoft Teams and calculation of the economic efficiency of the project are given. The practical significance of this work is determined by the possibility and expediency of using the developed chatbot to improve productivity and quality of service in the company Nordic IT.
|
Page generated in 0.0849 seconds