Global ETD Search

71	Automatisk yrkeskodning med rättstavning och textkategorisering / Automatic survey coding with spell checking and text categorization Westermark, Max January 2015 (has links) Statistiska Centralbyrån (SCB) samlar in data i form av enkätundersökningar. Dessa data måste sedan kodas med olika klassifikationer för att kunna användas i statistikproduktion. En sådan klassifikation är Socioekonomisk indelning (SEI). Denna rapport behandlar hur man automatiskt kan SEI-koda sådana enkätundersökningar så att varje enkät tilldelas en SEI-kod. SCB:s nuvarande algoritm kodar ca 50% av enkäterna och bygger på att det yrke som är angivet i enkäten finns med i ett lexikon. Om yrket inte finns med kan enkäten inte kodas. Målet med detta arbete är att hitta bättre algoritmer som klarar av att koda fler enkäter. Som alternativ presenteras två algoritmer, en som bygger på rättstavning och en som bygger på textkategorisering med maskininlärning. Rättstavningsalgoritmen försöker rättstava de inkommna yrkena och sedan jämföra mot det lexikon som SCB använder. Denna algoritm lyckas koda något fler enkäter än vad originalalgoritmen klarar av. Hur stor förbättringen blir beror på hur många felstavningar det finns bland enkätdatat. Maskininlärningsalgoritmen representerar enkäterna med en bag-of-words-modell som sedan tränar en Stödvektormaskin. Då problemet har multipla klasser används en En-mot-alla metod för att hantera detta. Vi drar slutsatsen att en kombination av de tre nämnda algoritmerna presterar bäst. Den största svårigheten är att kunna koda många enkäter men samtidigt bibehålla en hög precision. När algoritmerna kombineras kompletterar de också varandra. Både rättstavningsalgoritmen och maskininlärningsalgoritmen förbättrar därför kodningsprestandan jämfört med den ursprungliga algoritmen. rättstavning klassificering maskininlärning suport vector machines stödvektormaskiner wagner-fischer levenshtein Computer Sciences Datavetenskap (datalogi)
72	Minimizing the expected opportunity loss by optimizing the ordering of shipping methods in e-Commerce using Machine Learning / Minimera den potentiella förlusten genom att optimera ordningen av leveransmetoder inom e-handel med maskininlärning Ay, Jonatan, Azrak, Jamil January 2022 (has links) The shopping industry is rapidly changing as the technology is advancing. This is especially true for the online industry where consumers are nowadays able to to shop much of what the need over the internet. In order to make the shopping experience as smooth as possible, different companies develops their sites and checkouts to be as friction-less as possible. In this thesis, the shipping module of Klarnas checkout was analyzed and different models were created to get an understanding of how the likelihood of a customer finalizing a purchase (conversion rate) could be improved. The shipping module consists of a number of shipping methods along with shipping carriers. Currently, there is no logic to sort the different shipping method/carriers other than a static ordering for all customers. The order of the shipping methods and carriers are what were investigated in the thesis. Hence, the core problem is to understand how the opportunity loss could be minimized by a different ordering of the shipping methods, where the opportunity loss are derived by the reduction in conversion rate between the control group (current setup) and a new model. To achieve this, a dataset was prepared and features were engineered in such a way that the same training and test datasets could be used in all algorithms. The features were engineered using a point-in-time concept so that no target leakage would be present. The target that was used was a plain concatenation of shipping method plus the shipping carrier. Finally, three different methods tackling this multiclass classification problem were investigated, namely Logistic Regression, Extreme Gradient Boosting and Artificial Neural Network. The aim of these algorithms is to create a learner that has been trained on a given dataset and that is able to predict the combination of shipping method plus carrier given a certain set of features. By the end of the investigation, it was concluded that using a model to predict the most relevant shipping method (plus carrier) for the customer made a positive difference on the conversion rate and in turn, the increase in sales. The overall accuracy of the Logistic Regression was 65.09%, 71.61% for the Extreme Gradient Boosting and 70.88% for the Artificial Neural Network. Once the models were trained, they were used in a back-simulation (that would be a proxy for an A/B-test) on a validation set to see the effect on the conversion rate. Here, the results showed that the conversion rate was 84.85% for the Logistic Regression model, 84.95% for the Extreme Gradient Boosting and 85.02% for the Artificial Neural Network. The control group which was a random sample of the current logic had a conversion rate of 84.21%. Thus, implementing the Artificial Neural Network would increase Klarnas sales by about 6.5 SEK per session. / Detaljhandelsindustrin förändras i en snabb takt i samband med att teknologin utvecklas. Detta är speciellt fallet för näthandeln där konsumenter numer har möjligheten att handla i stort sett allt de behöver över internet. För att göra köpupplevelsen så smidig som möjlig utvecklar olika bolag deras hemsidor och online kassor så att de innehåller så lite friktion som möjligt. I denna avhandling utreddes Klarnas leveransmodul som är en den av Klarnas onlinekassa (Checkout). Här utvecklades flera modeller och analyserades för att få en förståelse för hur sannolikheten att kunden slutför ett köp (konverterinsgrad) kunde ökas. Leveransmodulen består av ett flertalet leveransmetoder tillsammans med en leverantör. I dagsläget finns det ingen logik för att sortera dessa metoder annat än en statisk sortering för alla kunder. Ordningen på leveransmetoderna och leverantörerna är alltså vad som utreddes. Kärnproblemet i denna avhandling är alltså att förstå hur den potentiella förlusten av att ha en suboptimal sortering, där den potentiella förlusten härleds av minskningen av konverteringsgraden mellan den nuvarande lösningen och en ny modell. För att uppnå detta förbereddes ett dataset och variabler skapades på sådant vis att både tränings och test datan kunde användas för samtliga algoritmer. Variablerna skapades med en Point-in-time koncept så att ingen ogiltig information skulle komma med. Målvariabeln, eller den beroende variabeln, var en enkel ihopslagning av leveransmetoden plus leverantörens namn. Sedan användes tre algoritmer för att tackla detta multiklass klassifikationsproblem, nämligen Logistisk Regression, Extreme Gradient Boosting samt ett Artificiellt Neuralt Nätverk. Målet med dessa algoritmer är att skapa en modell som tränats på ett givet dataset och som kan förutspå kombinationen av leveransmetod plus leverantör givet ett bestämt set av värden på variablerna. I slutet av utredningen drogs slutsatsen att en modell, som kunde förutspå den mest relevanta leveransmetoden (plus leverantör) för kunden, hade en positiv inverkan på konverteringsgraden och i sin tur ökningen i försäljning. Noggrannheten för den Logistiska Regressionen var 65.09%, för Extreme Gradient Boosting var den 71 69% och för det Artificiella Neurala Nätverket var den 70.88%. Efter att modellerna tränats användes de i en simulering (som skulle representera ett A/B-test) på ett valideringsset för att förstå effekten på konverteringsgraden. Här visade resultaten att konverteringsgraden var 84.55% för Logistiska Regressionen, 84.95% för Extreme Gradient Boosting samt 85.02% för det Artificiella Neurala Nätverket. Kontrollgruppen som bestod av slumpmässigt valda rader från den nuvarande logiken hade en konvertingsgrad på 84.21%. Detta innebar alltså att om det Artificiella Neurala Nätverket hade implementerats, så hade det ökat Klarnas försäljning med ca 6.5 SEK per session. statistics machine learning applied mathematics e-commerce statistik maskininlärning tillämpad matematisk statistik e-handel Computational Mathematics Beräkningsmatematik
73	Risk Evaluation in a ML-Approximated Portfolio Environment / Riskvärdering i maskininlärningsapproximerad portföljmiljö Franzén, Filip, Nord, Karl Axel January 2022 (has links) This thesis explores and evaluates the forecasting application of the machine learning method Gradient Boosting Decision Trees. This method is used to forecast the demand of the online grocery market with a 7-day time horizon. The thesis was conducted in collaboration with the online grocery company Mathem. The model is applied and evaluated on three different periods representing the spring, summer and fall. The main evaluation metric is the mean absolute percentage error (MAPE), and clear differences were found depending on the predictability of the period. Apart from the model and its application to demand forecasting, the related risk was investigated. This was done by studying the Value-at-Risk and Expected Shortfall associated with discrepancies between the forecasted and actual values over the three periods. The most important conclusion of the case study at Mathem is that overestimation in the forecast is more costly in terms of monetary value than underestimating. It is also found that this is highly dependent on the cost structure of the company's operation and could therefore vary between companies. Thus, the study has contributed to understanding the applications of machine learning models in forecasting processes as well as the risks related to over/underestimating the demand of the online grocery market. / I denna uppsats utforskas och utvärderas maskininlärningsmetoden Gradient Boosting Decision Trees och dess tillämpningsområde inom prognostisering. Metoden används för att prognostisera efterfrågan på onlinehandel av dagligvaror med en 7 dagars tidshorisont. Uppsatsen gjordes i samarbete med företaget Mathem som är aktiva inom denna sektor. Modellen appliceras och utvärderas på tre olika tidsperioder som representerar våren, sommaren och hösten. Modellen själv utvärderas med avseende på måttet mean absolute percentage error (MAPE) och tydliga skillnader mellan tidsperioderna observerades relaterat till variansen i datan. Förutom modellen och dess applikation på prognostisering av efterfrågan utforskades även de relaterade riskerna. Detta genomfördes genom att studera riskmåtten Value-at-Risk och Expected Shortfall baserade på differensen mellan de prognostiserade och de faktiska värdena under dessa tre tidsperioder. Den viktiga slutsatsen som kunde dras från uppsatsen i samband med Mathem var att det, i monetära termer, är dyrare att prognostisera för högt än för lågt. En ytterligare slutsats som kan dras är att den monetära risken är mycket beroende av kostnadsstrukturen hos företaget och skulle därför kunna variera mellan olika företag. Således har denna uppsats bidragit till förståelse för användandet av maskininlärning för prognostisering samt riskerna kopplade till över- och underskattning av efterfrågan i marknaden för dagligvaruhandel online. Financial risk management forecasting machine learning FMCG Finansiell riskhantering prognostisering maskininlärning konsumtionsvaror Other Mathematics Annan matematik
74	Industri 4.0 och digitaliseringens påverkan på den svenska tillverkningsindustrin / Industry 4.0 and the impact of digitalisation on the Swedish manufacturing industry Gojani, Ndriqim, Nyirasagamba, Anitha January 2021 (has links) Inledning: Begreppet industri 4.0 tillkom 2011 i Tyskland under Hannovermässan och kan betraktas som den fjärde upplagan av den industriella revolutionen. Det är ett begrepp för flera teknologier och en rad koncept inom automation, process och IT världen. Industri 4.0 berör ämnesområden som bland annat Digitaliseringen, Maskininlärning, Artificiell Intelligens, Big Data-analys och Internet Of Things. Denna studie har undersökt hur den svenska tillverkningsindustrin påverkats av industri 4.0, och främst digitaliseringen, samt vilka effekter det har haft på industrier både i nutid men även framtiden. Syfte: Syftet med studien är att utveckla kompetens inom ett ämne som är aktuellt i dagsläge inom industrier. Studien medför en ökad kännedom och bildar en förväntad bild på hur arbetslivet kan se ut efter utbildningen. Problemformulering: Hur jobbar svenska tillverkningsindustrier med implementeringen av industri 4.0 samt vilka effekter har det haft på företagen? Vad är framgångsfaktorer samt hinder vid implementeringen av industri 4.0 inom tillverkningsindustri? Vad är sambandet mellan hållbarutveckling och digitaliseringen? Metod: Studien har använt sig av en kvalitativ undersökningsmetod med intervjuer, fält- och litteraturstudier. Detta för att kunna öka kompetensen inom ämnet samt för att sedan kunna jämföra, analysera och ta välgrundade slutsatser. Att använda sig av triangulering forskningsmetod bidrog till studiens validitet. Resultat: Resultatet visade att det förekommer diverse fördelar om organisationer inom tillverkningsindustrin implementerar industri 4.0, vilket även tidigare forskning visat. Dessa fördelar är högre effektivitet och produktivitet i samband med kortare ledtider, bättre informationsdelning och mer spårbarhet i produktionskedjan. Dessutom främjar industri 4.0 och digitaliseringen företagens arbete mot en hållbar utveckling, då både miljön och ekonomin gynnas. Studien kunde även konstatera att företagen med hjälp av robotar och återanvändningav material hade förbättrad resursanvändning. / Introduction The concept of industry 4.0 was introduced 2011 in Germany during the HanoverFair and can be considered as the fourth edition of the industrial revolution. It is a concept for several technologies and a number of concepts in the automation, process and IT world. Industry 4.0 touches on subject areas such as Digitization, Machine Learning, Artificial Intelligence, Big Data Analysis and the Internet of Things. This study has examined how the Swedish manufacturing industry has been affected by industry 4.0, mainly by digitalisation, and what effects it has had on industries both in the present but also in the future. Purpose The purpose of the study is to develop knowledge regarding a subject that is relevant in the current situation of industries. The study entails an increased awareness and forms an expected picture of what working life can look like after the education. Research questions How do Swedish manufacturing industries work with the implementation of industry 4.0 and what effects has it had on the companies? What are the success factors and obstacles in the implementation of industry 4.0 in the manufacturing industry? What is the relation between sustainable development and digitalisation? Method The study has used a qualitative research method with interviews, field- and literature studies. This helped to increase competence in the subject and to then be able to compare, analyze and make substantial conclusions. Using the triangulation research method contributed to the validity of the study. Results The results showed that there are various advantages for organizations in the manufacturing industry to implement industry 4.0, as previous research also has shown. These benefits are higher efficiency and productivity in conjunction with shorter lead times, better informationi sharing and more traceability in the production chain. In addition, Industry 4.0 and digitalisation promote companies' work towards sustainable development, as both the environment and the economy benefit from this. The study also found that companies who works with robots and reuse of materials had improved their resource use. industri 4.0 digitalisering produktion automation artificiell intelligens maskininlärning. robotisering effektivisering hållbar utveckling Engineering and Technology Teknik och teknologier
75	Using AI for Evaluating and Classifying E-mails with Limited Data Sets Malm, Daniel January 2022 (has links) Denna rapport utvärderar olika metoder för att klassificera och kategorisera email. Mångamail anländer hos människors inkorg varje dag. När tiden går och antalet email ökar blir detsvårare att hitta specifika email. På HDAB arbetar de som konsulter och vill dela upp email iolika mappar beroende på vilket projekt det tillhör. Idag fungerar det genom ett ord-regelbaseratsystem som sorterar email I olika mappar med en precision på cirka 85%. HDAB villta reda på om det går att använda maskininlärning för det nuvarande systemet. Denna rapportpresenterar fyra maskininlärningsalgorimer, beslutsträd, random forest beslutsträd, k-nearestneighbor och naive bayes, som användas för att utvärdera om det är möjligt att kategoriseraemailen.Datan som används till rapporten kommer från HDABs mailserver och är redan kategoriseradtill rätt kaegori. / This report will evaluate methods for classifying e-mails into different categories. A lot ofemails are received in peoples inboxes every day. When the time passes and the amount ofemails increases the ability to find specific emails gets harder. At HDAB they are workingwith consulting and want to separate different emails from different project into separate folders.This is achieved today by using a word based rule system that sorts emails into differentfolders and has a precision about 85%. HDAB wants to know if it is possible to use machinelearning to automatically sort the emails into different folders instead of the current solution.This report presents four machine learning algorithms, decision tree, random forest decisiontree, k-nearest neighbor and naive bayes, which are being used for evaluation of the possibilityto categorize the emails.The data used for the report will be data gathered from HDAB’s mail server and are alreadypre-labeled into their respectively categories. Email foldering supervised learning machine learning Epost mappning övervakad inlärning maskininlärning Computer Sciences Datavetenskap (datalogi)
76	Ursäkta, vi har lite bråttom : Om automatisering för att effektivisera tillgängliggörandet av affärstryck / Would You Mind Hurrying Up Please : On Automatization as a Means of Improved Efficiency When Cataloging Commercial Ephemera Hellgren, Andreas January 2019 (has links) The demand on research libraries to digitize theircollections as a means of increasing the availabilityof said collections are increasing. However, a prerequisite for this is the cataloging of the collections – a task commonly associated with large demands on time and other resources. One way of handling this might be efforts in applying automatization as a part of the cataloging process. This thesis examines the possibilities of using automatization when catalog- ing commercial ephemera. For this, focus is directed towards the features of the material; the process of cataloging; and the demands on the catalogued mate- rial from its various users using a theory based on Monica J. Bates (2002) Cascade-model. By conducting a case study, consisting of observations based on contextual inquiry and interviews partly using photo elicitation, automatization of cataloging is found to be a possible way to improve availability, but not without its own complications and demands on re- sources. In conclusion, suggestions are made for considerations libraries should be aware of before automatization might be implemented at research libraries. contextual inquiry photo elicitation automatisering maskininlärning text mining katalogisering efemärt tryck affärstryck Information Studies Biblioteks- och informationsvetenskap
77	Automated fiction classification : an explorative study offiction classification using machine-learning techniques Falk, Olof January 2019 (has links) This thesis aims to explore the possibilities and components of employing automated text classification techniques to classify collections of narrative fiction by genre, and also, what linguistic features are prominent in distinguishing genres of fiction. The historical traditions and current practices and theories in the field of fiction classification are outlined, along with central concepts of classification and genre theory. Linguistic features are also introduced, and hypothesized to carry capabilities of distinguishing genres of fiction. The thesis also reviews the foundations and current state of automated text classification, and reasons on what constitutes topical and stylistic features in relation to fiction. Knowledge gaps are identified between automated text classification and traditional fiction classification, and also, concerning the potentially genre distinguishing qualities of topical and stylistic features. The main experiment, around which the thesis is centered, is divided into two parts. The first part employs and evaluates kNN and SVM classifiers on a collection of fiction documents across four genres of fiction. In the second part, some feature selection methods are employed for inspection of distinguishing features across the collection. Findings suggest a potential of using automated techniques to classify fiction, and also illustrates feature patterns that are argued to distinguish each of the four different genres of fiction. Some suggestions for further research are also proposed. skönlitteratur klassifikation genrer särdrag ämne stil maskininlärning Information Studies Biblioteks- och informationsvetenskap
78	Maskininlärning och fallklassificering med MEMS-accelerometer : En studie i fallklassificering med artificiella neurala nätverk / En studie i fallklassificering med artificiella neurala nätverk : Maskininlärning och fallklassificering med MEMS-accelerometer Theo, Sobczak January 2020 (has links) Denna rapport har sin utgångspunkt på skapandet av en maskininlärningsalgoritm för att kunna klassificera ett fysiskt fall av en person. En DC Kapacitiv MEMS-accelerometer (BMA250) kombinerat med en Tinyduino Processor (Atmega328P) används för datainsamling. Programmering av processorn och maskininlärningsalgoritmen skrivs i C++ och ANN (Artificiell Neuralt Nätverk) används för att klassificera det fysiska fallet. ANN kan approximera ett värde som tyder på ett falskt fall efter 10 000 träningssekvenser inom 5% av ett teoretiskt värde som tyder på ett resultat med 100% säkerhet och 0,0005% felmarginal. Ett teoretiskt värde som tyder på ett faktiskt fall kan klassificeras efter 5000 träningssekvenser inom 5% av det eftersökta värdet med 100% säkerhet och 0,0045% felmarginal. Shallow Neural network Ai fall classification Ai Maskininlärning fallklassificering Robotics Robotteknik och automation Computer Systems Datorsystem
79	Förutsättningar för artificiell intelligens i sjukvård - Med fokus på framtida arbetssätt i radiologisk praxis Persson, Emma, van Tran, Mylina January 2018 (has links) Syftet med studien har varit att undersöka hur artificiell intelligens (AI) påverkar sjukvården. Forskning inom bildbehandling har visat på stor framgång och idag körs det en AI pilot på Danderyds Sjukhus. Studien har fokuserat på hur olika yrkesroller som forskare, sjukvårdspersonal, politiker och systemleverantörer upplever att AI kan förändra arbetssätt inom radiologi. Undersökningens datainsamling sker genom semi-strukturerade intervjuer och sekundärdata för att kunna besvara hur användningen av AI kan påverka sjukvårdens organisation.Resultatet visar på att det finns en resursbrist i sjukvården där det finns problematik med att hinna utföra samtliga arbetsuppgifter och sjukvården ser att antalet patienter kommer öka. AI visar på att ha goda möjligheter till att effektivisera arbetsflödet och möjligheten att utföra bättre diagnoser via olika verktyg. Det finns även tekniska hinder som omfattar datatillgångar, datavalidering och implementering av nya system till existerande system. Tillslut finns det även legala faktorer som skapar utmaningar.Slutsatserna från studien har beskrivit hur arbetssätt kommer att förändras och hur denna process går till samt att det har identifierats en ny roll i sjukvården. Studien har framställt att det finns ett stort gap mellan den teknologiska utvecklingen och implementation i klinisk praxis. Studien föreslår att man borde fokusera på att minska detta gap genom att bland annat sätta upp regleringar för AI. / The purpose of the study was to examine the impact of artificial intelligence (AI) on healthcare. Research in imaging has shown great success and today there is an AI pilot at Danderyds Hospital. The study has focused on different professional roles such as researchers, healthcare professionals, politicians and system vendors and their perspective on how AI will change the work methods within radiology. The survey data collection is conducted through semi-structured interviews and secondary data to answer how the use of AI will affect the healthcare organization. The result shows that there is a shortage of resources in the healthcare sector, where there are problems with performing all duties and furthermore the healthcare is seeing that the number of patients are increasing. AI shows good opportunities to streamline workflow and the ability to perform better diagnoses through different tools. There are also technical barriers that include data assets, data validation and implementing new systems to existing systems. Finally, there are legal factors that create challenges. The conclusions from the study have described how the working methods will change and how this process will work and that a new role in healthcare has been identified. The study has suggested that there is a large gap between technical development and implementation in clinical practice. The study proposes focusing on reducing this gap by setting up regulations for AI. Artificiell Intelligens Sjukvård Bildbehandling Radiologi Maskininlärning Verksamhetsutveckling Radiologer Engineering and Technology Teknik och teknologier
80	Maskin eller läkare? Vendela, Talenti January 2019 (has links) I denna studie undersöks individers generella attityder till vårdapplikationer som använder maskininlärning. Datainsamlingen har skett genom både kvalitativa och kvantitativa metoder som kompletterar varandra. Metoderna innefattar en enkätundersökning och två fokusgrupper baserade på scenario-based design. Teorin är baserad på forskning inom digitaliseringen av vården, bland annat maskininlärning och mHealth, som ligger till grund och stödjer undersökningen. Även teori om attityder och förtroende till digitaliseringen av vården har underbyggt undersökningen.I slutsatsen framkommer det att det finns en korrelation mellan hög medvetenhet och positiv inställning när det kommer det användandet av vårdapplikationer med maskininlärning. Den generella attityden till att få en diagnos av maskininlärning är negativ då de flesta föredrar att få en diagnos förmedlad av en läkare. Studien indikerar på att detta kan bero på att patienterna söker empati från vården, vilket artificiell intelligens saknar. Förtroendet för en vårdapplikation grundar sig främst i ryktet om den men även i vilket företag eller organisation som ligger bakom. Studien indikerar på att individer är positivt inställda till att bidra med privat hälsodata till en vårdapplikation om det leder till förebyggande av sjukdom. Studien ger även en antydan på att det finns en rädsla kring var privata hälsodata hamnar när den har lämnats ut. / This study aims to research on individuals’ general attitudes towards healthcare applications that use machine learning. The data collection has taken place through both qualitative and quantitative methods as a complement to each other. The methods include a questionnaire survey, two focus groups based on scenario-based design. The theory is based on research in the digitalisation of healthcare, including machine learning and mHealth, which is based and supports the investigation. The theory of attitudes and confidence in the digitalisation of care also forms the basis for the study.The conclusion shows that there is a correlation between high awareness and positive attitude when it comes to the use of healthcare applications with machine learning. The general attitude towards a diagnosis from machine learning is negative since most people prefer to get a diagnosis mediated by a doctor. The study indicates that this may be because the patients seek empathy from the healthcare system, which artificial intelligence lacks of. Trust towards a healthcare application is based primarily on the reputation of it, but also in which company or organization that is behind it. The respondents in the survey are positive about contributing with their personal data to a healthcare application if it leads to a prevention of a disease. The study also gives an indication that there is a fear of what happens with private health data. Artificiell Intelligens Maskininlärning mHealth Self-tracking Selfcare Engineering and Technology Teknik och teknologier

Search results