Global ETD Search

161	Evaluating CNN Architectures on the CSAW-M Dataset / Evaluering av olika CNN Arkitekturer på CSAW-M Kristoffersson, Ludwig, Zetterman, Noa January 2022 (has links) CSAW-M is a dataset that contains about 10 000 x-ray images created from mammograms. Mammograms are used to identify patients with breast cancer through a screening process with the goal of catching cancer tumours early. Modern convolutional neural networks are very sophisticated and capable of identifying patterns nearly indistinguishable to humans. CSAW-M doesn’t contain images of active cancer tumours, rather, whether the patient will develop cancer or not. Classification tasks such as this are known to require large datasets for training, which is cumbersome to acquire in the biomedical domain. In this paper we investigate how classification performance of non-trivial classification tasks scale with the size of available annotated images. To research this, a wide range of data-sets are generated from CSAW-M, with varying sample size and cancer types. Three different convolutional neural networks were trained on all data-sets. The study showed that classification performance does increase with the size of the annotated dataset. All three networks generally improved their prediction on the supplied benchmarking dataset. However, the improvements were very small and the research question could not be conclusively answered. The primary reasons for this was the challenging nature of the classification task, and the size of the data-set. Further research is required to gain more understanding of how much data is needed to yield a usable model. / CSAW-M är ett dataset som innehåller ungefär 10 000 röntgenbilder skapade från ett stort antal mammografier. Mammografi används för att identifiera patienter med bröstcancer genom en screeningprocess med målet att fånga cancerfall tidigt. Moderna konvolutionella neurala nätverk är mycket sofistikerade och kan tränas till att identifiera mönster i bilder mycket bättre än människor. CSAW-M innehåller inga bilder av cancertumörer, utan istället data på huruvida patienten kommer att utveckla cancer eller inte. Klassificeringsuppgifter som denna är kända för att kräva stora datamängder för träning, vilket är svårt att införskaffa inom den biomedicinska domänen. I denna artikel undersöker vi hur klassificerings prestanda för svåra klassificeringsuppgifter skalar med storleken på tillgänglig annoterad data. För att undersöka detta, genererades ett antal nya dataset från CSAW-M, med varierande storleksurval och cancertyp. Tre olika konvolutionella neurala nätverk tränades på alla nya data-set. Studien visar att klassificeringsprestanda ökar med storleken på den annoterade datamängden. Alla tre nätverk förbättrade generellt sin klassificeringsprestanda desto större urval som gjordes från CSAW-M. Förbättringarna var dock små och den studerade frågan kunde inte besvaras fullständigt. De främsta anledningarna till detta var klassificeringsuppgiftens utmanande karaktär och storleken på det tillgängliga datat i CSAW-M. Ytterligare forskning krävs för att få mer förståelse för hur mycket data som behövs för att skapa en användbar modell. CNN Convolutional Neural Network Transfer Learning Breast Cancer Bachelor’s Thesis CNN Konvolutionella Neurala Nätverk Transfer Learning Bröstcancer Kandidat examensarbete Computer and Information Sciences Data- och informationsvetenskap
162	Polar Codes for Biometric Identification Systems / Polära Koder för Biometriska Identifieringssystem Bao, Yicheng January 2022 (has links) Biometrics are widely used in identification systems, such as face, fingerprint, iris, etc. Polar code is the only code that can be strictly proved to achieve channel capacity, and it has been proved to be optimal for channel and source coding. In this degree project, our goal is to apply polar codes algorithms to biometric identification systems, and to design a biometric identification system with high identification accuracy, low system complexity, and good privacy preservation. This degree project has carried out specific and in-depth research in four aspects, following results are achieved: First, idea of polar codes is learnt, for example channel combination, channel splitting, successive cancellation decoding. The successive cancellation and successive cancellation list algorithm are also applied to encoding, which further realizes polar codes for source coding. Second, using autoencoder to process biometrics. Autoencoder is introduced to compress fingerprints into binary sequences of length 1024, it has 5 encoding layers and 12 decoding layers, achieved reconstruction error is 0.03. The distribution is close to Gaussian distribution, and compressed codes are quantized into binary sequences. Properties of sequences are similar with random sequences in terms of entropy, correlation, variance. Third, the identification system under Wyner-Ziv problem is studied with fingerprints. In enrollment phase, encoding algorithms are designed to compress biometrics, and in identification phase, decoding algorithms are designed to estimate the original sequence based on decoded results and noisy sequence. Maximum mutual information method is used to identify users. Results show that with smaller number of users, longer code length, smaller noise, then recognition error rate is lower. Fourth, human faces are used in the generated secret key system. After fully considering the trade off to achieve optimal results, in enrollment phase both public data and secure data are generated, in identification phase user’s index and secret key are estimated. A hierarchical structure is further studied. First, CNN is used to classify the age of faces, and then the generated secret key system is used for identification after narrowing the range. The system complexity is reduced by 80% and the identification accuracy is not reduced. / Biometriska kännetecken används i stor utsträckning i identifieringssystem, kännetecken såsom ansikte, fingeravtryck, iris, etc. Polär kod är den enda koden som strikt bevisats uppnå kanalkapacitet och den har visat sig vara optimal för kanal- och källkodning. Målet med detta examensarbete är att tillämpa polära kodalgoritmer på biometriska identifieringssystem, och att designa ett biometriskt identifieringssystem med hög identifieringsnoggrannhet, låg systemkomplexitet och bra integritetsskydd. Under examensarbetet har det genomförts specifik och djupgående forskning i fyra aspekter, följande resultat har uppnåtts: För det första introduceras idén om polära koder, till exempel kanalkombination, kanaluppdelning, successiv annulleringsavkodning. Algoritmerna för successiv annullering och successiv annulleringslista tillämpas även på kodning,vilket ytterligare realiserar polära koders användning för källkodning. För det andra används autoencoder för att bearbeta biometriska uppgifter. Autoencoder introduceras för att komprimera fingeravtryck till binära sekvenser med längden 1024, den har 5 kodningslager och 12 avkodningslager, det uppnådda rekonstruktionsfelet är 0,03. Fördelningen liknar en normaldistribution och komprimerade koder kvantiseras till binära sekvenser. Egenskaperna för sekvenserna liknar slumpmässiga sekvenser vad gäller entropi, korrelation, varians. För det tredje studeras identifieringssystemet under Wyner-Ziv-problemet med fingeravtryck. I inskrivningsfasen är kodningsalgoritmer utformade för att komprimera biometriska kännetecken, och i identifieringsfasen är avkodningsalgoritmer utformade för att estimera den ursprungliga sekvensen baserat på avkodade resultat och brusiga sekvenser. Maximal ömsesidig informationsmetod används för att identifiera användare. Resultaten visar att med ett mindre antal användare, längre kodlängd och mindre brus så är identifieringsfelfrekvensen lägre. För det fjärde används mänskliga ansikten i det genererade hemliga nyckelsystemet. Efter att noggrant ha övervägt kompromisser fullt ut för att uppnå det optimala resultatet genereras både offentlig data och säker data under registreringsfasen, i identifieringsfasen uppskattas användarens index och säkerhetsnyckel. En hierarkisk struktur studeras vidare. Först används CNN för att klassificera ålder baserat på ansikten och sedan används det genererade hemliga nyckelsystemet för identifiering efter att intervallet har begränsats. Systemkomplexiteten reduceras med 80% men identifieringsnoggrannheten reduceras inte. Biometrics Polar codes Identification systems Convolutional neural networks Autoencoder Privacy preservation Biometri Polära koder Identifieringssystem Konvolutionella neurala nätverk Autoencoder Sekretessskydd Elektroteknik och elektronik
163	Period Drama : Punctuation restoration in Swedish through fine- tuned KB-BERT / Dags att sätta punkt : Återställning av skiljetecken genom finjusterad KB-BERT Sinderwing, John January 2021 (has links) Presented here is a method for automatic punctuation restoration in Swedish using a BERT model. The method is based on KB-BERT, a publicly available, neural network language model pre-trained on a Swedish corpus by National Library of Sweden. This model has then been fine-tuned for this specific task using a corpus of government texts. With a lower-case and unpunctuated Swedish text as input, the model is supposed to return a grammatically correct punctuated copy of the text as output. A successful solution to this problem brings benefits for an array of NLP domains, such as speech-to-text and automated text. Only the punctuation marks period, comma and question marks were considered for the project, due to a lack of data for more rare marks such as semicolon. Additionally, some marks are somewhat interchangeable with the more common, such as exclamation points and periods. Thus, the data set had all exclamation points replaced with periods. The fine-tuned Swedish BERT model, dubbed prestoBERT, achieved an overall F1-score of 78.9. The proposed model scored similarly to international counterparts, with Hungarian and Chinese models obtaining F1-scores of 82.2 and 75.6 respectively. As further comparison, a human evaluation case study was carried out. The human test group achieved an overall F1-score of 81.7, but scored substantially worse than prestoBERT on both period and comma. Inspecting output sentences from the model and humans show satisfactory results, despite the difference in F1-score. The disconnect seems to stem from an unnecessary focus on replicating the exact same punctuation used in the test set, rather than providing any of the number of correct interpretations. If the loss function could be rewritten to reward all grammatically correct outputs, rather than only the one original example, the performance could improve significantly for both prestoBERT and the human group. / Här presenteras en metod för automatisk återinföring av skiljetecken på svenska med hjälp av ett neuralt nätverk i formen av en BERT-modell. Metoden bygger på KB-BERT, en allmänt tillgänglig språkmodell, tränad på ett svensk korpus, av Kungliga Biblioteket. Denna modell har sedan finjusterats för den här specifika uppgiften med hjälp av ett korpus av offentliga texter från landsting och dylikt. Med svensk text utan versaler och skiljetecken som inmatning, ska modellen returnera en kopia av texten där korrekta skiljetecken har placerats ut på rätta platser. En framgångsrik modell ger fördelar för en rad domäner inom neurolingvistisk programmering, såsom tal- till- texttranskription och automatiserad textgenerering. Endast skiljetecknen punkt, kommatecken och frågetecken tas i beaktande i projektet på grund av en brist på data för de mer sällsynta skiljetecknen såsom semikolon. Dessutom är vissa skiljetecken någorlunda utbytbara mot de vanligaste tre, såsom utropstecken mot punkt. Således har datasetets alla utropstecken ersatts med punkter. Den finjusterade svenska BERT-modellen, kallad prestoBERT, fick en övergripande F1-poäng på 78,9. De internationella motsvarande modellerna för ungerska och kinesiska fick en övergripande F1-poäng på 82,2 respektive 75,6. Det tyder på att prestoBERT är på en liknande nivå som toppmoderna motsvarigheter. Som ytterligare jämförelse genomfördes en fallstudie med mänsklig utvärdering. Testgruppen uppnådde en övergripande F1-poäng på 81,7, men presterade betydligt sämre än prestoBERT på både punkt och kommatecken. Inspektion av utdata från modellen och människorna visar tillfredsställande resultat från båda, trots skillnaden i F1-poäng. Skillnaden verkar härstamma från ett onödigt fokus på att replikera exakt samma skiljetecken som används i indatan, snarare än att återge någon av de många korrekta tolkningar som ofta finns. Om loss-funktionen kunde skrivas om för att belöna all grammatiskt korrekt utdata, snarare än bara originalexemplet, skulle prestandan kunna förbättras avsevärt för både prestoBERT såväl som den mänskliga gruppen. NLP Transformer punctuation restoration BERT KB-BERT machine learning neural network NLP Transformer återinföring av skiljetecken BERT KB-BERT maskininlärning neurala nätverk Computer and Information Sciences Data- och informationsvetenskap
164	Modern Credit Value Adjustment / Modern Kreditvärdejustering Ratusznik, Wojciech January 2021 (has links) Counterparty risk calculations have gained importance after the latest financial crisis. The bankruptcy of Lehman Brothers showed that even large financial institutiones face a risk of default. Hence, it is important to measure the risk of default for all the contracts written between financial institutions. Credit Value Adjustment, CVA, is an industry standard method for such calculations. Nevertheless, the implementation of this method is contract dependent and the necessary computer simulations can be very intensive. Monte Carlo simulations have for a long time been known as a precise but slow technique to evaluate the cash flows for contracts of all kinds. Measuring the exposure of a contract written on structured products might require half a day of calculations if the implementation is written without significant optimization. Several ideas have been presented by researchers and applied in the industry, the idea explored and implemented in this thesis was based on using Artificial Neural Networks in Python. This procedure require a decomposition of the Expected Exposure calculation within the CVA and generating a large data set using a standard Monte Carlo simulation. Three network architectures have been tested and the final performance was compared with using standard techniques for the very same calculation. The performance gain was significant, a portfolio of 100 counterparties with 10 contracts each would take 20 minutes of calculations in total when using the best performing architecture whereas a parallel C++ implementation of the standard method would require 2.6 days. Credit Value Adjustment Monte Carlo simulations Artificial neural networks Financial risk management Stochastic calculus Kreditvärdejustering Monte Carlo simuleringar Artificiella neurala nätverk Riskvärdering Stokastisk analys Other Mathematics Annan matematik
165	A Cycle-Accurate Simulator for Accelerating Convolution on AXI4-based Network-on-Chip Architecture / En cykelexakt simulator för att accelerera konvolution på AXI4-baserad nätverk-på-chip-arkitektur Liu, Mingrui January 2024 (has links) Artificial intelligence is probably one of the most prevalent research topics in computer science area, because the technology, if well developed and used properly, is promising to affect the daily lives of ordinaries or even reshape the structure of society. In the meantime, the end of Moore’s Law has promoted the development trend towards domain-specific architectures. The upsurge in researching specific architectures for artificial intelligence applications is unprecedented. Network-on-Chip (NoC) was proposed to address the scalability problem of multi-core system. Recently, NoC has gradually appeared in deep learning computing engines. NoC-based deep learning accelerator is an area worthy of research and currently understudied. Simulating a system is an important step in computer architecture research because it not only allows for rapid verification and measurement of design’s performance, but also provides guidance for subsequent hardware design. In this thesis, we present CNNoCaXiM, a flexible and cycle-accurate simulator for accelerating 2D convolution based on NoC interconnection and AXI4 protocol. We demonstrate its ability by simulating and measuring a convolution example with two different data flows. This simulator can be very useful for upcoming research, either as a baseline case or as a building block for further research. / Artificiell intelligens är förmodligen ett av de vanligaste forskningsämnena inom datavetenskap, eftersom tekniken, om den väl utvecklas och används på rätt sätt, lovar att påverka vanliga människors vardag eller till och med omforma samhällets struktur. Under tiden har slutet av Moores lag främjat utvecklingstrenden mot domänspecifika arkitekturer. Uppsvinget i forskning om specifika arkitekturer för tillämpningar av artificiell intelligens är utan motstycke. Network-on-Chip (NoC) föreslogs för att ta itu med skalbarhetsproblemet med flerkärniga system. Nyligen har NoC gradvis dykt upp i djuplärande datormotorer. NoC-baserad accelerator för djupinlärning är ett område som är värt forskning och för närvarande understuderat. Simulering av ett system är ett viktigt steg i forskning om datorarkitektur eftersom det inte bara möjliggör snabb verifiering och mätning av designens prestanda, utan också ger vägledning för efterföljande hårdvarudesign. I detta examensarbete presenterar vi CNNoCaXiM, en flexibel och cykelnoggrann simulator för att accelerera 2D-faltning baserad på NoC-interconnection och AXI4-protokoll. Vi visar dess förmåga genom att simulera och mäta ett faltningsexempel med två olika dataflöden. Denna simulator kan vara mycket användbar för kommande forskning, antingen som ett grundfall eller som en byggsten för vidare forskning. Artificial Intelligence Deep learning Convolutional Neural Networks AXI4 protocol Network-on-Chip Artificiell intelligens Djup lärning Konvolutionella neurala nätverk AXI4 protokoll Nätverk-på-chip Computer and Information Sciences Data- och informationsvetenskap
166	Neural Network Quantum State Ansatz for the Nuclear Pairing Problem / Neuralt Nätverks Kvanttillståndsansats för Kärnparsproblemet Bonnier, Isabelle January 2024 (has links) As a degree project in Theoretical Physics, the variational MCMC-scheme aided by neural network quantum states was examined for the purpose ofsolving the nuclear pairing model. The method entailed minimization of the local energy sampled via the Born distribution obtained through the neural network output.Both the ground and excited states' energies were computed, where the latter case used an extended loss function which included the overlap to the former.The NNQS-ansatz worked well when emulating the ground state, in which case the Stochastic Reconfiguration optimization method was particularly effective. This optimization method resulted in relative fast convergence to low variance states, and did not require a large number of hyperparameter modifications. Ultimately, all resulting energy intervals encompassed the exact ground state solutions, and had relative errors equal to or near zero.For the excited states, the VMC-NNQS was less effective, as each individual occupation number state investigated required considerable hyperparameter testing before reasonably low lying energy eigenstates could be obtained. Moreover, the convergence properties were less distinguished than for the ground state, as the optimization struggled to maintain orthogonality to the ground state. Nonetheless, the final results included the nearest solutions of the first excited states for several systems and indicated correlation energies similar to those of the ground state. / Som examensarbete inom teoretisk fysik undersöktes den variationella MCMC-metoden tillsammans med neurala nätverk i syfte att lösa kärnparsmodellen. Metoden innebar minimering av den lokala energin som samplades via Born-fördelningen erhållen genom utdata från neurala nätverksapproximationer. Både grundtillståndets och exciterade tillstånds energier beräknades, där det senare fallet använde en utökad kostnadsfunktion som inkluderade överlappet med det förnämnda. NNQS-ansatsen fungerade väl vid emulering av grundtillståndet, i vilket fall optimeringsmethoden stokastisk omkonfigurering (Stochastic Reconfiguration) var särskilt effektivt. Denna optimeringsmetod resulterade i relativt snabb konvergens till lågvarianstillstånd och krävde inte ett stort antal hyperparametriska modifieringar. De slutliga energiintervallen innefattade de exakta lösningarna för grundtillstånden med en relativ felmarginal lika med eller nära noll. För exciterade tillstånd var VMC-NNQS mindre effektivt, eftersom varje enskilt ockupationstillstånd som undersöktes krävde en ansenlig mängd hyperparametrisk testning innan rimligt låga egentillstånd kunde erhållas. Dessutom var konvergensensegenskaperna mycket mindre särspäglade än för grundtillståndet, eftersom optimeringen inte fullt kunde upprätthålla ortogonaliteten mot grundtillståndet. Likväl inkluderade de slutliga resultaten de närmaste lösningarna av de första exciterade energierna för ett flertal system, och visade på korrelationsenergier liknande de för grundtillståndet. Neural Network NNQS Variational Monte Carlo VMC Quantum Many-Body Problem Neurala nätverk NNQS Variations Monte Carlo VMC Kvantfysik Kvanttillstånd Physical Sciences Fysik
167	Machine Learning for State Estimation in Fighter Aircraft / Maskininlärning för tillståndsestimering i stridsflygplan Boivie, Axel January 2023 (has links) This thesis presents an estimator to assist or replace a fighter aircraft’s air datasystem (ADS). The estimator is based on machine learning and LSTM neuralnetworks and uses the statistical correlation between states to estimate the angleof attack, angle of sideslip and Mach number using only the internal sensorsof the aircraft. The model is trained and extensively tested on a fighter jetsimulation model and shows promising results. The methodology and accuracyof the estimator are discussed, together with how a real-world implementationwould work. The estimators presented should act as a proof of concept of thepower of neural networks in state estimation, whilst the report discusses theirstrengths and weaknesses. The estimators can estimate the three targets wellin a vast envelope of altitudes, speeds, winds and manoeuvres. However, thetechnology is quite far from real-world implementation as it lacks transparencybut shows promising potential for future development. / Det här examensarbetet presenterar en estimator för att hjälpa eller ersätta ettstridsflygplans luftdatasystem (ADS). Estimatorn är baserad på maskininlärningoch LSTM neurala nätverk och använder statistisk korrelation mellan tillstånd föratt uppskatta anfallsvinkeln, sidglidningsvinkel och Mach-tal endast med hjälpav flygplanets interna sensorer. Modellen är tränad och utförligt testad på ensimuleringsmodell för stridsflygplan och visar lovande resultat. Estimatornsmetodik och noggrannhet diskuteras, tillsammans med hur en implementeringi verkligheten skulle fungera. De presenterade estimatorerna bör fungera somett “proof of concept” för kraften hos neurala nätverk för tillståndsuppskattning,medan rapporten diskuterar deras styrkor och svagheter. Estimatorerna kanuppskatta de tre tillstånden väl i ett stort spektra av altituder, hastigheter, vindaroch manövrar. Tekniken är dock ganska långt ifrån en verklig implementeringeftersom den saknar transparens, men visar lovande potential för framtidautveckling. State estimation machine learning fighter aircraft neural networks long short- term memory LSTM sensor fusion air data system ADS. Tillståndsestimering maskininlärning stridsflygplan neurala nätverk sensorfusion luftdatasystem. Mathematics Matematik
168	SU-MIMO Port Selection Using Convolutional Neural Networks Jonsson, Samuel January 2024 (has links) Background: The exponential increase in user equipment (UE) units within mobile networks necessitates more efficient Massive MIMOalgorithms. To address this demand, integrating artificial intelligence (AI) into various network aspects is gaining traction. Goal: This thesis explores the feasibility of employing a lightweight convolutional neural network (CNN) to optimize port selection in single-usermultiple-input multiple-output (SU-MIMO) networks. Port selection, a critical component of all forms of MIMO networks, determines theoptimal ports on a UE for data transmission. The objective is to enhance selection speed, reduce computational complexity,and minimize memory consumption. Method: The methodology involves a quasi-experiment where a CNN model, trained on data transfer logs between a basestation and a UE, specifically a mobile phone, is compared with a self-implemented version of the port selection algorithm utilised in Ericssonbase stations. The evaluation criteria include time-, computational-, and spatial complexity. The accuracy of the port selection capabilities of themodels is also recorded. Results: Despite the complexity of the CNN models, the results indicate subpar performance and low test accuracies.This suggests that achieving satisfactory performance would either necessitate an increased model complexity and size or that a convolutionalneural network is not the correct choice for replacing the algorithm. Conclusion: In conclusion, the thesis finds that a lightweight CNN may not be the optimal solution for port selectionoptimization in SU-MIMO networks. However, it suggests potential avenues for further research to explore alternative approaches to this task. / Bakgrund: Den exponentiella ökningen av användarutrustning (UE-enheter) inom mobila nätverk kräver mer effektiva massive MIMO-algoritmer.För att möta detta behov har intresset för att integrera artificiell intelligens (AI) i olika delar av de modila nätverkan ökat mer och mer. Mål: Denna avhandling utforskar möjligheten att använda ett lättviktigt konvolutionellt neuralt nätverk för att optimera port selection i single usermultiple-input multiple-output (SU-MIMO) nätverk. Port selection, en viktig komponent i alla former av MIMO-nätverk, avgör de optimala portarna på enUE för dataöverföring. Det slutliga målet är att förbättra valhastigheten, minska beräkningskomplexiteten och minimeraminnesanvändningen, jämfört med den nuvarande algoritmen som används på Ericsson basstationer. Metod: Metodiken innefattar ett kvasiexperiment där en CNN-modell, tränad på dataöverföringsloggar mellan en basstation och en UE, specifikt enmobiltelefon, jämförs med en egenimplementerad version av portvalsalgoritmen som används i Ericssons basstationer. Utvärderingskriterierna inkluderartids-, beräknings- och rumskomplexitet. Även noggrannheten i portvalsmodellerna mätes. Resultat: Trots den komplexa naturen hos modellerna indikerar resultaten undermålig prestanda och låga testnoggrannheter.Detta antyder att för att uppnå tillfredsställande prestanda antinge skulle kräva en ökad modellkomplexitet och storlek, eller att ett konvolutionelltneuralt nätverk inte är den optimala lösningen för att ersätta den nuvarande algorithmen. Slutsats: Slutligen konstaterar avhandlingen att ett konvolutionellt neuralt nätverk inte är den optimala lösningen för optimering av port selectioni SU-MIMO-nätverk, då ett lättviktigt sådant inte kan uppnå en acceptabel prestanda. Dock föreslår den potentiella riktningar för vidare forskningför att utforska alternativa tillvägagångssätt för denna uppgift. SU-MIMO convolutional neural networks Port Selection Optimisation Telecommunication SU-MIMO Konvolutionella Neurala Nätverk Portval Optimering Telekommunikation Övrig annan teknik
169	Makespan Estimation for Decreased Schedule Generation Time : Neural Network Job Shop Scheduling Optimisation Holm, Tobias, Waters, Phoebe January 2024 (has links) Background: Optimal scheduling is a common practice in various industries, facili-tating efficient workflow management. Accelerating the generation of schedules while maintaining their optimality could encourage broader adoption of this approach inindustry settings. Previous work has aimed to estimate the makespan for the JobShop Scheduling Problem, showing promising results. Objectives: Given the increasing demand for AI and Machine Learning (ML) solutions across industries, this research aims to explore the integration of ML techniquesinto optimal scheduling processes. Specifically, the goal is to develop a faster scheduling solution without compromising the optimality of the generated schedules. The proposed approach combines the effectiveness and speed of ML with the optimal results obtained from mathematical scheduling models. Methods: This thesis focuses on the Job Shop Scheduling (JSS) Problem, where a mathematical scheduler is tasked with minimizing the makespan of a set of jobs while following a predefined set of rules. An initial investigation is performed to establish if there is potential in providing the scheduler with its optimal makespan to decrease the scheduling time. To generalize the application of the concept, the study investigates the potential efficiency acceleration achieved by providing the scheduler with a Machine Learning estimated makespan. This involves training a Neural Network(NN) to estimate the optimal makespan of job sets, which is then utilized to speedup the scheduling process. Results: The preliminary investigation demonstrates that providing the scheduler with the optimal makespan results in an average speed-up of schedule generationby 24%. The results of the scheduling time with the NN estimated makespan is on the other hand not as well performing. Despite achieving a level of accuracy in estimating the makespan, the resulting speed-up in the scheduler’s performance falls short. For the scheduler to benefit from being provided an estimated makespan it is therefore theorized to require a close-to-perfect estimation of the makespan, which was not achieved with the trained NN model. The trained NN reached an average accuracy of 95.75%. Conclusions: The study concludes that while ML models can accurately estimate makespan, the observed speed-up in scheduling performance is not as significant as anticipated. The correlation between well-estimated makespan and speed-up appearsto be inconsistent, indicating potential limitations in the current approach. Further investigation into the search algorithm employed by the scheduling tool Gurobi mayprovide insights into optimizing the scheduling process more effectively. In summary, while the integration of ML techniques shows promise in accelerating scheduling processes, a higher accuracy of the ML model would be required. Additional researchis needed to refine the approach and potentially bring a faster optimal scheduling solution into the future. / Bakgrund: Optimal schemaläggning är en vanlig implemetation inom flera olika branscher och underlättar hantering och effektiviserar arbetsflöden. Att påskynda genereringen av scheman samtidigt som den optimala aspekten av schemaläggning inte går till spillo, skulle kunna främja en bredare användning av optimal schemaläggning för fler brancher. Tidigare undersökningar har gjorts för att estimera "makespan" för Job Shop problemet inom schemaläggning och har visat lovande resultat. Syfte: Med den ökande efterfrågan på AI- och maskininlärnings lösningar inom olika branscher syftar denna forskning till att utforska integrationen av ML-tekniker i den optimala schemaläggningsprocessen. Målet är att utveckla en snabbare schemaläggningslösning utan att kompromissa med det genererade schemats optimalitet. Det föreslagna tillvägagångssättet kombinerar ML’s effektivitet och hastighet med de optimala resultaten som den matematiska schemaläggningsmodellen erbjuder. Metod: Forskningen fokuserar på problemet med schemaläggning för jobbshoppen(JSSP), där en matematisk schemaläggare har i uppgift att minimera makespan fören uppsättning jobb med hänsyn till ett par fördefinierade regler. En initial under-sökning görs, vilket visar att det finns potential i att tillhandahålla schemaläggarendess optimala makespan för att minska schemaläggningstiden. För att generalisera tillämpningen undersöker studien den potentiella accelerationen som uppnås genomatt tillhandahålla schemaläggaren ett maskininlärt uppskattat makespan. Detta medför att träna ett neuralt nätverk för att uppskatta det optimala makespanet för en mängd jobbuppsättningar, som sedan används för att påskynda schemaläggningsprocessen. Resultat: Den preliminära undersökningen visar att schemaläggaren resulterar i igenomsnittlig hastighetsökning av schemagenereringen med cirka 24% när den får tillgång till det optimala makespanet för de givna jobben. Resultaten av schemaläggningstiden med det neurala nätverkets uppskattade makespan är dock lägre än förväntat. Trots att en viss noggrannhetsnivå uppnås vid estimeringen av makespanet, når den resulterande hastighetsökningen i schemaläggarens prestanda inte upp tillförväntningarna. För att schemaläggaren ska dra nytta av att tillhandahålla ett uppskattad makespan krävs en nära perfekt uppskattning av makespan, vilket inte uppnåddes med det tränade neurala nätverket. Slutsatser: Studien drar slutsatsen att även om ML-modeller kan uppskatta makespan någorlunda noggrant, är den observerade hastighetsökningen i schemaläggningen inte lika betydande som förväntat. Korrelationen mellan väl uppskattad makespan och hastighetsökning verkar vara inkonsekvent, vilket indikerar potentiella begränsningar i det nuvarande tillvägagångssättet. Vidare undersökning av sökalgoritmen som används av schemaläggningsverktyget Gurobi kan ge insikter för att optimera schemaläggningsprocessen mer effektivt. Sammanfattningsvis visar integrationen av ML-tekniker lovande resultat för att accelerera schemaläggningsprocesser, men en bättre estimering av makespan skulle krävas. Ytterligare forskning behövs för att förbättra tillvägagångssättet och potentiellt introducera en snabbare optimal schemaläggningslösning för framtiden. Job Shop Scheduling Makespan Estimation Neural network Mathematical Optimisation Hospital Operation Scheduling Job Shop Schemaläggning Makespan Estimat Neurala Nätverk Matem- atisk Optimering OperationSals Schemaläggning Computer Systems Datorsystem
170	Neurala nätverk försjälvkörande fordon : Utforskande av olika tillvägagångssätt / Neural Networks for Autonomous Vehicles : An Exploration of Different Approaches Hellner, Simon, Syvertsson, Henrik January 2021 (has links) Artificiella neurala nätverk (ANN) har ett brett tillämpningsområde och blir allt relevantare på flera håll, inte minst för självkörande fordon. För att träna nätverken användsmeta-algoritmer. Nätverken kan styra fordonen med hjälp av olika typer av indata. I detta projekt har vi undersökt två meta-algoritmer: genetisk algoritm (GA) och gradient descent tillsammans med bakåtpropagering (GD & BP). Vi har även undersökt två typer av indata: avståndssensorer och linjedetektering. Vi redogör för teorin bakom de metoder vi har försökt implementera. Vi lyckades inte använda GD & BP för att träna nätverk att köra fordon, men vi redogör för hur vi försökte. I resultatdelen redovisar vi hur det med GA gick att träna ANN som använder avståndssensorer och linjedetektering som indata. Sammanfattningsvis lyckades vi implementera självkörande fordon med två olika typer av indata. / Artificial Neural Networks (ANN) have a broad area of application and are growing increasingly relevant, not least in the field of autonomous vehicles. Meta algorithms are used to train networks, which can control a vehicle using several kinds of input data. In this project we have looked at two meta algorithms: genetic algorithm (GA), and gradient descent with backpropagation (GD & BP). We have looked at two types of input to the ANN: distance sensors and line detection. We explain the theory behind the methods we have tried to implement. We did not succeed in using GD & BP to train ANNs to control vehicles, but we describe our attemps. We did however succeeded in using GA to train ANNs using a combination of distance sensors and line detection as input. In summary we managed to train ANNs to control vehicles using two methods of input, and we encountered interesting problems along the way. artificial neural networks gradient descent genetic algorithm backpropagation unity self driving autonomous vehicles line detection neural network neural net artificiella neurala nätverk neurala nätverk självkörande bilar självkörande fordon unity bakåtpropagering linjedetektering gradient descent genetisk algoritm neurala nät Computer Sciences Datavetenskap (datalogi)

Search results