Global ETD Search

1	Anomaly Detection on Embedded Sensor Processing Platform Cao, Yichen January 2021 (has links) Embedded platforms are often used as a sensor data processing node to collect data and transmit the data to the remote server. Due to the poor performance and power limitation, data processing was often left to the remote server. With the improvement of the computation ability, it is becoming possible to do some partial data processing on the embedded platforms, which would reduce the power and time consumption on the data transmission. Moreover, processing the data locally on the embedded platforms could reduce the dependence on the network. The platform could even do some tasks offline. This project aims to explore effective data analysis methods, especially for anomaly detection, which could be implemented on the embedded platform to be analyzed and detected locally. In this project, we select four methods: Seasonal and Trend Decomposition Using Loess (STL), Autoregressive Integrated Moving Average Model (ARIMA), Vector Autoregression (VAR), Long ShortTerm Memory (LSTM), to implement on the embedded platform ESP32. To test which methods could better fit the platform, we evaluate and compare the result from two aspects: the time overhead and the accuracy. The results show that the STL has the highest detection accuracy, but its time overhead is significantly higher than all other methods. ARIMA has the smallest time overhead and higher accuracy than LSTM and VAR. For LSTM, the method performs better with univariable input than multivariable input. Finally, we discuss the factors that may influence the result and future works. / Inbäddade plattformar används ofta som en sensor databehandlingsnod för att samla in och sedan överföra data till fjärrservern. Databehandling lämnades ofta till fjärrservern på grund av den dåliga prestandan och effektbegränsningen. Med förbättrad beräkningsförmåga blir det framkomligt att göra en del databehandling på de inbäddade plattformarna, vilket skulle minska ström och tidsförbrukningen för dataöverföringen. För övrigt kan lokal behandling av data på de inbäddade plattformarna minska beroendet av nätverket. Plattformen kan till och med utföra vissa uppgifter I nedkopplat läge. Detta projekt avser att utforska effektiva dataanalysmetoder särskilt för avvikelsedetektering, som kan verkställas på den inbäddade plattformen för att analyseras och upptäckas lokalt. I det här projektet väljer vi fyra metoder för att införa på den inbäddade plattformen ESP32: Seasonal and Trend Decomposition Using Loess (STL), Autoregressive Integrated Moving Average Model (ARIMA), Vector Autoregression (VAR), Long Short-Term Memory (LSTM). För att testa vilka metoder som bättre passar plattformen utvärderar och jämför vi resultatet med hänsyn till två aspekter: tidsomkostnaderna och noggrannheten. Resultaten visar att STL har den högsta detektionsnoggrannheten, men dess tidsomkostning är betydligt högre än alla andra metoder. ARIMA har den minsta tidsomkostningen och högre noggrannhet än LSTM och VAR. För LSTM fungerar metoden bättre med univariable input än multivariable input. Slutligen diskuterar vi faktorerna som möjligtvis påverkar resultatet och framtida arbeten. Anomaly Detection ARIMA STL LSTM VAR Avvikelsedetektion ARIMA STL LSTM VAR Computer and Information Sciences Data- och informationsvetenskap
2	Anomaly Detection for Temporal Data using Long Short-Term Memory (LSTM) Singh, Akash January 2017 (has links) We explore the use of Long short-term memory (LSTM) for anomaly detection in temporal data. Due to the challenges in obtaining labeled anomaly datasets, an unsupervised approach is employed. We train recurrent neural networks (RNNs) with LSTM units to learn the normal time series patterns and predict future values. The resulting prediction errors are modeled to give anomaly scores. We investigate different ways of maintaining LSTM state, and the effect of using a fixed number of time steps on LSTM prediction and detection performance. LSTMs are also compared to feed-forward neural networks with fixed size time windows over inputs. Our experiments, with three real-world datasets, show that while LSTM RNNs are suitable for general purpose time series modeling and anomaly detection, maintaining LSTM state is crucial for getting desired results. Moreover, LSTMs may not be required at all for simple time series. / Vi undersöker Long short-term memory (LSTM) för avvikelsedetektion i tidsseriedata. På grund av svårigheterna i att hitta data med etiketter så har ett oövervakat an-greppssätt använts. Vi tränar rekursiva neuronnät (RNN) med LSTM-noder för att lära modellen det normala tidsseriemönstret och prediktera framtida värden. Vi undersö-ker olika sätt av att behålla LSTM-tillståndet och effekter av att använda ett konstant antal tidssteg på LSTM-prediktionen och avvikelsedetektionsprestandan. LSTM är också jämförda med vanliga neuronnät med fasta tidsfönster över indata. Våra experiment med tre verkliga datasetvisar att även om LSTM RNN är tillämpbara för generell tidsseriemodellering och avvikelsedetektion så är det avgörande att behålla LSTM-tillståndet för att få de önskaderesultaten. Dessutom är det inte nödvändigt att använda LSTM för enkla tidsserier. Computer Sciences Datavetenskap (datalogi)
3	Anomaly Detection for Monocular Camera-based Distance Estimation in Autonomous Driving / Avvikelsedetektion för monokulär kamerabaserad distanssuppskattning vid autonom körning Ge, Muchen January 2024 (has links) With the development of Autonomous Driving (AD) technology, there is a growing concern over the safety of the technology. Finding methods to improve the reliability of this technology becomes a current challenge. The AD system is composed of a perception module, a planning module, and a control module. The perception module, which provides information about the environment for the whole system, is a critical part of the AD system. This project aims to provide a better understanding of the functionality and reliability of the perception module of an AD system. In this project, a simple model of the perception module is built with YOLOv5-nano for object detection, StrongSORT for object tracking, and MonoDepth2 for depth estimation. The system takes images from a single camera as input and produces a time series of distance to the preceding vehicle. Fault injection technologies are utilized for testing the reliability of the system. Different faults, including weather factors, sensor faults, and encoder faults, are injected. The system behaviors under faults are observed and analyzed. Then multiple methods for anomaly detection are applied to the time series of distance data, including the statistic method ARIMA, and the machine learning methods MLP and LSTM. Comparisons are made among the anomaly detection methods, based on the efficiency and performance. The dataset in this project is generated by the CARLA simulator. / Med utvecklingen av tekniken för autonom körning (AD) växer oro över teknologins säkerhet. Att hitta metoder för att förbättra tillförlitligheten hos denna teknologi blir en aktuell utmaning. AD-systemet består av en perceptionsmodul, en planeringsmodul och en styrmodul. Perceptionsmodulen, som tillhandahåller information om miljön för hela systemet, är en kritisk del av AD-systemet. Detta projekt syftar till att ge en bättre förståelse för funktionaliteten och tillförlitligheten hos perceptionsmodulen i ett AD-system. I detta projekt byggs en enkel modell av perceptionsmodulen med YOLOv5-nano för objektdetektion, StrongSORT för objektföljning och MonoDepth2 för djupuppskattning. Systemet tar bilder från en enda kamera som inmatning och producerar en tidsserie av avståndet till det föregående fordonet. Felinjektionstekniker används för att testa systemets tillförlitlighet. Olika fel, inklusive väderfaktorer, sensorfel och maskininlärningsfel, injiceras. Systemets beteende under fel observeras och analyseras. Därefter tillämpas flera metoder för avvikelsedetektering på tidsserien av avstånd, inklusive statistikmetoden ARIMA samt maskininlärningsmetoderna MLP och LSTM. Jämförelser görs mellan avvikelsedetekteringsmetoderna, baserat på effektivitet och prestanda. Datamängden i detta projekt genereras av CARLAsimulatorn. Autonomous Driving Fault Injection Anomaly Detection Distance Estimation Autonom Korning Felinjektion Avvikelsedetektion Distansuppskattning Computer and Information Sciences Data- och informationsvetenskap
4	Anomaly-based intrusion detection using Tree Augmented Naive Bayes Classifier Wester, Philip January 2021 (has links) With the rise of information technology and the dependence on these systems, it becomes increasingly more important to keep the systems secure. The possibility to detect an intrusion with intrusion detection systems (IDS) is one of multiple fundamental technologies that may increase the security of a system. One of the bigger challenges of an IDS, is to detect types of intrusions that have previously not been encountered, so called unknown intrusions. These types of intrusions are generally detected by using methods collectively called anomaly detection methods. In this thesis I evaluate the performance of the algorithm Tree Augmented Naive Bayes Classifier (TAN) as an intrusion detection classifier. More specifically, I created a TAN program from scratch in Python and tested the program on two data sets containing data traffic. The thesis aims to create a better understanding of how TAN works and evaluate if it is a reasonable algorithm for intrusion detection. The results show that TAN is able to perform at an acceptable level with a reasonably high accuracy. The results also highlights the importance of using the smoothing operator included in the standard version of TAN. / Med informationsteknikens utveckling och det ökade beroendet av dessa system, blir det alltmer viktigt att hålla systemen säkra. Intrångsdetektionssystem (IDS) är en av många fundamentala teknologier som kan öka säkerheten i ett system. En av de större utmaningarna inom IDS, är att upptäcka typer av intrång som tidigare inte stötts på, så kallade okända intrång. Dessa intrång upptäcks oftast med hjälp av metoder som kollektivt kallas för avvikelsedetektionsmetoder. I denna uppsats utvärderar jag algoritmen Tree Augmented Naive Bayes Classifiers (TAN) prestation som en intrångsdetektionsklassificerare. Jag programmerade ett TAN-program, i Python, och testade detta program på två dataset som innehöll datatrafik. Denna uppsats ämnar att skapa en bättre förståelse för hur TAN fungerar, samt utvärdera om det är en lämplig algoritm för detektion av intrång. Resultaten visar att TAN kan prestera på en acceptabel nivå, med rimligt hög noggrannhet. Resultaten markerar även betydelsen av "smoothing operator", som inkluderas i standardversionen av TAN. Intrusion detection Anomaly detection Tree Augmented Naive Bayes Machine learning Network based intrusion detection. Intrångsdetektion Avvikelsedetektion Tree Augmented Naive Bayes Maskininlärning Nätverksbaserad intrångsdetektion Computer and Information Sciences Data- och informationsvetenskap
5	Anomaly Detection and Revenue Loss Estimation in Accounting Data Edholm, Gustav January 2020 (has links) Loss of revenue due to erroneous invoicing is a serious problem for many companies in the repair and maintenance industry. Revenue loss can occur in many ways, for example by consistently charging the wrong hourly price for services. If a company is experiencing revenue loss, it is incredibly important to detect it, find where it is happening, and estimate the size of it in order to treat it. The goal of this work is to find statistical methods for detecting incorrectly charged services in a dataset of invoices, and estimate the loss of revenue in the same dataset. The dataset used comes from a real company experiencing revenue loss through incorrectly charged prices for services, and thus represents a real world instance of this problem. Multiple machine learning methods with different levels of supervision are tested for detecting anomalous invoice items and estimating revenue loss using raw invoice data. Neural network regression, and different decision tree regression methods, as well as an ensemble of these are tested and compared. The dataset has ground truth labels for each price, thus results are compared to real world targets. It is found that an ensemble using a weighted average of predictions from neural network regression and gradient boosted decision tree regression to predict the charged prices in an invoice dataset performs anomaly detection most reliably. On the top 1000 anomaly candidates, this method flags anomalies correctly 87% of the time, catching 45% of all anomalies. Moreover, in terms of estimating revenue loss, using a neural network to perform regression, a revenue loss error of just 13% is achieved. / Förlorad omsättning till följd av felaktig fakturering ar ett alvarligt problem for vissa företag i service- och reparationsbranchen. Detta kan uppstå på manga satt, till exempel genom konsekvent felaktig prissättning av tjänster. Om ett företag har stor förlust av omsättning ar det otroligt viktigt att upptäcka det, hitta var det sker, och uppskatta storleken av förlusten for att kunna behandla den. Malet med detta arbete ar att hitta statistiska metoder for att identifiera felaktigt prissatta tjänster i ett dataset av fakturor, och uppskatta förlorad omsättning i datasetet. Datasetet som används kommer från ett företag som förlorar omsättning på grund av just felfakturerat pris på tjänster, och representerar därför en verklig instans av detta problem. Ett flertal maskininlärningsmetoder, med olika grader av vägledning, används for att upptäcka felaktiga fakturarader och uppskatta förlorad omsättning i omärkt fakturadata. Regression med neuronnät, och olika beslutstradmetoder såväl som en ensembel av dessa testas och jämförs. Datasetet har sanningsenliga ettiketter till varje rad, därmed kan resultaten jämföras och utvärderas mot korrekta priser. Vi finner att en ensembel av ett neuralnät och ett gradientförstärkt beslutstrad for regression identifierar felaktiga prissättningar mest pålitligt. Pa de 1000 mest sannolika felen har denna metod ratt på 87%, vilket fångar 45% av alla fel. Vidare, med hänsyn till förlorad omsättning finner vi att ett neuralnät som utför regresssion uppnår ett fel på endast 13% i sitt estimat av förlorad omsättning. Machine Learning Anomaly Detection Regression Neural Network Invoice Revenue Loss Maskininlärning Avvikelsedetektion Regression Neuralnät Faktura Omsättningsförlust Computer and Information Sciences Data- och informationsvetenskap
6	Automated advanced analytics on vehicle data using AI / Automatiserad avancerad analys av fordonsdata med hjälp av AI Zhang, Simin January 2020 (has links) The evolution of electrification and autonomous driving on automotive leads to the increasing complexity of the in-vehicle electrical network, which poses a new challenge for testers to do troubleshooting work in massive log files. This thesis project aims to develop a predictive technique for anomaly detection focusing on user function level failures using machine learning technologies.\\ Specifically, it investigates the performance of point anomaly detection models and temporal dependent anomaly detection models on the analysis of Controller Area Network (CAN) data obtained from software-in-loop simulation. For point anomaly detection, the models of Isolation forest, Multivariate normal distribution, and Local outlier factor are implemented respectively. For temporal dependent anomaly detection, the model of an encoder-decoder architecture neural network using Long Short-Temporal Memory (LSTM) units is implemented, so is a stacking hybrid detector in the combination of LSTM Encoder and Local outlier factor.\\ With a comparison of the comprehensive performance of the proposed models, the model of LSTM AutoEncoder is selected for detecting the anomalies on sequential data in CAN logs. The experiment results show promising detection performance of LSTM AutoEncoder on the studied functional failures and suggest that it is possible to be deployed in real-time automated anomaly detection on vehicle systems. / Utvecklingen av elektrifiering och autonom körning på fordon leder till den ökande komplexiteten i fordonets elektriska nätverk, vilket utgör en ny utmaning för testare att göra felsökningsarbete i massiva loggfiler. Detta avhandlings syftar till att utveckla en förutsägbar teknik för detektering av avvikelser med fokus på användarfunktionsnivåfel med maskininlärningstekniker.\\ Specifikt undersöker den prestandan hos punktavvikelsedetekteringsmodeller och tidsberoende anomalidetekteringsmodeller på analysen av data från Controller Area Network (CAN) erhållen från simulering av mjukvara in-loop. För detektion av punktavvikelser implementeras modellerna för Isolation forest, Multivariate normal distribution och Local outlier factor. För temporär beroende anomalidetektering implementeras modellen för ett kodnings-avkodningsarkitekturneuralt nätverk som använder Long Short-Temporal Memory (LSTM) -enheter, så är en stapling hybriddetektor i kombination med LSTM Encoder och Local outlier factor.\\ Med en jämförelse av den omfattande prestandan hos de föreslagna modellerna väljs modellen för LSTM AutoEncoder för att detektera avvikelser på sekventiell data i CAN-loggar. Experimentresultaten visar lovande detektionsprestanda för LSTM AutoEncoder på de studerade funktionella misslyckandena och föreslår att det är möjligt att distribueras i realtid automatiserad anomalidetektering på fordonssystem. Anomaly detection Machine learning Controller Area Network User function level failure Avvikelsedetektion Maskininlärning Controller Area Network Fel på användarfunktionsnivå Vehicle Engineering Farkostteknik
7	Anomaly Detection in Telecom Service Provider Network Infrastructure Security Logs using an LSTM Autoencoder : Leveraging Time Series Patterns for Improved Anomaly Detection / Avvikelsedetektering i säkerhetsloggar för nätverksinfrastruktur hos en telekomtjänstleverantör med en LSTM Autoencoder : Uttnyttjande av tidsseriemönster för förbättrad avvikelsedetektering Vlk, Vendela January 2024 (has links) New regulations are placed on Swedish Telecom Service Providers (TSPs) due to a rising concern for safeguarding network security and privacy in the face of ever-evolving cyber threats. These regulations demand that Swedish telecom companies expand their data security strategies with proactive security measures. Logs, serving as digital footprints in IT infrastructure, play a crucial role in identifying anomalies that could indicate security breaches. Deep Learning (DL) has been used to detect anomalies in logs due to its ability to discern intricate patterns within the data. By leveraging deep learning-based models, it is not only possible to identify anomalies but also to predict and mitigate potential threats within the telecom network. An LSTM autoencoder was implemented to detect anomalies in two separate multivariate temporal log datasets; the BETH cybersecurity dataset, and a Cisco log dataset that was created specifically for this thesis. The empirical results in this thesis show that the LSTM autoencoder reached an ROC AUC of 99.5% for the BETH dataset and 76.6% for the Cisco audit dataset. The use of an additional anomaly detection aid in the Cisco audit dataset let the model reach an ROC AUC of 99.6%. The conclusion that could be drawn from this work was that the systematic approach to developing a deep learning model for anomaly detection in log data was efficient. However, the study’s findings raise crucial considerations regarding the appropriateness of various log data for deep learning models used in anomaly detection. / Nya föreskrifter har införts för svenska telekomtjänsteleverantörer på grund av en ökad angelägenhet av att säkerställa nätverkssäkerhet och integritet inför ständigt föränderliga cyberhot. Dessa föreskrifter kräver att svenska telekomföretag utvidgar sina dataskyddsstrategier med proaktiva säkerhetsåtgärder. Loggar, som fungerar som digitala fotspår inom IT-infrastruktur, spelar en avgörande roll för att identifiera avvikelser som kan tyda på säkerhetsintrång. Djupinlärning har använts för att upptäcka avvikelser i loggar på grund av dess förmåga att urskilja intrikata mönster inom data. Genom att utnyttja modeller baserade på djupinlärning är det inte bara möjligt att identifiera avvikelser utan även att förutsäga samt mildra konsekvenserna av potentiella hot inom telekomnätet. En LSTM-autoencoder implementerades för att upptäcka avvikelser i två separata multivariata tidsserielogguppsättningar; BETH-cybersäkerhetsdatauppsättningen och en Cisco-loggdatauppsättning som skapades specifikt för detta arbete. De empiriska resultaten i denna avhandling visar att LSTM-autoencodern uppnådde en ROC AUC på 99.5% för BETH-datauppsättningen och 76.6% för Cisco-datauppsättningen. Användningen av ett ytterligare avvikelsedetekteringsstöd i Cisco-datauppsättningen möjliggjorde att modellen uppnådde en ROC AUC på 99.6%. Slutsatsen som kunde dras från detta arbete var att den systematiska metoden för att utveckla en djupinlärningsmodell för avvikelsedetektering i loggdata var effektiv. Dock väcker studiens resultat kritiska överväganden angående lämpligheten av olika loggdata för djupinlärningsmodeller som används för avvikelsedetektering. Anomaly detection Deep Learning LSTM Autoencoder Time series Log analysis Avvikelsedetektion Djupinlärning LSTM Autoencoder Tidsserier Logganalys Computer Sciences Datavetenskap (datalogi) Computer Engineering Datorteknik
8	Exploring Integration of Predictive Maintenance using Anomaly Detection : Enhancing Productivity in Manufacturing / Utforska integration av prediktivt underhåll med hjälp av avvikelsedetektering : Förbättra produktiviteten inom tillverkning Bülund, Malin January 2024 (has links) In the manufacturing industry, predictive maintenance (PdM) stands out by leveraging data analytics and IoT technologies to predict machine failures, offering a significant advancement over traditional reactive and scheduled maintenance practices. The aim of this thesis was to examine how anomaly detection algorithms could be utilized to anticipate potential breakdowns in manufacturing operations, while also investigating the feasibility and potential benefits of integrating PdM strategies into a production line. The methodology of this projectconsisted of a literature review, application of machine learning (ML) algorithms, and conducting interviews. Firstly, the literature review provided a foundational basis to explore the benefits of PdM and its impact on production line productivity, thereby shaping the development of interview questions. Secondly, ML algorithms were employed to analyze data and predict equipment failures. The algorithms used in this project were: Isolation Forest (IF), Local Outlier Factor (LOF), Logistic Regression (LR), One-Class Support Vector Machine(OC-SVM) and Random Forest (RF). Lastly, interviews with production line personnel provided qualitative insights into the current maintenance practices and perceptions of PdM. The findings from this project underscore the efficacy of the IF model in identifying potential equipment failures, emphasizing its key role in improving future PdM strategies to enhance maintenance schedules and boost operational efficiency. Insights gained from both literature and interviews underscore the transformative potential of PdM in refining maintenance strategies, enhancing operational efficiency, and minimizing unplanned downtime. More broadly, the successful implementation of these technologies is expected to revolutionize manufacturing processes, driving towards more sustainable and efficient industrial operations. / I tillverkningsindustrin utmärker sig prediktivt underhåll (PdM) genom att använda dataanalys och IoT-teknologier för att förutse maskinfel, vilket erbjuder ett betydande framsteg jämfört med traditionella reaktiva och schemalagda underhållsstrategier. Syftet med denna avhandling var att undersöka hur algoritmer för avvikelsedetektering kunde användas för att förutse potentiella haverier i tillverkningsoperationer, samtidigt som genomförbarheten och de potentiella fördelarna med att integrera PdM-strategier i en produktionslinje undersöktes. Metodologin för detta projekt bestod av en litteraturöversikt, tillämpning av maskininlärningsalgoritmer (ML) och genomförande av intervjuer. Först och främst gav litteraturöversikten en grundläggande bas för att utforska fördelarna med PdM och dess inverkan på produktionslinjens produktivitet, vilket därmed påverkade utformningen av intervjufrågorna. För det andra användes ML-algoritmer för att analysera data och förutsäga utrustningsfel. Algoritmerna som användes i detta projekt var: Isolation Forest (IF), Local Outlier Factor (LOF), Logistic Regression (LR), One-Class Support Vector Machine (OCSVM) och Random Forest (RF). Slutligen gav intervjuer med produktionslinjepersonal kvalitativa insikter i de nuvarande underhållsstrategierna och uppfattningarna om PdM.Resultaten från detta projekt understryker effektiviteten hos IF-modellen för att identifiera potentiella utrustningsfel, vilket betonar dess centrala roll i att förbättra framtida PdM-strategier för att förbättra underhållsscheman och öka den operativa effektiviteten. Insikter vunna från både litteratur och intervjuer understryker PdM:s transformativa potential att finslipa underhållsstrategier, öka operativ effektivitet och minimera oplanerade driftstopp. Mer generellt förväntas den framgångsrika implementeringen av dessa teknologier revolutionera tillverkningsprocesser och driva mot mer hållbara och effektiva industriella operationer. Anomaly Detection Isolation Forest Local Outlier Factor Logistic Regression One-Class Support Vector Machine Predictive Maintenance Random Forest. Avvikelsedetektion Isolation Forest Local Outlier Factor Logistic Regression One-Class Support Vector Machine Prediktivt Underhåll Random Forest. Medical Engineering Medicinteknik Transport Systems and Logistics Transportteknik och logistik Other Medical Engineering Annan medicinteknik

Search results