481

Applied Machine Learning Predicts the Postmortem Interval from the Metabolomic Fingerprint

Arpe, Jenny January 2024 (has links)
In forensic autopsies, accurately estimating the postmortem interval (PMI) is crucial. Traditional methods, relying on physical parameters and police data, often lack precision, particularly once more than about two days have passed since death. New methods increasingly focus on analyzing postmortem metabolomics in biological systems, which act as a 'fingerprint' of ongoing processes influenced by internal and external molecules. By carefully analyzing these metabolomic profiles, which span a diverse range of information from events preceding death to postmortem changes, there is potential to provide more accurate estimates of the PMI. Until recently, the limited availability of real human data has hindered comprehensive investigation. Large-scale metabolomic data collected by the National Board of Forensic Medicine (RMV, Rättsmedicinalverket) presents a unique opportunity for predictive analysis in forensic science, enabling innovative approaches to improving PMI estimation. However, the metabolomic data is large, complex, and potentially nonlinear, making it difficult to interpret. This underscores the importance of effectively employing machine learning algorithms to manage metabolomic data for PMI prediction, the primary focus of this project. In this study, a dataset consisting of 4,866 human samples and 2,304 metabolites from the RMV was used to train models capable of predicting the PMI. Random Forest (RF) and Artificial Neural Network (ANN) models were employed for PMI prediction. Furthermore, feature selection and the incorporation of sex and age into the model were explored to improve the neural network's performance. This master's thesis shows that the ANN consistently outperforms RF in PMI estimation, achieving an R2 of 0.68 and an MAE of 1.51 days compared to RF's R2 of 0.43 and MAE of 2.0 days across the entire PMI interval. Additionally, feature selection indicates that only 35% of the metabolites are needed to maintain comparable predictive accuracy. Furthermore, Principal Component Analysis (PCA) reveals that these informative metabolites are primarily located within a specific cluster on the first and second principal components (PCs), suggesting a need for further research into the biological context of these metabolites. In conclusion, the dataset has proven valuable for predicting PMI. This indicates significant potential for employing machine learning models in PMI estimation, thereby assisting forensic pathologists in determining the time of death. Notably, the model shows promise in surpassing current methods and filling crucial gaps in the field, representing an important step towards accurate PMI estimation in forensic practice. This project suggests that machine learning will play a central role in assisting with determining time since death in the future.
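As a rough illustration of the workflow this abstract describes, the sketch below compares a Random Forest and a neural network regressor and then applies importance-based feature selection, using a synthetic matrix in place of the RMV metabolomics data; the dimensions, hyperparameters, and selection method are assumptions, not the thesis's actual setup.

```python
# A minimal sketch (not the thesis code) of the RF-vs-ANN comparison and
# importance-based feature selection described above, using synthetic data
# in place of the RMV metabolomics matrix.
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.neural_network import MLPRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_absolute_error, r2_score

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 200))            # placeholder for samples x metabolites
y = X[:, :20].sum(axis=1) + rng.normal(scale=2.0, size=1000)  # placeholder PMI (days)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

rf = RandomForestRegressor(n_estimators=200, random_state=0).fit(X_tr, y_tr)
ann = MLPRegressor(hidden_layer_sizes=(128, 64), max_iter=500,
                   random_state=0).fit(X_tr, y_tr)

for name, model in [("RF", rf), ("ANN", ann)]:
    pred = model.predict(X_te)
    print(name, "R2=%.2f" % r2_score(y_te, pred),
          "MAE=%.2f days" % mean_absolute_error(y_te, pred))

# Keep only the most informative 35% of features, as the thesis suggests is
# sufficient; here they are ranked by RF impurity importance.
k = int(0.35 * X.shape[1])
top = np.argsort(rf.feature_importances_)[::-1][:k]
ann_sel = MLPRegressor(hidden_layer_sizes=(128, 64), max_iter=500,
                       random_state=0).fit(X_tr[:, top], y_tr)
print("ANN on top 35%% features: R2=%.2f" %
      r2_score(y_te, ann_sel.predict(X_te[:, top])))
```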
482

Data mining and predictive analytics application on cellular networks to monitor and optimize quality of service and customer experience

Muwawa, Jean Nestor Dahj 11 1900 (has links)
This research study focuses on the application models of Data Mining and Machine Learning covering cellular network traffic, with the objective of arming Mobile Network Operators (MNOs) with a full view of performance branches (Services, Device, Subscribers). The purpose is to optimize and minimize the time needed to detect service and subscriber behaviour patterns. Different data mining techniques and predictive algorithms have been applied to real cellular network datasets to uncover different data usage patterns using specific Key Performance Indicators (KPIs) and Key Quality Indicators (KQIs). The following tools are used to develop the concept: RStudio for Machine Learning and process visualization, Apache Spark and SparkSQL for data and big data processing, and clicData for service visualization. Two use cases have been studied during this research. In the first study, the processes of data and predictive analytics are fully applied in the field of telecommunications to efficiently address users' experience, with the goal of increasing customer loyalty and decreasing churn (customer attrition). Using real cellular network transactions, prediction analytics are used to predict customers who are likely to churn, which can result in revenue loss. Prediction algorithms and models including Classification Tree, Random Forest, Neural Networks and Gradient Boosting have been used together with exploratory data analysis to determine relationships between predictor variables. The data is segmented into two sets: a training set to train the model and a testing set to test it. The evaluation of the best performing model is based on the prediction accuracy, sensitivity, specificity and the confusion matrix on the test set. The second use case analyses Service Quality Management using modern data mining techniques and the advantages of in-memory big data processing with Apache Spark and SparkSQL to save cost on tool investment; thus, a low-cost Service Quality Management model is proposed and analyzed. With the increase in smartphone adoption and access to mobile internet services, applications such as streaming and interactive chat require a certain service level to ensure customer satisfaction. As a result, an SQM framework is developed with a Service Quality Index (SQI) and Key Performance Index (KPI). The research concludes with recommendations and future studies around modern technology applications in telecommunications, including Internet of Things (IoT), Cloud and recommender systems. / Cellular networks have evolved and are still evolving, from traditional circuit-switched GSM (Global System for Mobile Communication), which only supported voice services and extremely low data rates, to all-packet LTE networks accommodating high-speed data used for various service applications such as video streaming, video conferencing and heavy torrent downloads; and, in the near future, the roll-out of fifth generation (5G) cellular networks, intended to support complex technologies such as IoT (Internet of Things) and high-definition video streaming and projected to cater for massive amounts of data. With high demand on network services and easy access to mobile phones, billions of transactions are performed by subscribers. The transactions appear in the form of SMSs, handovers, voice calls, web browsing activities, video and audio streaming, and heavy downloads and uploads.
Nevertheless, the rapid growth in data traffic and the high requirements of new services introduce bigger challenges for Mobile Network Operators (MNOs) in analysing the big data traffic flowing in the network. Therefore, Quality of Service (QoS) and Quality of Experience (QoE) turn into a challenge. Inefficiency in mining and analysing data and in applying predictive intelligence to network traffic can produce a high rate of unhappy customers or subscribers, loss of revenue and a negative perception of services. Researchers and service providers are investing in data mining, machine learning and AI (Artificial Intelligence) methods to manage services and experience. This research study focuses on the application models of Data Mining and Machine Learning covering network traffic, with the objective of arming Mobile Network Operators with a full view of performance branches (Services, Device, Subscribers). The purpose is to optimize and minimize the time needed to detect service and subscriber behaviour patterns. Different data mining techniques and predictive algorithms are applied to cellular network datasets to uncover different data usage patterns using specific Key Performance Indicators (KPIs) and Key Quality Indicators (KQIs). The following tools are used to develop the concept: RStudio for Machine Learning, Apache Spark and SparkSQL for data processing, and clicData for visualization. / Electrical and Mining Engineering / M. Tech (Electrical Engineering)
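The thesis builds its churn use case in RStudio and Spark; the hedged Python sketch below only illustrates the evaluation approach it describes (confusion matrix, accuracy, sensitivity, specificity) on synthetic, illustrative KPI/KQI features rather than real cellular network transactions.

```python
# A hedged sketch of the churn-prediction evaluation described above; the
# feature names and data are illustrative stand-ins, not the real dataset.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import confusion_matrix, accuracy_score

rng = np.random.default_rng(1)
n = 5000
X = np.column_stack([
    rng.exponential(200, n),    # e.g. monthly data volume (illustrative KPI)
    rng.poisson(30, n),         # e.g. dropped-call count (illustrative KPI)
    rng.uniform(0, 24, n),      # e.g. mean session length (illustrative KQI)
])
churn = (X[:, 1] + rng.normal(0, 5, n) > 35).astype(int)  # synthetic churn label

X_tr, X_te, y_tr, y_te = train_test_split(X, churn, test_size=0.3, random_state=1)
clf = RandomForestClassifier(n_estimators=300, random_state=1).fit(X_tr, y_tr)
pred = clf.predict(X_te)

tn, fp, fn, tp = confusion_matrix(y_te, pred).ravel()
print("accuracy   ", accuracy_score(y_te, pred))
print("sensitivity", tp / (tp + fn))   # true positive rate on churners
print("specificity", tn / (tn + fp))   # true negative rate on non-churners
```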
483

Snow depth measurements and predictions : Reducing environmental impact for artificial grass pitches at snowfall

Forsblom, Findlay, Ulvatne, Lars Petter January 2020 (has links)
Rubber granulates, used at artificial grass pitches, pose a threat to the environment when they leak into nature. As the granulates leak into the environment through rain water and snow clearances, they can be transported by rivers and later end up in marine life. Reducing snow clearances to a minimum is therefore important. If the snow clearance problem is minimized or even eliminated, this will have a positive impact on the surrounding nature. The objective of this project is to propose a method for deciding when to remove snow and to automate the dispersal of information when a pitch is cleared or closed. This includes finding low-powered sensors to measure snow depth, finding a machine learning model to predict upcoming snow levels, and creating an application with a clear and easy-to-use interface to present weather information and disperse information to the responsible persons. Controlled experiments are used to find the models and sensors that are suitable to solve this problem. The sensors are tested on a single snow quality, where ultrasonic and infrared sensors are found suitable. However, fabricated tests with newly fallen snow called into question the possibility of measuring snow depth with the ultrasonic sensor in the general case. Random Forest is presented as the machine learning model that predicts future snow levels with the highest accuracy. A survey indicates that the web application fulfills the intended functionalities, with some improvements suggested.
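A minimal sketch of how a trained Random Forest might feed the clearance decision described above; the feature names, threshold, and data are illustrative assumptions, not the project's actual sensors or implementation.

```python
# A minimal sketch, under assumed feature names, of Random Forest snow-level
# prediction feeding a clearance decision; the data here is synthetic.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(2)
# Illustrative features: [air temperature (C), precipitation (mm), current snow depth (cm)]
X = rng.normal(loc=[0.0, 2.0, 5.0], scale=[5.0, 2.0, 3.0], size=(800, 3))
# Synthetic target: snow depth a few hours ahead
y = np.clip(X[:, 2]
            + 0.8 * np.clip(X[:, 1], 0, None) * (X[:, 0] < 0)
            - 0.3 * np.clip(X[:, 0], 0, None), 0, None)

model = RandomForestRegressor(n_estimators=200, random_state=2).fit(X, y)

CLEARANCE_THRESHOLD_CM = 8.0   # assumed limit before the pitch must be cleared
forecast = np.array([[-3.0, 6.0, 4.0]])          # assumed upcoming conditions
predicted_depth = model.predict(forecast)[0]
if predicted_depth > CLEARANCE_THRESHOLD_CM:
    print("Notify caretakers: clearing likely needed (%.1f cm predicted)" % predicted_depth)
else:
    print("No clearing needed (%.1f cm predicted)" % predicted_depth)
```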
484

Price Prediction of Vinyl Records Using Machine Learning Algorithms

Johansson, David January 2020 (has links)
Machine learning algorithms have been used for price prediction within several application areas. Examples include real estate, the stock market, tourist accommodation, electricity, art, cryptocurrencies, and fine wine. Common approaches in studies are to evaluate the accuracy of predictions and compare different algorithms, such as Linear Regression or Neural Networks. There is a thriving global second-hand market for vinyl records, but research on price prediction in this area is very limited. The purpose of this project was to build on existing knowledge of price prediction in general and to evaluate some aspects of price prediction for vinyl records. That included investigating the achievable level of accuracy and comparing the efficiency of algorithms. A dataset of 37,000 samples of vinyl records was created with data from the Discogs website, and multiple machine learning algorithms were utilized in a controlled experiment. Among the conclusions drawn from the results were that the Random Forest algorithm generally generated the strongest results, that results can vary substantially between different artists or genres, and that a large part of the predictions had a good accuracy level, but that a relatively small number of large errors had a considerable effect on the overall results.
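The sketch below illustrates the kind of algorithm comparison the study performs, scoring Linear Regression against Random Forest by mean absolute error; the features and prices are synthetic stand-ins, not the Discogs-derived dataset.

```python
# An illustrative sketch of comparing regression algorithms for price
# prediction; the features and prices are synthetic, not real record data.
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(3)
n = 2000
X = np.column_stack([
    rng.integers(1950, 2020, n),   # release year (illustrative feature)
    rng.integers(1, 10, n),        # media condition grade (illustrative feature)
    rng.poisson(50, n),            # demand indicator, e.g. wantlist count (illustrative)
])
price = 5 + 0.5 * X[:, 1] + 0.2 * X[:, 2] + rng.normal(0, 3, n)

for name, model in [("Linear Regression", LinearRegression()),
                    ("Random Forest", RandomForestRegressor(n_estimators=200, random_state=3))]:
    mae = -cross_val_score(model, X, price, cv=5,
                           scoring="neg_mean_absolute_error").mean()
    print(f"{name}: MAE {mae:.2f}")
```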
485

Time series monitoring and prediction of data deviations in a manufacturing industry

Lantz, Robin January 2020 (has links)
An automated manufacturing industry makes use of many interacting moving parts and sensors. Data from these sensors generate complex multidimensional data in the production environment. This data is difficult to interpret and difficult to find patterns in. This project provides tools to get a deeper understanding of the production data of Swedsafe, a company involved in automated manufacturing, and shows the potential of this multidimensional production data. The project mainly consists of predicting deviations from predefined threshold values in Swedsafe's production data. Machine learning is a good method for finding relationships in complex datasets, and supervised machine learning classification is used to predict deviations from threshold values in the data. An investigation is conducted to identify the classifier that performs best on Swedsafe's production data. The sliding window technique is used to manage the time series data in this project. Apart from predicting deviations, the project also includes an implementation of live graphs to easily get an overview of the production data. A steady production with stable process values is important, so being able to monitor and predict events in the production environment can provide the same benefit for other manufacturing companies and is therefore suitable not only for Swedsafe. The best performing machine learning classifier tested in this project was the Random Forest classifier. The Multilayer Perceptron did not perform well on Swedsafe's data, but further investigation into recurrent neural networks using LSTM neurons is recommended. During the project, a web-based application displaying the sensor data in live graphs was also developed.
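Below is a minimal sketch of the sliding-window classification idea described above, with a synthetic sensor signal and an assumed deviation threshold standing in for Swedsafe's production data.

```python
# A minimal sketch of sliding-window deviation prediction: windows of recent
# sensor values are used to classify whether the next value exceeds a
# predefined threshold. Signal and threshold are synthetic assumptions.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import classification_report

rng = np.random.default_rng(4)
series = np.cumsum(rng.normal(0, 1, 3000)) + rng.normal(0, 0.5, 3000)  # synthetic sensor signal
THRESHOLD = 5.0        # assumed deviation threshold
WINDOW = 20            # number of past samples per window

X = np.array([series[i:i + WINDOW] for i in range(len(series) - WINDOW)])
y = (np.abs(series[WINDOW:]) > THRESHOLD).astype(int)   # 1 = value after the window deviates

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, shuffle=False)
clf = RandomForestClassifier(n_estimators=200, random_state=4).fit(X_tr, y_tr)
print(classification_report(y_te, clf.predict(X_te), zero_division=0))
```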
486

Optimalizace strojového učení pro predikci KPI / Machine Learning Optimization of KPI Prediction

Haris, Daniel January 2018 (has links)
This thesis aims to optimize the machine learning algorithms used for predicting KPI metrics for an organization. The organization uses machine learning to predict whether projects will meet the planned deadlines of the last phase of the development process. The work focuses on the analysis of prediction models and sets the goal of selecting new candidate models for the prediction system. We have implemented a system that automatically selects the best feature variables for learning. Trained models were evaluated with several performance metrics, and the best candidates were chosen for prediction. The candidate models achieved higher accuracy, which means that the prediction system provides more reliable responses. We also suggested other improvements that could increase the accuracy of the forecast.
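As a hedged illustration of automatic feature selection combined with candidate-model evaluation, the sketch below wraps a univariate selector and each candidate model in a pipeline and scores them by cross-validation; the data, selector, and candidates are assumptions, not the thesis's actual system.

```python
# A hedged sketch of automatic feature selection plus candidate-model
# comparison; synthetic data stands in for the organization's project records.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline

X, y = make_classification(n_samples=600, n_features=40, n_informative=8,
                           random_state=5)   # stand-in for project/KPI features

candidates = {
    "logistic regression": LogisticRegression(max_iter=1000),
    "random forest": RandomForestClassifier(n_estimators=200, random_state=5),
}
for name, model in candidates.items():
    # Automatically keep the k most informative features, then score the candidate.
    pipe = make_pipeline(SelectKBest(f_classif, k=10), model)
    score = cross_val_score(pipe, X, y, cv=5, scoring="accuracy").mean()
    print(f"{name}: mean CV accuracy {score:.3f}")
```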
487

Performance Comparison of Public Bike Demand Predictions: The Impact of Weather and Air Pollution

Min Namgung (9380318) 15 December 2020 (has links)
Many metropolitan cities motivate people to use public bike-sharing programs as alternative transportation for many reasons. Due to their popularity, research on optimizing public bike-sharing systems has been conducted at the city, neighborhood, station, or user level to predict public bike demand. Previous research on public bike demand prediction primarily focused on discovering a relationship with weather as an external factor that possibly impacted bike usage, or on analyzing the bike user trend in one aspect. This work hypothesizes two external factors that are likely to affect public bike demand: weather and air pollution. The study uses a public bike dataset, daily temperature, precipitation data, and air condition data to discover the trend of bike usage using multiple machine learning techniques such as Decision Tree, Naïve Bayes, and Random Forest. Each algorithm's output is then evaluated with performance measures such as accuracy, precision, and sensitivity. As a result, Random Forest is an efficient classifier for bike demand prediction based on weather and precipitation, while Decision Tree performs best for bike demand prediction based on air pollutants. Also, the three-class labeling of daily bike demand has high specificity and makes it easy to trace the trend of the public bike system.
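The sketch below illustrates the kind of three-class demand classification and algorithm comparison described above, using synthetic weather and air-quality features; the variables and class boundaries are assumptions, not the study's datasets.

```python
# An illustrative sketch comparing Decision Tree, Naive Bayes, and Random
# Forest on a synthetic three-class bike demand problem.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(6)
n = 1500
X = np.column_stack([
    rng.normal(15, 10, n),    # daily mean temperature (C), illustrative
    rng.exponential(3, n),    # precipitation (mm), illustrative
    rng.normal(35, 15, n),    # PM2.5 concentration, illustrative
])
demand = X[:, 0] - 2 * X[:, 1] - 0.3 * X[:, 2] + rng.normal(0, 5, n)
y = np.digitize(demand, np.quantile(demand, [1/3, 2/3]))   # low / medium / high

for name, clf in [("Decision Tree", DecisionTreeClassifier(random_state=6)),
                  ("Naive Bayes", GaussianNB()),
                  ("Random Forest", RandomForestClassifier(n_estimators=200, random_state=6))]:
    acc = cross_val_score(clf, X, y, cv=5, scoring="accuracy").mean()
    print(f"{name}: mean accuracy {acc:.3f}")
```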
488

Peak shaving optimisation in school kitchens : A machine learning approach

Alhoush, George, Edvardsson, Emil January 2022 (has links)
With the increasing electrification of today's society, the electrical grid is experiencing increasing pressure from demand. One factor that affects the stability of the grid is the time intervals at which power demand is at its highest, referred to as peak demand. This project was conducted to reduce peak demand through a process called peak shaving, relieving some of this pressure through the use of batteries and renewable energy. By doing so, the users of such systems could reduce the installation cost of their electrical infrastructure as well as their electricity bills. Peak shaving in this project was implemented using machine learning algorithms that predicted the daily power consumption in school kitchens with the help of their food menus; the predictions were then fed to an algorithm that steers a battery according to the results. The project's findings are compared to another system installed by a company to decide whether the algorithm has the right accuracy and performance. The results of the simulations were promising, as the algorithm was able to detect the vast majority of the peaks and perform peak shaving intelligently. Based on the graphs and values presented in this report, it can be concluded that the algorithm is ready to be implemented in the real world, with the potential to contribute to a long-term sustainable electrical grid while saving money for the user.
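A minimal sketch of the battery-steering step described above: given a predicted consumption profile, the battery discharges whenever demand exceeds a peak limit and recharges below it. The profile, capacity, and limits are assumed values, not the project's actual system.

```python
# A minimal peak-shaving sketch with an assumed hourly load forecast and
# assumed battery parameters; not the thesis's steering algorithm.
import numpy as np

predicted_load_kw = np.array([20, 22, 35, 60, 75, 80, 55, 30, 25, 40, 65, 30], float)
PEAK_LIMIT_KW = 50.0       # assumed target after peak shaving
CAPACITY_KWH = 60.0        # assumed battery capacity
MAX_POWER_KW = 30.0        # assumed charge/discharge limit
soc = CAPACITY_KWH         # start fully charged; 1-hour time steps

shaved = []
for load in predicted_load_kw:
    if load > PEAK_LIMIT_KW:
        # Discharge to pull grid demand back toward the peak limit.
        discharge = min(load - PEAK_LIMIT_KW, MAX_POWER_KW, soc)
        soc -= discharge
        shaved.append(load - discharge)
    else:
        # Recharge without pushing total demand above the peak limit.
        charge = min(PEAK_LIMIT_KW - load, MAX_POWER_KW, CAPACITY_KWH - soc)
        soc += charge
        shaved.append(load + charge)

print("original peak:", predicted_load_kw.max(), "kW")
print("shaved peak:  ", max(shaved), "kW")
```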
489

Radar based tank level measurement using machine learning : Agricultural machines / Nivåmätning av tank med radar sensorer och maskininlärning

Thorén, Daniel January 2021 (has links)
Agriculture is becoming more dependent on computerized solutions to make the farmer's job easier. The big step that many companies are working towards is fully autonomous vehicles that work the fields. To that end, the equipment fitted to said vehicles must also adapt and become autonomous. Making this equipment autonomous takes many incremental steps, one of which is developing an accurate and reliable tank level measurement system. In this thesis, a system for tank level measurement in a seed planting machine is evaluated. Traditional systems use load cells to measure the weight of the tank; however, these types of systems are expensive to build and cumbersome to repair. They also add a lot of weight to the equipment, which increases the fuel consumption of the tractor. Thus, this thesis investigates the use of radar sensors together with a number of Machine Learning algorithms. Fourteen radar sensors are fitted to a tank at different positions, data is collected, and a preprocessing method is developed. Then, the data is used to test the following Machine Learning algorithms: Bagged Regression Trees (BG), Random Forest Regression (RF), Boosted Regression Trees (BRT), Linear Regression (LR), Linear Support Vector Machine (L-SVM), and Multi-Layer Perceptron Regressor (MLPR). The model with the best 5-fold cross-validation scores was Random Forest, closely followed by Boosted Regression Trees. A robustness test, using 5 previously unseen scenarios, revealed that the Boosted Regression Trees model was the most robust. The radar position analysis showed that 6 sensors together with the MLPR model gave the best RMSE scores. In conclusion, the models performed well on this type of system, which shows that they might be a competitive alternative to load cell based systems.
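The hedged sketch below mirrors the 5-fold cross-validation comparison described above across similar model families, using synthetic stand-ins for the fourteen radar readings; the data generation and hyperparameters are assumptions, not the thesis's setup.

```python
# A hedged sketch of a 5-fold CV comparison of regressors for tank level
# estimation; radar readings and levels here are synthetic placeholders.
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPRegressor
from sklearn.svm import LinearSVR

rng = np.random.default_rng(7)
n_samples, n_sensors = 500, 14
fill_level = rng.uniform(0, 1, n_samples)                        # true tank level (fraction)
X = fill_level[:, None] * rng.uniform(0.5, 1.5, (1, n_sensors)) \
    + rng.normal(0, 0.05, (n_samples, n_sensors))                # noisy synthetic radar features

models = {
    "Random Forest": RandomForestRegressor(n_estimators=200, random_state=7),
    "Boosted Trees": GradientBoostingRegressor(random_state=7),
    "Linear Regression": LinearRegression(),
    "Linear SVM": LinearSVR(max_iter=10000),
    "MLP": MLPRegressor(hidden_layer_sizes=(64, 32), max_iter=2000, random_state=7),
}
for name, model in models.items():
    rmse = -cross_val_score(model, X, fill_level, cv=5,
                            scoring="neg_root_mean_squared_error").mean()
    print(f"{name}: mean 5-fold RMSE {rmse:.3f}")
```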
490

Prognostics for Condition Based Maintenance of Electrical Control Units Using On-Board Sensors and Machine Learning

Fredriksson, Gabriel January 2022 (has links)
In this thesis it has been studied how operational and workshop data can be used to improve the handling of field quality (FQ) issues for electronic units. This was done by analysing how failure rates can be predicted, how failure mechanisms can be detected, and how data-based lifetime models could be developed. The work has been done on an electronic control unit (ECU) that has been subject to a field quality issue, where thermomechanical stress on the solder joints of the BGAs (Ball Grid Arrays) on the PCBAs (printed circuit board assemblies) was determined to be the main cause of failure. The project is divided into two parts. In part one, "PCBA", a laboratory study on the effects of thermomechanical cycling on solder joints for different electrical components of the PCBAs is conducted. The second part, "ECU", is the main part of the project, investigating data-driven solutions using operational and workshop history data. The results from part one show that the Weibull distribution commonly used to predict lifetimes of electrical components works well to describe the laboratory results, but also that non-parametric methods such as kernel distributions can give good results. In part two, when the Weibull, Gamma and Normal distributions were tested on the real ECU data, it is shown that none of them describe the data well. However, when random forest is used to develop data-based models, most of the ECU lifetimes in a separate test dataset can be correctly predicted within a half-year margin. Further, using random survival forest it was possible to produce a model with an out-of-bag (OOB) prediction error of just 0.06. This shows that machine learning methods could potentially be used for condition based maintenance of ECUs.
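As an illustration of the Weibull lifetime modelling mentioned above, the sketch below fits a two-parameter Weibull distribution to synthetic failure times with SciPy; the data and parameter values are assumptions, not the real ECU or laboratory results.

```python
# A minimal sketch of Weibull lifetime fitting on synthetic failure times,
# standing in for the ECU/workshop data described above.
from scipy import stats

failure_times_years = stats.weibull_min.rvs(c=1.8, scale=5.0, size=300,
                                             random_state=8)   # synthetic lifetimes

# Fit a two-parameter Weibull (location fixed at zero, as is common for lifetimes).
shape, loc, scale = stats.weibull_min.fit(failure_times_years, floc=0)
print(f"fitted shape (beta) = {shape:.2f}, scale (eta) = {scale:.2f} years")

# Probability that a unit survives past 5 years under the fitted model.
print("P(lifetime > 5 years) =", 1 - stats.weibull_min.cdf(5.0, shape, loc, scale))
```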
