• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 339
  • 26
  • 21
  • 13
  • 8
  • 5
  • 5
  • 5
  • 4
  • 3
  • 2
  • 1
  • 1
  • 1
  • 1
  • Tagged with
  • 507
  • 507
  • 272
  • 270
  • 147
  • 135
  • 129
  • 128
  • 113
  • 92
  • 88
  • 77
  • 76
  • 74
  • 59
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
391

Malicious Activity Detection in Encrypted Network Traffic using A Fully Homomorphic Encryption Method

Adiyodi Madhavan, Resmi, Sajan, Ann Zenna January 2022 (has links)
Everyone is in need for their own privacy and data protection, since encryption transmission was becoming common. Fully Homomorphic Encryption (FHE) has received increased attention because of its capability to execute calculations over the encoded domain. Through using FHE approach, model training can be properly outsourced. The goal of FHE is to enable computations on encrypted files without decoding aside from the end outcome. The CKKS scheme is used in FHE.Network threats are serious danger to credential information, which enable an unauthorised user to extract important and sensitive data by evaluating the information of computations done on raw data. Thus the study provided an efficient solution to the problem of privacy protection in data-driven applications using Machine Learning. The study used an encrypted NSL KDD dataset. Machine learning-based techniques have emerged as a significant trend for detecting malicious attack. Thus, Random Forest (RF) is proposed for the detection of malicious attacks on Homomorphic encrypted data in the cloud server. Logistic Regression (LR) machine learning model is used to predict encrypted data on cloud server. Regardless of the distributed setting, the technique may retain the accuracy and integrity of the previous methods to obtain the final results.
392

Epigenetic Responses of Arabidopsis to Abiotic Stress

Laliberte, Suzanne Rae 17 March 2023 (has links)
Weed resistance to control measures, particularly herbicides, is a growing problem in agriculture. In the case of herbicides, resistance is sometimes connected to genetic changes that directly affect the target site of the herbicide. Other cases are less straightforward where resistance arises without such a clear-cut mechanism. Understanding the genetic and gene regulatory mechanisms that may lead to the rapid evolution of resistance in weedy species is critical to securing our food supply. To study this phenomenon, we exposed young Arabidopsis plants to sublethal levels of one of four weed management stressors, glyphosate herbicide, trifloxysulfuron herbicide, mechanical clipping, and shading. To evaluate responses to these stressors we collected data on gene expression and regulation via epigenetic modification (methylation) and small RNA (sRNA). For all of the treatments except shade, the stress was limited in duration, and the plants were allowed to recover until flowering, to identify changes that persist to reproduction. At flowering, DNA for methylation bisulfite sequencing, RNA, and sRNA were extracted from newly formed rosette leaf tissue. Analyzing the individual datasets revealed many differential responses when compared to the untreated control for gene expression, methylation, and sRNA expression. All three measures showed increases in differential abundance that were unique to each stressor, with very little overlap between stressors. Herbicide treatments tended to exhibit the largest number of significant differential responses, with glyphosate treatment most often associated with the greatest differences and contributing to overlap. To evaluate how large datasets from methylation, gene expression, and sRNA analyses could be connected and mined to link regulatory information with changes in gene expression, the information from each dataset and for each gene was united in a single large matrix and mined with classification algorithms. Although our models were able to differentiate patterns in a set of simulated data, the raw datasets were too noisy for the models to consistently identify differentially expressed genes. However, by focusing on responses at a local level, we identified several genes with differential expression, differential sRNA, and differential methylation. While further studies will be needed to determine whether these epigenetic changes truly influence gene expression at these sites, the changes detected at the treatment level could prime the plants for future incidents of stress, including herbicides. / Doctor of Philosophy / Growing resistance to herbicides, particularly glyphosate, is one of the many problems facing agriculture. The rapid rise of resistance across herbicide classes has caused some to wonder if there is a mechanism of adaptation that does not involve mutations. Epigenetics is the study of changes in the phenotype that cannot be attributed to changes in the genotype. Typically, studies revolve around two features of the chromosomes: cytosine methylation and histone modifications. The former can influence how proteins interact with DNA, and the latter can influence protein access to DNA. Both can affect each other in self-reinforcing loops. They can affect gene expression, and DNA methylation can be directed by small RNA (sRNA), which can also influence gene expression through other pathways. To study these processes and their role in abiotic stress response, we aimed to analyze sRNA, RNA, and DNA from Arabidopsis thaliana plants under stress. The stresses applied were sublethal doses of the herbicides, glyphosate and trifloxysulfuron, as well as mechanical clipping and shade to represent other weed management stressors. The focus of the project was to analyze these responses individually and together to find epigenetic responses to stresses routinely encountered by weeds. We tested RNA for gene expression changes under our stress conditions and identified many, including some pertaining to DNA methylation regulation. The herbicide treatments were associated with upregulated defense genes and downregulated growth genes. Shade treated plants had many downregulated defense and other stress response genes. We also detected differential methylation and sRNA responses when compared to the control plants. Changes to methylation and sRNA only accounted for about 20% of the variation in gene expression. While attempting to link the epigenetic process of methylation to gene expression, we connected all the data sets and developed computer programs to try to make correlations. While these methods worked on a simulated dataset, we did not detect broad patterns of changes to epigenetic pathways that correlated strongly with gene expression in our experiment's data. There are many factors that can influence gene expression that could create noise that would hinder the algorithms' abilities to detect differentially expressed genes. This does not, however, rule out the possibility of epigenetic influence on gene expression in local contexts. Through scoring the traits of individual genes, we found several that interest us for future studies.
393

Real-Time Estimation of Traffic Stream Density using Connected Vehicle Data

Aljamal, Mohammad Abdulraheem 02 October 2020 (has links)
The macroscopic measure of traffic stream density is crucial in advanced traffic management systems. However, measuring the traffic stream density in the field is difficult since it is a spatial measurement. In this dissertation, several estimation approaches are developed to estimate the traffic stream density on signalized approaches using connected vehicle (CV) data. First, the dissertation introduces a novel variable estimation interval that allows for higher estimation precision, as the updating time interval always contains a fixed number of CVs. After that, the dissertation develops model-driven approaches, such as a linear Kalman filter (KF), a linear adaptive KF (AKF), and a nonlinear Particle filter (PF), to estimate the traffic stream density using CV data only. The proposed model-driven approaches are evaluated using empirical and simulated data, the former of which were collected along a signalized approach in downtown Blacksburg, VA. Results indicate that density estimates produced by the linear KF approach are the most accurate. A sensitivity of the estimation approaches to various factors including the level of market penetration (LMP) of CVs, the initial conditions, the number of particles in the PF approach, traffic demand levels, traffic signal control methods, and vehicle length is presented. Results show that the accuracy of the density estimate increases as the LMP increases. The KF is the least sensitive to the initial traffic density estimate, while the PF is the most sensitive to the initial traffic density estimate. The results also demonstrate that the proposed estimation approaches work better at higher demand levels given that more CVs exist for the same LMP scenario. For traffic signal control methods, the results demonstrate a higher estimation accuracy for fixed traffic signal timings at low traffic demand levels, while the estimation accuracy is better when the adaptive phase split optimizer is activated for high traffic demand levels. The dissertation also investigates the sensitivity of the KF estimation approach to vehicle length, demonstrating that the presence of longer vehicles (e.g. trucks) in the traffic link reduces the estimation accuracy. Data-driven approaches are also developed to estimate the traffic stream density, such as an artificial neural network (ANN), a k-nearest neighbor (k-NN), and a random forest (RF). The data-driven approaches also utilize solely CV data. Results demonstrate that the ANN approach outperforms the k-NN and RF approaches. Lastly, the dissertation compares the performance of the model-driven and the data-driven approaches, showing that the ANN approach produces the most accurate estimates. However, taking into consideration the computational time needed to train the ANN approach, the large amount of data needed, and the uncertainty in the performance when new traffic behaviors are observed (e.g., incidents), the use of the linear KF approach is highly recommended in the application of traffic density estimation due to its simplicity and applicability in the field. / Doctor of Philosophy / Estimating the number of vehicles (vehicle counts) on a road segment is crucial in advanced traffic management systems. However, measuring the number of vehicles on a road segment in the field is difficult because of the need for installing multiple detection sensors in that road segment. In this dissertation, several estimation approaches are developed to estimate the number of vehicles on signalized roadways using connected vehicle (CV) data. The CV is defined as the vehicle that can share its instantaneous location every time t. The dissertation develops model-driven approaches, such as a linear Kalman filter (KF), a linear adaptive KF (AKF), and a nonlinear Particle filter (PF), to estimate the number of vehicles using CV data only. The proposed model-driven approaches are evaluated using real and simulated data, the former of which were collected along a signalized roadway in downtown Blacksburg, VA. Results indicate that the number of vehicles produced by the linear KF approach is the most accurate. The results also show that the KF approach is the least sensitive approach to the initial conditions. Machine learning approaches are also developed to estimate the number of vehicles, such as an artificial neural network (ANN), a k-nearest neighbor (k-NN), and a random forest (RF). The machine learning approaches also use CV data only. Results demonstrate that the ANN approach outperforms the k-NN and RF approaches. Finally, the dissertation compares the performance of the model-driven and the machine learning approaches, showing that the ANN approach produces the most accurate estimates. However, taking into consideration the computational time needed to train the ANN approach, the huge amount of data needed, and the uncertainty in the performance when new traffic behaviors are observed (e.g., incidents), the use of the KF approach is highly recommended in the application of vehicle count estimation due to its simplicity and applicability in the field.
394

Divine Truces : Forecasting How Religious Audience Costs Affect Ceasefire Success

Holmberg, Jonas January 2024 (has links)
This thesis investigates how religious holidays affect the chance for ceasefire success. It does so while engaging in the topic of explanation and prediction, with combined methods consisting of regression analysis and forecasting using random forests. The theoretical framework argues that religious holidays impose higher audience costs for violence on leaders, increasing the chance for successful agreements. This would manifest as ceasefires connected to religious holidays being more successful than those that are not. The findings from the regression analysis find no support for the hypothesis, and rather indicate that conflict intensity, ceasefire duration, as well as monitoring and verification and enforcement mechanisms better explain the apparent variation in success. The forecasting indicates a minor difference in predictive power between the model including holiday and religiosity and the one excluding them, as well as minor effects of the independent variables on ceasefire success from partial dependence plots. These findings do not oppose the rejection of the hypothesis but rather indicate the need for an increased prevalence of forecasting methods within peace and conflict research and robustness tests using other definitions of success.
395

Data-Driven Diagnosis For Fuel Injectors Of Diesel Engines In Heavy-Duty Trucks

Eriksson, Felix, Björkkvist, Emely January 2024 (has links)
The diesel engine in heavy-duty trucks is a complex system with many components working together, and a malfunction in any of these components can impact engine performance and result in increased emissions. Fault detection and diagnosis have therefore become essential in modern vehicles, ensuring optimal performance and compliance with progressively stricter legal requirements. One of the most common faults in a diesel engineis faulty injectors, which can lead to fluctuations in the amount of fuel injected. Detecting these issues is crucial, prompting a growing interest in exploring additional signals beyond the currently used signal to enhance the performance and robustness of diagnosing this fault. In this work, an investigation was conducted to identify signals that correlate with faulty injectors causing over- and underfueling. It was found that the NOx, O2, and exhaust pressure signals are sensitive to this fault and could potentially serve as additional diagnostic signals. With these signals, two different diagnostic methods were evaluated to assess their effectiveness in detecting injector faults. The methods evaluated were data-driven residuals and Random Forest classifier. The data-driven residuals, when combined with the CUSUM algorithm, demonstrated promising results in detecting faulty injectors. The O2 signal proved effective in identifying both fault instances, while NOx and exhaust pressure were more effective at detecting overfueling. The Random Forest classifier also showed good performance in detecting both over- and underfueling. However, it was observed that using a classifier requires more extensive data preprocessing. Two preprocessing methods were employed: integrating previous measurements and calculating statistical measures over a defined time span. Both methods showed promising results, with the latter proving to be the better choice. Additionally, the generalization capabilities of these methods across different operating conditions were evaluated. It was demonstrated thatthe data-driven residuals yielded better results compared to the classifier, which requiredtraining on new cases to perform effectively.
396

Classifying Portable Electronic Devices using Device Specifications : A Comparison of Machine Learning Techniques

Westerholm, Ludvig January 2024 (has links)
In this project, we explored the usage of machine learning in classifying portable electronic devices. The primary objective was to identify devices such as laptops, smartphones, and tablets, based on their physical and technical specification. These specifications, sourced from the Pricerunner price comparison website, contain height, Wi-Fi standard, and screen resolution. We aggregated this information into a dataset and split it into a training set and a testing set. To achieve the classification of devices, we trained four popular machine learning models: Random Forest (RF), Logistic Regression (LR), k-Nearest Neighbor (kNN), and Fully Connected Network (FCN). We then compared the performance of these models. The evaluation metrics used to compare performance included precision, recall, F1-score, accuracy, and training time. The RF model achieved the highest overall accuracy of 95.4% on the original dataset. The FCN, applied to a dataset processed with standardization followed by Principal Component Analysis (PCA), reached an accuracy of 92.7%, the best within this specific subset. LR excelled in a few class-specific metrics, while kNN performed notably well relative to its training time. The RF model was the clear winner on the original dataset, while the kNN model was a strong contender on the PCA-processed dataset due to its significantly faster training time compared to the FCN. In conclusion, the RF was the best-performing model on the original dataset, the FCN showed impressive results on the standardized and PCA-processed dataset, and the kNN model, with its highest macro precision and rapid training time, also demonstrated competitive performance.
397

Comparative Analysis of Machine Learning Algorithms for Cryptocurrency Price Prediction

Kurtagic, Leila January 2024 (has links)
As the cryptocurrency markets continuously grow, so does the need for reliable analytical tools for price prediction. This study conducted a comparative analysis of machine learning (ML) algorithms for cryptocurrency price prediction. Through a literature review, three common and reliable ML algorithms for cryptocurrency price prediction were identified: Long Short-Term Memory (LSTM), Random Forest (RF), and eXtreme Gradient Boosting (XGBoost). Utilizing the Bitcoin All Time History dataset from TradingView, the study assessed both the individual performance of each algorithm and the potential of ensemble methods to enhance predictive accuracy. The results reveal that the LSTM algorithm outperformed RF and XGBoost in terms of predictive accuracy according to the metrics Mean Absolute Error (MAE), Mean Squared Error (MSE), and Root Mean Squared Error (RMSE). Additionally, two ensemble approaches were tested: Ensemble 1, which enhanced the LSTM model with the combined predictions from RF and XGBoost, and Ensemble 2, which integrated predictions from all three models. Ensemble 2 demonstrated the highest predictive performance among all models, highlighting the advantages of using ensemble approaches for more robust predictions.
398

Explainable predictive quality inautomotive manufacturing : Case study at Magna Electronics

Ke, Damian January 2024 (has links)
This thesis is a case study conducted at Magna Electronics to explore the use of machinelearning techniques in improving the predictive quality of electronic control unit (ECU)within the automotive manufacturing. This thesis aims to apply interpretable machinelearning methods to predict potential future ECU failures early. With the interpretablemachine learning the goal is to identify predictive variables that lead to ECU failure andwhich can be used as support for decision making.Logistic Regression and Random Forest were chosen as the machine learning methods,which have been used in research of predictive quality and have different levels of interpretability.TreeSHAP was used on the Random Forest as the post-hoc method to furtherunderstand the results. The models’ performances were quantitatively evaluatedthrough metrics such as accuracy and area under precision-recall curve. Subsequently, thebest-performing models were further analyzed using confusion matrices, precision-recallcurves, and horizontal bar charts to assess the impact of predictive variables.The results of this thesis indicated that while Random Forest outperformed Logistic Regression,both models demonstrated limited capability in accurately predicting faulty ECUs,due to the low AUCPR scores. The precision-recall curves suggested performance near randomguess, highlighting the possible variability in parameter impact.This study has also identified significant challenges, such as data imbalance and mislabeling,which may have had a negative effect on the results. Given these issues, the thesisadvises caution in using these results for decision-making. Although, findings of this thesisunderscore the need for a cautious approach to interpreting model outputs, suggestingthat real-world application may require to use different models based on the specific goalsand context of the analysis.
399

Detection and Prevention of Malware Smuggling in Scalable Vector Graphics (SVG)

Ufnal, Marek, Longuevergne, Thomas January 2024 (has links)
Since 2015, malicious actors have been using SVG files to obfuscate malware from potential defensive mechanisms and carry out attacks undetected through the use of smuggling tech- niques [1]. Throughout this thesis, we use the Design Science Research methodology in order to design and develop an artefact able to detect these attacks within a real network infrastruc- ture, while minimising the impact on the user experience. For the designed artefact to answer these challenges, we conduct two scoping reviews: an analysis of seven of these incidents to determine the technique used to perform the smuggling. This is followed by a map of the dif- ferent security processes available to network administrators and individuals who search for open-source technologies and aim to close the gap left by lack of these solutions.  Moreover the paper proposes a SVG parser and a Random Forest classifier to extract valu- able features needed to find the malicious payloads hidden in the graphics. The performance of the artefact is analysed to determine its suitability for real-world usage and if an adequate success rate is reached. The paper finally concludes that the task of obfuscated malware de- tection is a multi-faceted problem and the artefact, while successful, is a suitable blueprint for exploring future improvements in the field.
400

Maskininlärning för Prediktion av Fartygsvibrationer : En jämförelsestudie av Random forest,Gradient boosting och Neurala nätverk

Tvinghagen, Fredrik, Queckfeldt, Jonathan January 2024 (has links)
Syftet med detta projekt är att utveckla en prediktiv modell för att förutsäga fartygsvibrationer baserat på historisk mätdata från lastfartyg. Projektet fokuserar på att använda maskininlärningsmetoder för att förutspå amplituden av vibrationer och identifiera de mest relevanta variablerna för modellens prediktiva förmåga. De undersökta metoderna inkluderar random forest, gradient boosting och neurala nätverk. Resultaten visar att random forest-modellen presterar bäst utifrån prestandamåtten: medelkvadratfelet (MSE), genomsnittliga absoluta felet (MAE) och genomsnittliga procentuella felet (MAPE). Projektet bidrar till en djupare förståelse av maskininlärningens användningsområden för en mer hållbar sjöfart. Syftet är att ge ett underlag till att potentiellt reducera vibrationer som påverkar fartygens komfort och strukturella integritet.

Page generated in 0.0521 seconds