Global ETD Search

461	Deep Learning-Based Automated Segmentation and Detection of Chondral Lesions on the Distal Femur Lindemalm Karlsson, Josefin January 2019 (has links) Articular chondral lesions in the knee joint can be diagnosed at an early stage using MRI. Segmenting and visualizing lesions and the overall joint structure allows improved communication between the radiologist and referring physician. It can also be of help when determining diagnosis or conducting surgical planning. Although there are a variety of studies proving good results of segmentation of larger structures such as bone and cartilage in the knee, there are no studies available researching segmentation of articular cartilage lesions. Automating the segmentation will save time and money since manual segmentation is very time-consuming. In this thesis, a U-Net based convolutional neural network is used to perform automatic segmentation of chondral lesions located on the distal part of the femur, in the knee joint. Using two different techniques, batch normalization and dropout, a network was trained and tested using MRI sequences collected from Episurf Medical's database. The network was then evaluated using a segmentation approach and a detection approach. For the segmentation approach, the highest achieved dice coefficient and sensitivity of 0.4059 ± 0.1833 and 0.4591 ± 0.2387, was obtained using batch normalization and 260 training subjects, consisting of MRI sequence and corresponding masks. Using a detection approach, the predicted output could correctly identify 81.8% of the chondral lesions in the MRI sequences. Although there is a need for improvement of technique and datasets used in this thesis, the achieved results show prerequisites for future improvement and possible implementation. / Skador i knäledens brosk kan diagnostiseras i ett tidigt stadie med hjälp av MR. Segmentering och visualisering av skadorna, samt ledens struktur i helhet, bidrar till en förbättrad kommunikation mellan radiolog och remitterande läkare. Det kan också underlätta för att ställa diagnos eller utföra operationsplanering. I dagsläget finns flertalet studier som påvisar goda resultat för segmentering av större strukturer, t.ex. ben och brosk. Det finns dock få studier som studerar segmentering av skador i ledbrosk. Genom att automatisera segmenteringsprocessen kan både tid och pengar sparas. Detta eftersom att manuell segmentering är mycket tidskrävande. I detta arbete kommer ett U-Net baserat convolutional neural network att användas för att utföra automatisk segmentering av skador på distala femur i knäleden. Nätverket kommer att tränas med två olika tekniker, batch normalization och dropout. Nätverket kommer att tränas med data som är hämtad från Episurf Medicals databas och består av MR sekvenser. Nätverket kommer att tränas och utvärderas med hjälp av två metoder, en segmenteringsmetod och detekteringsmetod. Den högsta uppnådda dice koefficienten och sensitiviteten vid utvärderingen av segmenteringsmetoden uppmätte 0,4059 ± 0,1833 och 0,4591 ± 0,2387. Den upnåddes med hjälp av batch normalization och 260 MR sekvenser för träning och testning. För detektionsmetoden kunde programmet identifiera 81,8% av skadorna synliga på MR sekvenserna. Även om tekniken och datan som används behöver optimeras, så visar det uppnådda resultatet på bra förutsättningar för fortsatta studier och i framtiden möjligen även implementering av tekniken. Deep Learning Machine Learning automated segmentation MRI chondral lesions articular cartilage Deep Learning Maskininlärning automatiserad segmentering MRI broskskador ledbrosk Medical Engineering Medicinteknik
462	Exploring Cross-Lingual Transfer Learning for Swedish Named Entity Recognition : Fine-tuning of English and Multilingual Pre-trained Models / Utforskning av tvärspråklig överföringsinlärning för igenkänning av namngivna enheter på svenska Lai Wikström, Daniel, Sparr, Axel January 2023 (has links) Named Entity Recognition (NER) is a critical task in Natural Language Processing (NLP), and recent advancements in language model pre-training have significantly improved its performance. However, this improvement is not universally applicable due to a lack of large pre-training datasets or computational budget for smaller languages. This study explores the viability of fine-tuning an English and a multilingual model on a Swedish NER task, compared to a model trained solely on Swedish. Our methods involved training these models and measuring their performance using the F1-score metric. Despite fine-tuning, the Swedish model outperformed both the English and multilingual models by 3.0 and 9.0 percentage points, respectively. The performance gap between the English and Swedish models during fine-tuning decreased from 19.8 to 9.0 percentage points. This suggests that while the Swedish model achieved the best performance, fine-tuning can substantially enhance the performance of English and multilingual models for Swedish NER tasks. / Inom området för Natural Language Processing (NLP) är identifiering av namngivna entiteter (NER) en viktig problemtyp. Tack vare senaste tidens framsteg inom förtränade språkmodeller har modellernas prestanda på problemtypen ökat kraftigt. Denna förbättring kan dock inte tillämpas överallt på grund av en brist på omfattande dataset för förträning eller tillräcklig datorkraft för mindre språk. I denna studie undersöks potentialen av fine-tuning på både en engelsk, en svensk och en flerspråkig modell för en svensk NER-uppgift. Dessa modeller tränades och deras effektivitet bedömdes genom att använda F1-score som mått på prestanda. Även med fine-tuning var den svenska modellen bättre än både den engelska och flerspråkiga modellen, med en skillnad på 3,0 respektive 9,0 procentenheter i F1-score. Skillnaden i prestandan mellan den engelska och svenska modellen minskade från 19,8 till 9,0 procentenheter efter fine-tuning. Detta indikerar att även om den svenska modellen var mest framgångsrik, kan fine-tuning av engelska och flerspråkiga modeller betydligt förbättra prestandan för svenska NER-uppgifter. NER Cross-lingual transfer Transformer BERT Deep Learning namnigenkänning NER multilingvistisk överföring Transformer BERT deep learning Computer and Information Sciences Data- och informationsvetenskap
463	Exploiting Deep Learning and Traffic Models for Freeway Traffic Estimation Genser, Alexander, Makridis, Michail A., Kouvelas, Anastasios 23 June 2023 (has links) Emerging sensors and intelligent traffic technologies provide extensive data sets in a traffic network. However, realizing the full potential of such data sets for a unique representation of real-world states is challenging due to data accuracy, noise, and temporal-spatial resolution. Data assimilation is a known group of methodological approaches that exploit physics-informed traffic models and data observations to perform short-term predictions of the traffic state in freeway environments. At the same time, neural networks capture high non-linearities, similar to those presented in traffic networks. Despite numerous works applying different variants of Kalman filters, the possibility of traffic state estimation with deep-learning-based methodologies is only partially explored in the literature. We present a deep-learning modeling approach to perform traffic state estimation on large freeway networks. The proposed framework is trained on local observations from static and moving sensors and identifies differences between well-trusted data and model outputs. The detected patterns are then used throughout the network, even where there are no available observations to estimate fundamental traffic quantities. The preliminary results of the work highlight the potential of deep learning for traffic state estimation. info:eu-repo/classification/ddc/360 ddc:360
464	Depth-Aware Deep Learning Networks for Object Detection and Image Segmentation Dickens, James 01 September 2021 (has links) The rise of convolutional neural networks (CNNs) in the context of computer vision has occurred in tandem with the advancement of depth sensing technology. Depth cameras are capable of yielding two-dimensional arrays storing at each pixel the distance from objects and surfaces in a scene from a given sensor, aligned with a regular color image, obtaining so-called RGBD images. Inspired by prior models in the literature, this work develops a suite of RGBD CNN models to tackle the challenging tasks of object detection, instance segmentation, and semantic segmentation. Prominent architectures for object detection and image segmentation are modified to incorporate dual backbone approaches inputting RGB and depth images, combining features from both modalities through the use of novel fusion modules. For each task, the models developed are competitive with state-of-the-art RGBD architectures. In particular, the proposed RGBD object detection approach achieves 53.5% mAP on the SUN RGBD 19-class object detection benchmark, while the proposed RGBD semantic segmentation architecture yields 69.4% accuracy with respect to the SUN RGBD 37-class semantic segmentation benchmark. An original 13-class RGBD instance segmentation benchmark is introduced for the SUN RGBD dataset, for which the proposed model achieves 38.4% mAP. Additionally, an original depth-aware panoptic segmentation model is developed, trained, and tested for new benchmarks conceived for the NYUDv2 and SUN RGBD datasets. These benchmarks offer researchers a baseline for the task of RGBD panoptic segmentation on these datasets, where the novel depth-aware model outperforms a comparable RGB counterpart. Deep learning Computer vision CNN Object detection Semantic segmentation Instance segmentation Multi-modal deep learning Panoptic segmentation Artificial intelligence Convolutional neural networks Neural networks RGBD Depth images
465	Evaluation and Optimization of Deep Learning Networks for Plant Disease Forecasting And Assessment of their Generalizability for Early Warning Systems Hannah Elizabeth Klein (15375262) 05 May 2023 (has links) <p>This research focused on developing adaptable models and protocols for early warning systems for forecasting plant diseases and datasets. It compared the performance of deep learning models in predicting soybean rust disease outbreaks using three years of public epidemiological data and gridded weather data. The models selected were a dense network and a Long Short-Term Memory (LSTM) network. The objectives included evaluating the effectiveness of small citizen science datasets and gridded meteorological weather in sequential forecasting, assessing the ideal window size and important inputs, and exploring the generalizability of the model protocol and models to other diseases. The model protocol was developed using a soybean rust dataset. Both the dense and the LSTM networks produced accuracies of over 90% during optimization. When tested for forecasting, both networks could forecast with an accuracy of 85% or higher over various window sizes. Experiments on window size indicated a minimum input of 8 -11 days. Generalizability was demonstrated by applying the same protocol to a southern corn rust dataset, resulting in 87.8% accuracy. In addition, transfer learning and pre-trained models were tested. Direct transfer learning between disease was not successful, while pre training models resulted both positive and negative results. Preliminary results are reported for building generalizable disease models using epidemiological and weather data that researchers could apply to generate forecasts for new diseases and locations.</p> Agro-ecosystem function and prediction Deep learning Neural networks Deep Learning Long Short-Term Memory Soybean rust disease prediction models Forecasting
466	ANALYSIS OF CONTINUOUS LEARNING MODELS FOR TRAJECTORY REPRESENTATION Kendal Graham Norman (15344170) 24 April 2023 (has links) <p> Trajectory planning is a field with widespread utility, and imitation learning pipelines<br> show promise as an accessible training method for trajectory planning. MPNet is the state<br> of the art for imitation learning with respect to success rates. MPNet has two general<br> components to its runtime: a neural network predicts the location of the next anchor point in<br> a trajectory, and then planning infrastructure applies sampling-based techniques to produce<br> near-optimal, collision-less paths. This distinction between the two parts of MPNet prompts<br> investigation into the role of the neural architectures in the Neural Motion Planning pipeline,<br> to discover where improvements can be made. This thesis seeks to explore the importance<br> of neural architecture choice by removing the planning structures, and comparing MPNet’s<br> feedforward anchor point predictor with that of a continuous model trained to output a<br> continuous trajectory from start to goal. A new state of the art model in continuous learning<br> is the Neural Flow model. As a continuous model, it possess a low standard deviation runtime<br> which can be properly leveraged in the absence of planning infrastructure. Neural Flows also<br> output smooth, continuous trajectory curves that serve to reduce noisy path outputs in the<br> absence of lazy vertex contraction. This project analyzes the performance of MPNet, Resnet<br> Flow, and Coupling Flow models when sampling-based planning tools such as dropout, lazy<br> vertex contraction, and replanning are removed. Each neural planner is trained end-to-end in<br> an imitation learning pipeline utilizing a simple feedforward encoder, a CNN-based encoder,<br> and a Pointnet encoder to encode the environment, for purposes of comparison. Results<br> indicate that performance is competitive, with Neural Flows slightly outperforming MPNet’s<br> success rates on our reduced dataset in Simple2D, and being slighty outperformed by MPNet<br> with respect to collision penetration distance in our UR5 Cubby test suite. These results<br> indicate that continuous models can compete with the performance of anchor point predictor<br> models when sampling-based planning techniques are not applied. Neural Flow models also<br> have other benefits that anchor point predictors do not, like continuity guarantees, the ability<br> to select a proportional location in a trajectory to output, and smoothness. </p> Intelligent robotics Deep learning Neural networks Neural ODEs Neural Flows Deep Learning Neural Networks Robotics Trajectory Planning Path Planning Anchor Point Prediction
467	Generative adversarial networks for single image super resolution in microscopy images Gawande, Saurabh January 2018 (has links) Image Super resolution is a widely-studied problem in computer vision, where the objective is to convert a lowresolution image to a high resolution image. Conventional methods for achieving super-resolution such as image priors, interpolation, sparse coding require a lot of pre/post processing and optimization. Recently, deep learning methods such as convolutional neural networks and generative adversarial networks are being used to perform super-resolution with results competitive to the state of the art but none of them have been used on microscopy images. In this thesis, a generative adversarial network, mSRGAN, is proposed for super resolution with a perceptual loss function consisting of a adversarial loss, mean squared error and content loss. The objective of our implementation is to learn an end to end mapping between the low / high resolution images and optimize the upscaled image for quantitative metrics as well as perceptual quality. We then compare our results with the current state of the art methods in super resolution, conduct a proof of concept segmentation study to show that super resolved images can be used as a effective pre processing step before segmentation and validate the findings statistically. / Image Super-resolution är ett allmänt studerad problem i datasyn, där målet är att konvertera en lågupplösningsbild till en högupplöst bild. Konventionella metoder för att uppnå superupplösning som image priors, interpolation, sparse coding behöver mycket föroch efterbehandling och optimering.Nyligen djupa inlärningsmetoder som convolutional neurala nätverk och generativa adversariella nätverk är användas för att utföra superupplösning med resultat som är konkurrenskraftiga mot toppmoderna teknik, men ingen av dem har använts på mikroskopibilder. I denna avhandling, ett generativ kontradiktorisktsnätverk, mSRGAN, är föreslås för superupplösning med en perceptuell förlustfunktion bestående av en motsatt förlust, medelkvadratfel och innehållförlust.Mål med vår implementering är att lära oss ett slut på att slut kartläggning mellan bilder med låg / hög upplösning och optimera den uppskalade bilden för kvantitativa metriks såväl som perceptuell kvalitet. Vi jämför sedan våra resultat med de nuvarande toppmoderna metoderna i superupplösning, och uppträdande ett bevis på konceptsegmenteringsstudie för att visa att superlösa bilder kan användas som ett effektivt förbehandling steg före segmentering och validera fynden statistiskt. Deep Learning Generative adversarial networks Super resolution High content screening microscopy Deep Learning Generative adversarial networks Super resolution High content screening microscopy Computer and Information Sciences Data- och informationsvetenskap
468	Opto-Acoustic Slopping Prediction System in Basic Oxygen Furnace Converters Ghosh, Binayak January 2017 (has links) Today, everyday objects are becoming more and more intelligent and some-times even have self-learning capabilities. These self-learning capacities in particular also act as catalysts for new developments in the steel industry.Technical developments that enhance the sustainability and productivity of steel production are very much in demand in the long-term. The methods of Industry 4.0 can support the steel production process in a way that enables steel to be produced in a more cost-effective and environmentally friendly manner. This thesis describes the development of an opto-acoustic system for the early detection of slag slopping in the BOF (Basic Oxygen Furnace) converter process. The prototype has been installed in Salzgitter Stahlwerks, a German steel plant for initial testing. It consists of an image monitoring camera at the converter mouth, a sound measurement system and an oscillation measurement device installed at the blowing lance. The camera signals are processed by a special image processing software. These signals are used to rate the amount of spilled slag and for a better interpretation of both the sound data and the oscillation data. A certain aspect of the opto-acoustic system for slopping detection is that all signals, i.e. optic, acoustic and vibratory, are affected by process-related parameters which are not always relevant for the slopping event. These uncertainties affect the prediction of the slopping phenomena and ultimately the reliability of the entire slopping system. Machine Learning algorithms have been been applied to predict the Slopping phenomenon based on the data from the sensors as well as the other process parameters. / Idag blir vardagliga föremål mer och mer intelligenta och ibland har de självlärande möjligheter. Dessa självlärande förmågor fungerar också som katalysatorer för den nya utvecklingen inom stålindustrin. Teknisk utveckling som stärker hållbarheten och produktiviteten i stålproduktionen är mycket efterfrågad på lång sikt. Metoderna för Industry 4.0 kan stödja stålproduktionsprocessen på ett sätt som gör att stål kan produceras på ett mer kostnadseffektivt och miljövänligt sätt. Denna avhandling beskriver utvecklingen av ett opto-akustiskt system för tidig detektering av slaggsslipning i konverteringsprocessen BOF (Basic Oxygen Furnace). Prototypen har installerats i Salzgitter Stahlwerks, en tysk stålverk för första provning. Den består av en bildövervakningskamera på omvandlarens mun, ett ljudmätningssystem och en oscillationsmätningsenhet som installeras vid blåsans. Kamerans signaler behandlas av en speciell bildbehandlingsprogram. Dessa signaler används för att bestämma mängden spilld slagg och för bättre tolkning av både ljuddata och oscillationsdata. En viss aspekt av det optoakustiska systemet för släckningsdetektering är att alla signaler, dvs optiska, akustiska och vibrerande, påverkas av processrelaterade parametrar som inte alltid är relevanta för slöjningsevenemanget. Dessa osäkerheter påverkar förutsägelsen av slopfenomenerna och i slutändan tillförlitligheten för hela slöjningssystemet. Maskininlärningsalgoritmer har tillämpats för att förutsäga Slopping-fenomenet baserat på data från sensorerna liksom de andra processparametrarna. BOF Slopping Sensor fusion Image Processing Machine Learning Data Analysis Neural Networks Deep Learning BOF Slopping Sensorfusion Bildbehandling Maskininlärning Dataanalys Neurala nätverk Deep Learning Computer Sciences Datavetenskap (datalogi)
469	Deep Brain Dynamics and Images Mining for Tumor Detection and Precision Medicine Lakshmi Ramesh (16637316) 30 August 2023 (has links) <p>Automatic brain tumor segmentation in Magnetic Resonance Imaging scans is essential for the diagnosis, treatment, and surgery of cancerous tumors. However, identifying the hardly detectable tumors poses a considerable challenge, which are usually of different sizes, irregular shapes, and vague invasion areas. Current advancements have not yet fully leveraged the dynamics in the multiple modalities of MRI, since they usually treat multi-modality as multi-channel, and the early channel merging may not fully reveal inter-modal couplings and complementary patterns. In this thesis, we propose a novel deep cross-attention learning algorithm that maximizes the subtle dynamics mining from each of the input modalities and then boosts feature fusion capability. More specifically, we have designed a Multimodal Cross-Attention Module (MM-CAM), equipped with a 3D Multimodal Feature Rectification and Feature Fusion Module. Extensive experiments have shown that the proposed novel deep learning architecture, empowered by the innovative MM- CAM, produces higher-quality segmentation masks of the tumor subregions. Further, we have enhanced the algorithm with image matting refinement techniques. We propose to integrate a Progressive Refinement Module (PRM) and perform Cross-Subregion Refinement (CSR) for the precise identification of tumor boundaries. A Multiscale Dice Loss was also successfully employed to enforce additional supervision for the auxiliary segmentation outputs. This enhancement will facilitate effectively matting-based refinement for medical image segmentation applications. Overall, this thesis, with deep learning, transformer-empowered pattern mining, and sophisticated architecture designs, will greatly advance deep brain dynamics and images mining for tumor detection and precision medicine.</p> Computer vision Multimodal analysis and synthesis Deep learning Neural networks Semantic Segmentation Brain Tumor Segmentation Deep Learning Computer Vision Multimodal ML 3D Computer Vision Attention Cross-Attention Biomedical Segmentation
470	Models and Representation Learning Mechanisms for Graph Data Susheel Suresh (14228138) 15 December 2022 (has links) <p>Graph representation learning (GRL) has been increasing used to model and understand data from a wide variety of complex systems spanning social, technological, bio-chemical and physical domains. GRL consists of two main components (1) a parametrized encoder that provides representations of graph data and (2) a learning process to train the encoder parameters. Designing flexible encoders that capture the underlying invariances and characteristics of graph data are crucial to the success of GRL. On the other hand, the learning process drives the quality of the encoder representations and developing principled learning mechanisms are vital for a number of growing applications in self-supervised, transfer and federated learning settings. To this end, we propose a suite of models and learning algorithms for GRL which form the two main thrusts of this dissertation.</p> <p><br></p> <p>In Thrust I, we propose two novel encoders which build upon on a widely popular GRL encoder class called graph neural networks (GNNs). First, we empirically study the prediction performance of current GNN based encoders when applied to graphs with heterogeneous node mixing patterns using our proposed notion of local assortativity. We find that GNN performance in node prediction tasks strongly correlates with our local assortativity metric---thereby introducing a limit. We propose to transform the input graph into a computation graph with proximity and structural information as distinct types of edges. We then propose a novel GNN based encoder that operates on this computation graph and adaptively chooses between structure and proximity information. Empirically, adopting our transformation and encoder framework leads to improved node classification performance compared to baselines in real-world graphs that exhibit diverse mixing.</p> <p>Secondly, we study the trade-off between expressivity and efficiency of GNNs when applied to temporal graphs for the task of link ranking. We develop an encoder that incorporates a labeling approach designed to allow for efficient inference over the candidate set jointly, while provably boosting expressivity. We also propose to optimize a list-wise loss for improved ranking. With extensive evaluation on real-world temporal graphs, we demonstrate its improved performance and efficiency compared to baselines.</p> <p><br></p> <p>In Thrust II, we propose two principled encoder learning mechanisms for challenging and realistic graph data settings. First, we consider a scenario where only limited or even no labelled data is available for GRL. Recent research has converged on graph contrastive learning (GCL), where GNNs are trained to maximize the correspondence between representations of the same graph in its different augmented forms. However, we find that GNNs trained by traditional GCL often risk capturing redundant graph features and thus may be brittle and provide sub-par performance in downstream tasks. We then propose a novel principle, termed adversarial-GCL (AD-GCL), which enables GNNs to avoid capturing redundant information during the training by optimizing adversarial graph augmentation strategies used in GCL. We pair AD-GCL with theoretical explanations and design a practical instantiation based on trainable edge-dropping graph augmentation. We experimentally validate AD-GCL by comparing with state-of-the-art GCL methods and achieve performance gains in semi-supervised, unsupervised and transfer learning settings using benchmark chemical and biological molecule datasets. </p> <p>Secondly, we consider a scenario where graph data is silo-ed across clients for GRL. We focus on two unique challenges encountered when applying distributed training to GRL: (i) client task heterogeneity and (ii) label scarcity. We propose a novel learning framework called federated self-supervised graph learning (FedSGL), which first utilizes a self-supervised objective to train GNNs in a federated fashion across clients and then, each client fine-tunes the obtained GNNs based on its local task and available labels. Our framework enables the federated GNN model to extract patterns from the common feature (attribute and graph topology) space without the need of labels or being biased by heterogeneous local tasks. Extensive empirical study of FedSGL on both node and graph classification tasks yields fruitful insights into how the level of feature / task heterogeneity, the adopted federated algorithm and the level of label scarcity affects the clients’ performance in their tasks.</p> Data mining and knowledge discovery Graph, social and multimedia data Deep learning Neural networks Semi- and unsupervised learning Graph Neural Networks (GNNs) Deep Learning Self Supervised Learning Federated Learning frameworks

Search results