411 |
Optimizations for Deep Learning-Based CT Image EnhancementChaturvedi, Ayush 04 March 2024 (has links)
Computed tomography (CT) combined with deep learning (DL) has recently shown great potential in biomedical imaging. Complex DL models with varying architectures inspired by the human brain are improving imaging software and aiding diagnosis. However, the accuracy of these DL models heavily relies on the datasets used for training, which often contain low-quality CT images from low-dose CT (LDCT) scans. Moreover, in contrast to the neural architecture of the human brain, DL models today are dense and complex, resulting in a significant computational footprint. Therefore, in this work, we propose sparse optimizations to minimize the complexity of the DL models and leverage architecture-aware optimization to reduce the total training time of these DL models. To that end, we leverage a DL model called DenseNet and Deconvolution Network (DDNet). The model enhances LDCT chest images into high-quality (HQ) ones but requires many hours to train. To further improve the quality of final HQ images, we first modified DDNet's architecture with a more robust multi-level VGG (ML-VGG) loss function to achieve state-of-the-art CT image enhancement. However, improving the loss function results in increased computational cost. Hence, we introduce sparse optimizations to reduce the complexity of the improved DL model and then propose architecture-aware optimizations to efficiently utilize the underlying computing hardware to reduce the overall training time. Finally, we evaluate our techniques for performance and accuracy using state-of-the-art hardware resources. / Master of Science / Deep learning-based (DL) techniques that leverage computed tomography (CT) are becoming omnipresent in diagnosing diseases and abnormalities associated with different parts of the human body. However, their diagnostic accuracy is directly proportional to the quality of the CT images used in training the DL models, which is majorly governed by the radiation dose of the X-ray in the CT scanner. To improve the quality of low-dose CT (LDCT) images, DL-based techniques show promising improvements. However, these techniques require substantial computational resources and time to train the DL models. Therefore, in this work, we incorporate algorithmic techniques inspired by sparse neural architecture of the human brain to reduce the complexity of such DL models. To that end, we leverage a DL model called DenseNet and Deconvolution Network (DDNet) that enhances the quality of CT images generated by low X-ray dosage into high-quality CT images. However, due to its architecture, it takes hours to train DDNet on state-of-the-art hardware resources.
Hence, in this work, we propose techniques that efficiently utilize the hardware resources and reduce the time required to train DDNet. We evaluate the efficacy of our techniques on modern supercomputers in terms of speed and accuracy.
|
412 |
Gated Transformer-Based Architecture for Automatic Modulation ClassificationSahu, Antorip 05 February 2024 (has links)
This thesis delves into the advancement of 5G portable test-nodes in wireless communication systems with cognitive radio capabilities, specifically addressing the critical need for dynamic spectrum sensing and awareness at the radio receiver through AI-driven automatic modulation classification. Our methodology is centered around the transformer encoder architecture incorporating a multi-head self-attention mechanism. We train our architecture extensively across a diverse range of signal-to-noise ratios (SNRs) from the RadioML 2018.01A dataset. We introduce a novel transformer-based architecture with a gated mechanism, designed as a runtime re-configurable automatic modulation classification framework, which demonstrates enhanced performance with low SNR RF signals during evaluation, an area where conventional methods have shown limitations, as corroborated by existing research. Our innovative single-model framework employs distinct weight sets, activated by varying SNR levels, to enable a gating mechanism for more accurate modulation classification. This advancement in automatic modulation classification marks a crucial step toward the evolution of smarter communication systems. / Master of Science / This thesis delves into the advancement of wireless communication systems, particularly in developing portable devices capable of effectively detecting and analyzing radio signals with cognitive radio capabilities. Central to our research is leveraging artificial intelligence (AI) for automatic modulation classification, a method to identify signal modulation types. We utilize a transformer-based AI model trained on the RadioML 2018.01A dataset. Our training approach is particularly effective when evaluating low-quality signals using a gating mechanism based on signal-to-noise ratios, an area previously considered challenging in existing research. This work marks a significant advancement in creating more intelligent and responsive wireless communication systems.
|
413 |
Deep Learning-Based Image Analysis for Microwell AssayBiörck, Jonatan, Staniszewski, Maciej January 2024 (has links)
This thesis investigates the performance of deep learning models, specifically Resnet50 and TransUnet, in semantic image segmentation on microwell images containing tumor and natural killer (NK) cells. The main goal is to examine the effect of only using bright-field data (1-channel) as input instead of both fluorescent and brightfield data (4-channel); this is interesting since fluorescent imaging can cause damage to the cells being analyzed. The network performance is measured by Intersection over Union (IoU), the networks were trained and using manually annotated data from Onfelt Lab. TransUnet consistently outperformed the Resnet50 for both the 4-channel and 1-channel data. Moreover, the 4-channel input generally resulted in a better IoU compared to using only the bright-field channel. Furthermore, a significant decline in performance is observed when the networks are tested on the control data. For the control data, the overall IoU for the best performing 4-channel model dropped from 86.2\% to 73.9\%. The best performing 1-channel model dropped from 83.8\% to 70.8\% overall IoU.
|
414 |
Ocean Rain Detection and Wind Retrieval Through Deep Learning Architectures on Advanced Scatterometer DataMcKinney, Matthew Yoshinori Otani 18 June 2024 (has links) (PDF)
The Advanced Scatterometer (ASCAT) is a satellite-based remote sensing instrument designed for measuring wind speed and direction over the Earth's oceans. This thesis aims to expand and improve the capabilities of ASCAT by adding rain detection and advancing wind retrieval. Additionally, this expansion to ASCAT serves as evidence of Artificial Intelligence (AI) techniques learning both novel and traditional methods in remote sensing. I apply semantic segmentation to ASCAT measurements to detect rain over the oceans, enhancing capabilities to monitor global precipitation. I use two common neural network architectures and train them on measurements from the Tropical Rainfall Measuring Mission (TRMM) collocated with ASCAT measurements. I apply the same semantic segmentation techniques on wind retrieval in order to create a machine learning model that acts as an inverse Geophysical Model Function (GMF). I use three common neural network architectures and train the models on ASCAT data collocated with European Centre for Medium-Range Weather Forecasts (ECMWF) wind vector data. I successfully increase the capabilities of the ASCAT satellite to detect rainfall in Earth's oceans, with the ability to retrieve wind vectors without a GMF or Maximum Likelihood Estimation (MLE).
|
415 |
Consumer-Centric Innovation for Mobile Apps Empowered by Social Media AnalyticsQiao, Zhilei 20 June 2018 (has links)
Due to the rapid development of Internet communication technologies (ICTs), an increasing number of social media platforms exist where consumers can exchange comments online about products and services that businesses offer. The existing literature has demonstrated that online user-generated content can significantly influence consumer behavior and increase sales. However, its impact on organizational operations has been primarily focused on marketing, with other areas understudied. Hence, there is a pressing need to design a research framework that explores the impact of online user-generated content on important organizational operations such as product innovation, customer relationship management, and operations management. Research efforts in this dissertation center on exploring the co-creation value of online consumer reviews, where consumers' demands influence firms' decision-making. The dissertation is composed of three studies. The first study finds empirical evidence that quality signals in online product reviews are predictors of the timing of firms' incremental innovation. Guided by the product differentiation theory, the second study examines how companies' innovation and marketing differentiation strategies influence app performance. The last study proposes a novel text analytics framework to discover different information types from user reviews. The research contributes theoretical and practical insights to consumer-centric innovation and social media analytics literature. / PHD / The IT industry, and especially the mobile application (app) market, is intensively competitive and propelled by rapid innovation. The number of apps downloaded worldwide is 102,062 million, generating $88.3 billion in revenue, and projections suggest this will rise to $189 billion in 2020. Hence, there is an impetus to examine competition strategies of app makers to better understand how this important market functions. The app update is an important competitive strategy. The first study investigates what types of public information from both customers and app makers can be used to predict app makers’ updating decisions. The findings indicate customer provided information impacts app makers’ updating decisions. Hence, the study provides insights into the importance of customer-centric strategy to market players. In the second study, it explores the impacts of product differentiation strategies on app product performance in the mobile app marketplace. The results indicate that product updates, which the first study showed are influenced by consumer feedback, are a vertical product differentiation strategy that impacts app performance. Therefore, the results from the two studies illustrate the importance of integrating online customer feedback into companies’ technology strategy. Finally, the third study proposes a novel framework that applies a domain-adapted deep learning approach to categorizing and summarizing two types of innovation opportunities (i.e., feature requests) embedded in app reviews. The results show that the proposed classification approach outperforms traditional algorithms.
|
416 |
Digital Phenotyping and Genomic Prediction Using Machine and Deep Learning in Animals and PlantsBi, Ye 03 October 2024 (has links)
This dissertation investigates the utility of deep learning and machine learning approaches for livestock management and quantitative genetic modeling of rice grain size under climate change. Monitoring the live body weight of animals is crucial to support farm management decisions due to its direct relationship with animal growth, nutritional status, and health. However, conventional manual weighing methods are time consuming and can cause potential stress to animals. While there is a growing trend towards the use of three-dimensional cameras coupled with computer vision techniques to predict animal body weight, their validation with deep learning models as well as large-scale data collected in commercial environments is still limited. Therefore, the first two research chapters show how deep learning-based computer vision systems can enable accurate live body weight prediction for dairy cattle and pigs. These studies also address the challenges of managing large, complex phenotypic data and highlight the potential of deep learning models to automate data processing and improve prediction accuracy in an industry-scale commercial setting. The dissertation then shifts the focus to crop resilience, particularly in rice, where the asymmetric increase in average nighttime temperatures relative to the increase in average daytime temperatures due to climate change is reducing grain yield and quality in rice. Through the use of deep learning and machine learning models, the last two chapters explore how metabolic data can be used in quantitative genetic modeling in rice under environmental stress conditions such as high night temperatures. These studies showed that the integration of metabolites and genomics provided an improvement in the prediction of rice grain size-related traits, and certain metabolites were identified as potential candidates for improving multi-trait genomic prediction. Further research showed that metabolic accumulation was low to moderately heritable, and genomic prediction accuracies were consistent with expected genomic heritability estimates. Genomic correlations between control and high night temperature conditions indicated genotype-by-environment interactions in metabolic accumulation and the effectiveness of genomic prediction models for metabolic accumulation varied across metabolites. Joint analysis of multiple metabolites improved the accuracy of genomic prediction by exploiting correlations between metabolite accumulation. Overall, this dissertation highlights the potential of integrating digital technologies and multi-omic data to advance data analytics in agriculture, with applications in livestock management and quantitative genetic modeling of rice. / Doctor of Philosophy / This dissertation explores the application of deep learning and machine learning to computer vision-based livestock management and quantitative genetic modeling of rice grain size under climate change. The first half of the research chapters illustrate how computer vision systems can enable digital phenotyping of dairy cows and pigs, which is critical for informed management decisions and quantitative genetic analysis. These studies address the challenges of managing large-scale, complex phenotypic data and highlight the potential of deep learning models to automate data processing and improve prediction accuracy. Chapter 3 showed that a deep learning-based segmentation, Mask R-CNN, improved the prediction performance of cow body weight from longitudinal depth video data. Among the image features, volume followed by width correlated best with body weight. Chapter 4 showed that efficient deep learning-based supervised learning models are a promising approach for predicting pig body weight from industry-scale depth video data. Although the sparse design, which simulates budget and time constraints by using a subset of the data for training, resulted in some performance loss compared to the full design, the Vision Transformer models effectively mitigated this loss. The second half of the research chapters focuses on integrating metabolomic and genomic data to predict grain traits and metabolic content in rice under climate change. Through the use of machine learning models, these studies investigate how combining genomic and metabolic data can improve predictions, particularly under high night temperature stress in rice. Chapter 5 showed that the integration of metabolites and genomics improved the prediction of rice grain size-related traits, and certain metabolites were identified as potential candidates for improving multi-trait genomic prediction. Chapter 6 showed that metabolic accumulation was low to moderately heritable. Genomic correlations between control and high night temperature conditions indicated genotype-by-environment interactions in metabolic accumulation, and the effectiveness of genomic prediction models for metabolic accumulation varied across metabolites. Joint analysis of multiple metabolites improved the accuracy of genomic prediction by exploiting correlations between metabolite accumulation. Overall, the dissertation provides insight into how cutting-edge methods can be used to improve livestock management and multi-omic quantitative genetic modeling for breeding.
|
417 |
Enhancing Surgical Gesture Recognition Using Bidirectional LSTM and Evolutionary Computation: A Machine Learning Approach to Improving Robotic-Assisted Surgery / BiLSTM and Evolutionary Computation for Surgical Gesture RecognitionZhang, Yifei January 2024 (has links)
The integration of artificial intelligence (AI) and machine learning in the medical field has led to significant advancements in surgical robotics, particularly in enhancing the precision and efficiency of surgical procedures. This thesis investigates the application of a single-layer bidirectional Long Short-Term Memory (BiLSTM) model to the JHU-ISI Gesture and Skill Assessment Working Set (JIGSAWS) dataset, aiming to improve the recognition and classification of surgical gestures. The BiLSTM model, with its capability to process data in both forward and backward directions, offers a comprehensive analysis of temporal sequences, capturing intricate patterns within surgical motion data. This research explores the potential of BiLSTM models to outperform traditional unidirectional models in the context of robotic surgery.
In addition to the core model development, this study employs evolutionary computation techniques for hyperparameter tuning, systematically searching for optimal configurations to enhance model performance. The evaluation metrics include training and validation loss, accuracy, confusion matrices, prediction time, and model size. The results demonstrate that the BiLSTM model with evolutionary hyperparameter tuning achieves superior performance in recognizing surgical gestures compared to standard LSTM models.
The findings of this thesis contribute to the broader field of surgical robotics and human-AI partnership by providing a robust method for accurate gesture recognition, which is crucial for assessing and training surgeons and advancing automated and assistive technologies in surgical procedures. The improved model performance underscores the importance of sophisticated hyperparameter optimization in developing high-performing deep learning models for complex sequential data analysis. / Thesis / Master of Applied Science (MASc) / Advancements in artificial intelligence (AI) are transforming medicine, particularly in robotic surgery. This thesis focuses on improving how robots recognize and classify surgeons' movements during operations. Using a special AI model called a bidirectional Long Short-Term Memory (BiLSTM) network, which looks at data both forwards and backwards, the study aims to better understand and predict surgical gestures.
By applying this model to a dataset of surgical tasks, specifically suturing, and optimizing its settings with advanced techniques, the research shows significant improvements in accuracy and efficiency over traditional methods. The enhanced model is not only more accurate but also smaller and faster.
These improvements can help train surgeons more effectively and advance robotic assistance in surgeries, leading to safer and more precise operations, ultimately benefiting both surgeons and patients.
|
418 |
Developing a User-Independent Deep Learning-Based Biomechanical Gait Analysis System Using Full Body Kinematics and ElectromyographyAvdan, Goksu 01 August 2024 (has links) (PDF)
Motion capture (mocap) systems integrated with force plates and electromyography (EMG) collect detailed kinematic and kinetic data on subjects, including stride length, width, cadence, speed, and other spatiotemporal parameters. These systems allow clinicians and researchers to analyze movements, both cyclic (e.g., walking, running) and non-cyclic (e.g., jumping, falling), which is crucial for understanding movement patterns and identifying abnormalities. Clinical gait analysis, a key application, focuses on detecting musculoskeletal issues and walking impairments. While essential for diagnosing gait disorders and planning interventions, clinical gait analysis faces challenges such as noise, outliers, and marker occlusion in optical motion tracking data, requiring complex post-processing. Additionally, the measurement of ground reaction forces (GRFs) and moments (GRMs) is limited due to the restricted number of force plates. There are also challenges in EMG data collection, such as finding optimal MVC positions and developing nonlinear normalization techniques to replace traditional methods.To address these challenges, this research aims to develop an AI-driven gait analysis system that is cost-effective, user-independent, and relies solely on kinematic and EMG data for real-time analysis. The system is specifically designed to assess musculoskeletal characteristics in individuals with special needs, walking disabilities, or injuries, where measuring MVC levels is impractical or unsafe. The research has four main objectives: (1) standardize MVC positions for four lower limb muscles, (2) develop an alternative EMG normalization technique using nonlinear data analysis, (3) create an unsupervised framework using transformers for missing marker recovery without perfect ground-truth data, and (4) generate GRFs, GRMs, and JMs from lower limb kinematics using a 1D-CNN, improving accuracy and efficiency with transfer learning, without requiring force plates. While addressing these challenges, the proposed system aims to minimize user interaction, reduce pre- and post-processing, and lower costs for researchers and clinicians. The designed tool will integrate with existing optical marker-based mocap systems, providing greater flexibility and usability. In educational settings, it will offer students hands-on experience in advanced gait analysis techniques. Economically, widespread adoption of the tool in research and clinical settings will reduce data collection and analysis costs, making advanced gait analysis more accessible. Additionally, this tool can be applied to other fields, such as precision manufacturing, security, and predictive maintenance, where analyzing data can predict failures. Consequently, this research will significantly advance the field of human movement, increasing the volume and quality of research using optical marker-based mocap systems.
|
419 |
DeepARG+ - A Computational Pipeline for the Prediction of Antibiotic ResistanceKulkarni, Rutwik Shashank 16 June 2021 (has links)
The global spread of antibiotic resistance warrants concerted surveillance in the clinic and in the environment. The widespread use of metagenomics for various studies has led to the generation of a large amount of sequencing data. Next-generation sequencing of microbial communities provides an opportunity for proactive detection of emerging antibiotic resistance genes (ARGs) from such data, but there are a limited number of pipelines that enable the identification of novel ARGs belonging to diverse antibiotic classes at present. Therefore, there is a need for the development of computational pipelines that can identify these putative novel ARGs. Such pipelines should be scalable, accessible and have good performance.
To address this problem we develop a new method for predicting novel ARGs from genomic or metagenomic sequences, leveraging known ARGs of different resistance categories. Our method takes into account the physio-chemical properties that are intrinsic to different ARG families. Traditionally, new ARGs are predicted by making sequence alignment and calculating sequence similarity to existing ARG reference databases, which can be very time consuming. Here we introduce an alignment free and deep learning prediction method that incorporates both the primary protein sequences of ARGs and their physio-chemical properties.
We compare our method with existing pipelines including hidden Markov model based Resfams and fARGene, sequence alignment and machine learning-based DeepARG-LS, and homology modelling based Pairwise Comparative Modelling. We also use our model to detect novel ARGs from various environments including human-gut, soil, activated sludge and the influent samples collected from a waste water treatment plant. Results show that our method achieves greater accuracy compared to existing models for the prediction of ARGs and enables the detection of putative novel ARGs, providing promising targets for experimental characterization to the scientific community. / Master of Science / Various bacteria contain genes that allow them to survive and grow even after the application of antibiotics. Such genes are called antibiotic resistance genes (ARGs). Each ARG has properties that make it resistant to a particular class of antibiotics. This class is called the resistance class/category of the gene. Antimicrobial resistance (AMR) is one of the biggest challenges to public health in recent times. It has been projected that a large number of deaths might occur due to AMR in the future. Therefore, there is a need for monitoring AMR in various environments. Currently, developed methods use the sequence's similarity with the existing database as a feature for ARG prediction. Some tools also use the 3D structure of proteins as a feature for ARG prediction. In this thesis, we develop a tool that incorporates both the sequence similarity and the structural information of proteins for ARG prediction. The structural information is encoded with physio-chemical properties (such as hydrophobicity, molecular weight etc.) of the amino acids. Our results show the efficacy of the pipeline in various environments. Results also show that our method achieves accuracy greater than existing models for the prediction of ARGs from metagenomic data. It also enables the detection of putative novel ARGs, providing promising targets for experimental characterization to the scientific community.
|
420 |
Human gait movement analysis using wearable solutions and Artificial IntelligenceDavarzani, Samaneh 09 December 2022 (has links) (PDF)
Gait recognition systems have gained tremendous attention due to its potential applications in healthcare, criminal investigation, sports biomechanics, and so forth. A new solution to gait recognition tasks can be provided by wearable sensors integrated in wearable objects or mobile devices. In this research a sock prototype designed with embedded soft robotic sensors (SRS) is implemented to measure foot ankle kinematic and kinetic data during three experiments designed to track participants’ feet ankle movement. Deep learning and statistical methods have been employed to model SRS data against Motion capture system (MoCap) to determine their ability to provide accurate kinematic and kinetic data using SRS measurements. In the first study, the capacitance of SRS related to foot-ankle basic movements was quantified during the gait movements of twenty participants on a flat surface and a cross-sloped surface. I have conducted another study regarding kinematic features in which deep learning models were trained to estimate the joint angles in sagittal and frontal planes measured by a MoCap system. Participant-specific models were established for ten healthy subjects walking on a treadmill. The prototype was tested at various walking speeds to assess its ability to track movements for multiple speeds and generalize models for estimating joint angles in sagittal and frontal planes. The focus of the last study is measuring the kinetic features and the goal is determining the validity of SRS measurements, to this end the pressure data measured with SRS embedded into the sock prototype would be compared with the force plate data.
|
Page generated in 0.0956 seconds