Spelling suggestions: "subject:"bobilev V3""
1 |
COMPRESSED MOBILENET V3: AN EFFICIENT CNN FOR RESOURCE CONSTRAINED PLATFORMSKavyashree Pras Shalini Pradeep Prasad (10662020) 10 May 2021 (has links)
<p>Computer Vision is a mathematical
tool formulated to extend human vision to machines. This tool can perform
various tasks such as object classification, object tracking, motion
estimation, and image segmentation. These tasks find their use in many applications,
namely robotics, self-driving cars, augmented reality, and mobile applications.
However, opposed to the traditional technique of incorporating handcrafted
features to understand images, convolution neural networks are being used to
perform the same function. Computer vision applications widely use CNNs due to
their stellar performance in interpreting images. Over the years, there have
been numerous advancements in machine learning, particularly to CNNs. However,
the need to improve their accuracy, model size and complexity increased, making
their deployment in restricted environments a challenge. Many researchers
proposed techniques to reduce the size of CNN while still retaining its
accuracy. Few of these include network quantization, pruning, low rank, and
sparse decomposition and knowledge distillation. Some methods developed
efficient models from scratch. This thesis achieves a similar goal using design
space exploration techniques on the latest variant of MobileNets, MobileNet V3.
Using Depthwise Pointwise Depthwise (DPD) blocks, escalation in the number of
expansion filters in some layers and mish activation function MobileNet V3 is
reduced to 84.96% in size and made 0.2% more accurate. Furthermore, it is
deployed in NXP i.MX RT1060 for image classification on CIFAR-10 dataset.</p>
|
2 |
Compressed MobileNet V3: An efficient CNN for resource constrained platformsPrasad, S. P. Kavyashree 05 1900 (has links)
Indiana University-Purdue University Indianapolis (IUPUI) / Computer Vision is a mathematical tool formulated to extend human vision to machines.
This tool can perform various tasks such as object classification, object tracking, motion
estimation, and image segmentation. These tasks find their use in many applications, namely robotics, self-driving cars, augmented reality, and mobile applications. However, opposed to the traditional technique of incorporating handcrafted features to understand images, convolution neural networks are being used to perform the same function.
Computer vision applications widely use CNNs due to their stellar performance in interpreting images. Over the years, there have been numerous advancements in machine learning, particularly to CNNs.However, the need to improve their accuracy, model size and complexity increased, making their deployment in restricted environments a challenge.
Many researchers proposed techniques to reduce the size of CNN while still retaining
its accuracy. Few of these include network quantization, pruning, low rank, and sparse
decomposition and knowledge distillation. Some methods developed efficient models from
scratch. This thesis achieves a similar goal using design space exploration techniques on the latest variant of MobileNets, MobileNet V3. Using DPD blocks, escalation in the number of expansion filters in some layers and mish activation function MobileNet V3 is reduced to 84.96% in size and made 0.2% more accurate. Furthermore, it is deployed in NXP i.MX RT1060 for image classification on CIFAR-10 dataset.
|
3 |
Deep Multimodal Physiological Learning Of Cerebral Vasoregulation Dynamics On Stroke Patients Towards Precision Brain MedicineAkanksha Tipparti (18824731) 03 September 2024 (has links)
<p dir="ltr">Impaired cerebral vasoregulation is one of the most common post-ischemic stroke effects. Diagnosis and prevention of this condition is often invasive, costly and in-effective. This impairment restricts the cerebral blood vessels to properly regulate blood flow, which is very important for normal brain functioning. Developing accurate, non-invasive and efficient methods to detect this condition aids in better stroke diagnosis and prevention. </p><p dir="ltr">The aim of this thesis is to develop deep learning techniques for the purpose of detection of cerebral vasoregulation impairments by analyzing physiological signals. This research employs various Deep learning techniques like Convolution Neural Networks (CNN), MobileNet, and Long-Short-Term Memory (LSTM) to determine variety of physiological signals from the PhysioNet database like Electrocardio-gram (ECG), Transcranial Doppler (TCD), Electromyogram (EMG), and Blood Pressure(BP) as stroke or non-stroke subjects. The effectiveness of these algorithms is demonstrated by a classification accuracy of 90\% for the combination of ECG and EMG signals. </p><p dir="ltr">Furthermore, this research explores the importance of analyzing dynamic physiological activities in determining the impairment. The dynamic activities include Sit-stand, Sit-stand-balance, Head-up-tilt, and Walk dataset from the PhysioNet website. CNN and MobileNetV3 are employed in classification purposes of these signals, attempting to identify cerebral health. The accuracy of the model and robustness of these methods is greatly enhanced when multiple signals are integrated. </p><p dir="ltr">Overall, this study highlights the potential of deep multimodal physiological learning in the development of precision brain medicine further enhancing stroke diagnosis. The results pave the way for the development of advanced diagnostic tools to determine cerebral health. </p>
|
Page generated in 0.0357 seconds