161 |
Mixed-Criticality System Design For Real-Time Scheduling And Routing Upon Platforms With UncertaintiesVaidhun Bhaskar, Sudharsan 01 January 2022 (has links) (PDF)
Unlike typical computing systems, applications in real-time systems require strict timing guarantees. In the pursuit of providing guarantees, the complex dynamic behaviors of these systems are simplified using models to keep the analysis tractable. In order to guarantee safety, such models often involve pessimistic assumptions. While the amount of pessimism was reasonable for simple computing platforms, for modern platforms the pessimism involves ignoring features that improve performance such as cache usage, instruction pipelines, and more. In this work, we explore routing and scheduling problems in real-time systems, where the uncertainties in the operation are carefully accounted for by complex models and/or the routing and scheduling algorithms proposed. For real-time scheduling problems, we incorporate the execution time distribution into the task model to design a system that can meet the maximum permitted incidences of failure per hour. We also consider the case where no failure is permitted and all jobs in the system must be scheduled without violating their timing requirements, throughout their operation. It is achieved on a varying speed multiprocessor platform. For real-time routing problems, we consider graphs whose edge cost distribution is dynamic and the routed packets have deadlines to be met. We then extend this problem to the case where the initial (discrete) distribution of the edge costs is fully known. We propose a technique to safely incorporate a reinforcement learning strategy once the system deviates from its initial distribution. Finally, we focus on practical improvements to the popular and optimal earliest deadline first scheduling algorithm, upon a uniprocessor setting. Specifically, we develop techniques to quantify and utilize the idle times to handle uncertainties in the form of additional run-time workloads, arbitrary self-suspensions, and execution time estimate overruns.
|
162 |
Leveraging Signal Transfer Characteristics and Parasitics of Spintronic Circuits for Area and Energy-Optimized Hybrid Digital and Analog ArithmeticTatulian, Adrian 01 January 2023 (has links) (PDF)
While Internet of Things (IoT) sensors offer numerous benefits in diverse applications, they are limited by stringent constraints in energy, processing area and memory. These constraints are especially challenging within applications such as Compressive Sensing (CS) and Machine Learning (ML) via Deep Neural Networks (DNNs), which require dot product computations on large data sets. A solution to these challenges has been offered by the development of crossbar array architectures, enabled by recent advances in spintronic devices such as Magnetic Tunnel Junctions (MTJs). Crossbar arrays offer a compact, low-energy and in-memory approach to dot product computation in the analog domain by leveraging intrinsic signal-transfer characteristics of the embedded MTJ devices. The first phase of this dissertation research seeks to build on these benefits by optimizing resource allocation within spintronic crossbar arrays. A hardware approach to non-uniform CS is developed, which dynamically configures sampling rates by deriving necessary control signals using circuit parasitics. Next, an alternate approach to non-uniform CS based on adaptive quantization is developed, which reduces circuit area in addition to energy consumption. Adaptive quantization is then applied to DNNs by developing an architecture allowing for layer-wise quantization based on relative robustness levels. The second phase of this research focuses on extension of the analog computation paradigm by development of an operational amplifier-based arithmetic unit for generalized scalar operations. This approach allows for 95% area reduction in scalar multiplications, compared to the state-of-the-art digital alternative. Moreover, analog computation of enhanced activation functions allows for significant improvement in DNN accuracy, which can be harnessed through triple modular redundancy to yield 81.2% reduction in power at the cost of only 4% accuracy loss, compared to a larger network. Together these results substantiate promising approaches to several challenges facing the design of future IoT sensors within the targeted applications of CS and ML.
|
163 |
The Effects of Specular Reflection on Pulsed Infrared Thermography Threshold CrossingsGoettemoeller, Ryan Joseph 07 August 2023 (has links)
No description available.
|
164 |
Advanced Smart Healthcare Technology and The Applications Based on Human Activity Recognition and Human 3D ReconstructionQian, Xiaoye 26 May 2023 (has links)
No description available.
|
165 |
True shared memory architecture for next-generation multi-GPU systemsMojumder, Md Saiful Arefin 15 May 2021 (has links)
Machine learning (ML) is now omnipresent in all spheres of life. The use of deep neural networks (DNNs) for ML has gained popularity over the past few years. This is because DNNs are capable of efficiently solving complex problems such as image processing, object detection, language processing, etc. To train these DNN workloads, graphics process- ing units (GPUs) have become the most widely used platform. A GPU can support a large number of parallel threads that execute simultaneously to achieve a very high throughput. However, as the sizes of the DNN workloads grow, a single GPU is no longer adequate to provide fast training, and developers resort to using multi-GPU (MGPU) systems that can reduce the training time significantly. Consequently, to keep pace with the growth of DNN applications, GPU vendors are actively developing novel and efficient MGPU systems.
To better understand the challenges associated with designing MGPU systems for DNN workloads, in this thesis, we first present our efforts to understand the behavior of the DNN workloads, in particular, the training of DNN workloads on MGPU systems. Using the DNN workloads as benchmarks, we observe the evolution of MGPU system architecture. Based on our profiling and characterization of DNN workloads on existing high-performance MGPU systems, we identify the computation- and communication- intensiveness of the DNN workloads and the hardware- and software-level inefficiencies present in the existing MGPU systems. We find that the data movement across multiple GPUs and high remote data access cost leading to NUMA effects, data duplication, and inefficient use of GPU memory leading to memory capacity issues, and the complexity in programming MGPUs pose serious limitations in the execution of ever-scaling DNN workloads on MGPU systems.
To overcome the limitations of existing MGPU systems, we propose to unify the main memory of GPUs to design an MGPU system with true shared memory (MGPU-TSM). Our proposed MGPU-TSM system demonstrates a significant performance boost (3.8× for a 4 GPU system) over the best-performing existing MGPU system. This is because MGPU-TSM system eliminates the NUMA effects and the necessity for data duplication. To provide seamless data sharing across multiple GPUs and ease programming of MGPU- TSM, we propose a light-weight coherence protocol called MGCC. MGCC is a timestamp- based protocol that provides both intra- and inter-GPU coherence. We implement a number of hardware features including unified memory controller, request tracker and timestamp storage unit to support MGCC. Using both standard and synthetic stress benchmarks, we evaluate the MGPU-TSM system with MGCC leveraging sequential as well as relaxed consistency. Our evaluation of a 4-GPU system using MGPUSim simulator suggests that our proposed coherent MGPU system achieves up to 3.8× improved performance than current best-performing MGPU system while the stress tests performed using synthetic benchmarks suggests that MGCC leads to up to 46.1% performance overhead.
|
166 |
Implementation of bitmask based code compression algorithm on MSP430 microcontrollerJohn, Lijo January 2012 (has links)
No description available.
|
167 |
Improving Reliability in DNA based Computations with Applications to CryptographyMantha, Anusha January 2012 (has links)
No description available.
|
168 |
Tie Inducement using Closure Analysis in Information NetworksMunimadugu, Hareendra January 2012 (has links)
No description available.
|
169 |
Solar-Powered Wireless Sensor Nodes with Dynamic Power Management for Indoor UseHumphrey, Ethan Charles 01 May 2013 (has links)
No description available.
|
170 |
Ultralow-Power and Robust Embedded Memories for Bioimplantable MicrosystemsHashemian, MaryamSadat 23 August 2013 (has links)
No description available.
|
Page generated in 0.052 seconds