Spelling suggestions: "subject:"[een] FAULT TOLERANCE"" "subject:"[enn] FAULT TOLERANCE""
91 |
[en] TRANSIENT FAULT TOLERANCE BY DISTINCTNESS / [pt] TOLERANCIA A FALHAS TRANSIENTES POR DIVERSIFICAÇÃOALBERTO CLEMENTINO MESQUITA JUNIOR 05 November 2009 (has links)
[pt] Neste trabalho considera-se um sistema de computação geograficamente localizado, destinado ao controle de processos em tempo real. O trabalho tem como objetivo determinar uma política de tolerância a falhas transientes, bem como uma arquitetura de base para o sistema em questão, assim como dispendido para implementá-la.
Abordam-se os pontos chaves necessários a tolerância, os quais são a caracterização das falhas físicas e humanas, as funções de mascaramento, deteção e recuperação após falhas físicas.
O conceito de diversificação é considerado como uma alternativa à deteção e tolerância a falhas humanas (projeto), como também no que diz respeito a capacidade de fornecer uma deteção eficaz de falhas físicas em modo comum, pois proporciona uma independência entre os módulos redundantes quando uma mesma falha os atinge de forma idêntica.
São apresentados uma arquitetura de base e a forma selecionada de colocar em prática a diversificação: a emulação de um dos microprocessadores. / [en] This work consideres a geographically localized computerized control sustem. The aim is to determine a policy of tolerance to transient faults, a well as a basic architecture for the control system. A discussion of the allocated effort to implement it is included here.
They included the characterization of physical and human faults, masking, detection and recovery of physical faults.
The concept of distinctness is considered as an alternative to detection and toleranc3e to human faults and also with respect to the capability to provide an effective detection of physical faults of common mode. This approach provides an independing among redundant modules when the same fault affects them in an identical way.
This work presents a basic architecture which illustrates the use of the concept of distinctness, through the emulation of a microprocessor.
|
92 |
Hardware-Assisted Dependable SystemsKuvaiskii, Dmitrii 22 March 2018 (has links) (PDF)
Unpredictable hardware faults and software bugs lead to application crashes, incorrect computations, unavailability of internet services, data losses, malfunctioning components, and consequently financial losses or even death of people. In particular, faults in microprocessors (CPUs) and memory corruption bugs are among the major unresolved issues of today. CPU faults may result in benign crashes and, more problematically, in silent data corruptions that can lead to catastrophic consequences, silently propagating from component to component and finally shutting down the whole system. Similarly, memory corruption bugs (memory-safety vulnerabilities) may result in a benign application crash but may also be exploited by a malicious hacker to gain control over the system or leak confidential data.
Both these classes of errors are notoriously hard to detect and tolerate. Usual mitigation strategy is to apply ad-hoc local patches: checksums to protect specific computations against hardware faults and bug fixes to protect programs against known vulnerabilities. This strategy is unsatisfactory since it is prone to errors, requires significant manual effort, and protects only against anticipated faults. On the other extreme, Byzantine Fault Tolerance solutions defend against all kinds of hardware and software errors, but are inadequately expensive in terms of resources and performance overhead.
In this thesis, we examine and propose five techniques to protect against hardware CPU faults and software memory-corruption bugs. All these techniques are hardware-assisted: they use recent advancements in CPU designs and modern CPU extensions. Three of these techniques target hardware CPU faults and rely on specific CPU features: ∆-encoding efficiently utilizes instruction-level parallelism of modern CPUs, Elzar re-purposes Intel AVX extensions, and HAFT builds on Intel TSX instructions. The rest two target software bugs: SGXBounds detects vulnerabilities inside Intel SGX enclaves, and “MPX Explained” analyzes the recent Intel MPX extension to protect against buffer overflow bugs.
Our techniques achieve three goals: transparency, practicality, and efficiency. All our systems are implemented as compiler passes which transparently harden unmodified applications against hardware faults and software bugs. They are practical since they rely on commodity CPUs and require no specialized hardware or operating system support. Finally, they are efficient because they use hardware assistance in the form of CPU extensions to lower performance overhead.
|
93 |
Adaptive Fault-Tolerant TeleoperationDede, Mehmet Ismet Can 14 November 2007 (has links)
While the robots gradually become a part of our daily lives, they already play vital roles in many critical operations. Some of these critical tasks include surgeries, battlefield operations, and tasks that take place in hazardous environments or distant locations such as space missions. In most of these tasks, remotely controlled robots are used instead of autonomous robots. This special area of robotics is called teleoperation. Teleoperation systems must be reliable when used in critical tasks; hence, all of the subsystems must be dependable even under a subsystem or communication line failure. These systems are categorized as unilateral or bilateral teleoperation. A special type of bilateral teleoperation is described as force-reflecting teleoperation, which is further investigated as limited- and unlimited-workspace teleoperation. Teleoperation systems configured in this study are tested both in numerical simulations and experiments. A new method, Virtual Rapid Robot Prototyping, is introduced to create system models rapidly and accurately. This method is then extended to configure experimental setups with actual master systems working with system models of the slave robots accompanied with virtual reality screens as well as the actual slaves. Fault-tolerant design and modeling of the master and slave systems are also addressed at different levels to prevent subsystem failure. Teleoperation controllers are designed to compensate for instabilities due to communication time delays. Modifications to the existing controllers are proposed to configure a controller that is reliable in communication line failures. Position/force controllers are also introduced for master and/or slave robots. Later, controller architecture changes are discussed in order to make these controllers dependable even in systems experiencing communication problems. The customary and proposed controllers for teleoperation systems are tested in numerical simulations on single- and multi-DOF teleoperation systems. Experimental studies are then conducted on seven different systems that included limited- and unlimited-workspace teleoperation to verify and improve simulation studies. Experiments of the proposed controllers were successful relative to the customary controllers. Overall, by employing the fault-tolerance features and the proposed controllers, a more reliable teleoperation system is possible to design and configure which allows these systems to be used in a wider range of critical missions.
|
94 |
A Multi-leader Approach to Byzantine Fault Tolerance : Achieving Higher Throughput Using Concurrent ConsensusAbid, Muhammad Zeeshan January 2015 (has links)
Byzantine Fault Tolerant protocols are complicated and hard to implement.Today’s software industry is reluctant to adopt these protocols because of thehigh overhead of message exchange in the agreement phase and the high resourceconsumption necessary to tolerate faults (as 3 f + 1 replicas are required totolerate f faults). Moreover, total ordering of messages is needed by mostclassical protocols to provide strong consistency in both agreement and executionphases. Research has improved throughput of the execution phase by introducingconcurrency using modern multicore infrastructures in recent years. However,improvements to the agreement phase remains an open area. Byzantine Fault Tolerant systems use State Machine Replication to tolerate awide range of faults. The approach uses leader based consensus algorithms for thedeterministic execution of service on all replicas to make sure all correct replicasreach same state. For this purpose, several algorithms have been proposed toprovide total ordering of messages through an elected leader. Usually, a singleleader is considered to be a bottleneck as it cannot provide the desired throughputfor real-time software services. In order to achieve a higher throughput there is aneed for a solution which can execute multiple consensus rounds concurrently. We present a solution that enables multiple consensus rounds in parallel bychoosing multiple leaders. By enabling concurrent consensus, our approach canexecute several requests in parallel. In our approach we incorporate applicationspecific knowledge to split the total order of events into multiple partial orderswhich are causally consistent in order to ensure safety. Furthermore, a dependencycheck is required for every client request before it is assigned to a particular leaderfor agreement. This methodology relies on optimistic prediction of dependenciesto provide higher throughput. We also propose a solution to correct the course ofexecution without rollbacking if dependencies were wrongly predicted. Our evaluation shows that in normal cases this approach can achieve upto 100% higher throughput than conventional approaches for large numbers ofclients. We also show that this approach has the potential to perform better incomplex scenarios
|
95 |
Intersection Collision Avoidance For Autonomous Vehicles Using Petri NetsShankar Kumar, Valli Sanghami 08 1900 (has links)
Indiana University-Purdue University Indianapolis (IUPUI) / Autonomous vehicles currently dominate the automobile field for their impact on
humanity and society. Connected and Automated Vehicles (CAV’s) are vehicles that
use different communication technologies to communicate with other vehicles, infrastructure, the cloud, etc. With the information received from the sensors present, the
vehicles analyze and take necessary steps for smooth, collision-free driving. This
the
sis talks about the cruise control system along with the intersection collision avoidance
system based on Petri net models. It consists of two internal controllers for velocity
and distance control, respectively, and three external ones for collision avoidance.
Fault-tolerant redundant controllers are designed to keep these three controllers in
check. The model is built using a PN toolbox and tested for various scenarios. The
model is also validated, and its distinct properties are analyzed.
|
96 |
Fault tolerance and re-training analysis on neural networksGeorge, Abhinav Kurian 09 July 2019 (has links)
No description available.
|
97 |
Hardware Assertions for Mitigating Single-Event Upsets in FPGAsDumitrescu, Stefan January 2020 (has links)
The memory cells used in modern field programmable gate arrays (FPGAs) are highly
susceptible to single event upsets (SEUs). The typical mitigation strategy in the industry is some form of hardware redundancy in the form of duplication with comparison (DWC) or triple modular redundancy (TMR). While this strategy is highly effective in masking out the effect of faults, it incurs a large hardware cost. In this thesis, we explore a different approach to hardware redundancy. The core idea of our approach is to exploit the conflict-driven clause learning (CDCL) mechanism in modern Boolean satisfiability (SAT) solvers to provide us with
invariants which can be realized as hardware checkers. Furthermore, we develop the algorithms required to select a subset of these invariants to be included as part of a checker circuit. This checker circuit then augments the original FPGA design. We find which look-up table (LUT) memory cells are sensitive to bitflips, then we automatically generate a checker circuit consisting of hardware invariants targeted towards those faults. We aim to reach 100% coverage of sensitizable faults. After extensive experimentation, we conclude that this approach is not competitive with DWC with respect to hardware area. However, we demonstrate that many bitflips will have reduced a detection latency compared to DWC. / Thesis / Master of Applied Science (MASc)
|
98 |
Adaptive Algorithms for Fault Tolerant Re-Routing in Wireless Sensor NetworksGregoire, Michael S 01 January 2007 (has links) (PDF)
No description available.
|
99 |
The G-Network and Its Inherent Fault Tolerant PropertiesHaynes, Teresa, Dutton, Ronald D. 01 January 1990 (has links)
This paper presents the G-network, a new topological design which is a suitable architecture for point-to-point communication and interconnection networks, We show that the G-network has the following desirable characteristics: Efficient routing, small number of links, and fault tolerance. The performance of the G-network is compared to that of the Barrel Shifter and Illiac Mesh networks.
|
100 |
COPING WITH DISCREPANCIES OF THE MANUFACTURED WEIGHTS IN THRESHOLD LOGIC GATESGoparaju, Manoj Kumar 01 December 2009 (has links)
Threshold Logic technology is conceived as the crucial alternate emerging technology to CMOS implementation in nanoelectronic era and the realization of complex functionalities is becoming an increasingly promising approach in the deep sub-micron design era. The gate that is implemented with threshold logic is called a Threshold Logic Gate (TLG). The logic output value of a Threshold Logic Gate (TLG) depends on the weighted sum of its inputs. Manufactured weights in the threshold logic gates (TLGs) may differ from the designed values and significantly affects the fault coverage. A novel fault model for weight defects is proposed. Also an Automatic Test Pattern Generation (ATPG) tool has been implemented that uses the fault model to detect whether the circuit is malfunctioning due to such weight-related defects. A novel design methodology is presented in this work to design complex TLG networks that are tolerant to manufacturing shortcomings. It uses a procedure to identify the optimum fault tolerant design of any given k-input TLG. Extensive research has been done in the development of synthesis methodologies in the past, predominantly greedy. A fault tolerance aware synthesis methodology is proposed.
|
Page generated in 0.0418 seconds