Distributed computing environments have been prominent for the last decade and a half. Geographically distributed resources are provisioned to user tasks in the distributed computing environment according to their requirements. A number of parameters, such as task performance and fault tolerance, must be taken into account when provisioning the distributed resources. Extensive research...
Scaling trends of reconfigurable hardware (RH) and their design flexibility have proliferated their use in dependability-critical embedded applications. Although their reconfigurability can enable significant fault tolerance, due to the complexity of execution time in their design flow, in-field reconfigurability can be infeasible and thus limit such potential. This need is addressed by developing...
To achieve better performance, computer designers employ advanced techniques that shrink feature sizes, lower supply voltage, and increase clock rates and memory capacity. Meanwhile, modern computers become increasingly vulnerable to soft errors caused by energetic particles, such as alpha particles and neutron strikes. Therefore, fault tolerance evolves into one of the most significant design objectives,...
Fault tolerant (or dependable) computing has always been an exciting research area in the intersection of computer science and engineering and electrical and electronics engineering. During the last two decades the applicability of the methods and tools that the fault tolerance research community produces has expanded to virtually all application domains. The type of fault tolerance methods employed...
To ensure high reliability for large and complex software systems, fault tolerance mechanisms have been widely used in the system design processes. However, less attention has been paid to the analysis of the system reliability improvement concerning fault tolerance. Therefore, they might underestimate the system reliability. To remedy this, we extend architecture based reliability estimation to take...
Our work evaluates the use of software-defined networking (SDN) for reliable communication. Reliable communication has become an important topic in many areas, including energy communication networks or, more generally, automation control networks. Electrical grids are developing into smart grids, which depend heavily on reliability, robustness and optimized resource usage. On the other hand, the...
Evaluation of the dependability of electric supply is left to the skill and expertise of individual engineers. Too frequently, the power system schematics of health care facilities, airports, stadia, skyscrapers and main road tunnels are not designed on the basis of concepts that have been available in the industrial context since the 1960s. The author has found strong resistance in the acceptance of developed design criteria,...
In this paper we present a fault-tolerant mesh-based Network-on-Chip design that helps to tolerate router faults, along with a core recovery mechanism. Spare links are used to provide a connection to horizontal and vertical routers pivoting the failed one. To complement the modified topology, a routing algorithm has been developed that uses minimal and non-minimal paths to communicate between source and...
Failure detection plays a central role in the engineering of distributed systems. Furthermore, many applications have timing constraints and require failure detectors that provide quality of service (QoS) with some quantitative timeliness guarantees. Therefore, they need failure detectors that are fast and accurate. We introduce the Two-Windows Failure Detector (2W-FD), an algorithm able to react...
In this paper, we propose a new defect-tolerant technique that adds redundancy at the transistor level: each transistor is replaced by three transistors placed in a special way such that the reliability of the whole structure increases. The new triple-transistor redundancy technique offers good reliability at lower area and delay overheads compared to most of the popular static fault tolerant...
Majority voted redundancy is increasingly implemented in fault-tolerant design today. In this technique, a voter receives parallel bits from an odd number of digital components and votes for the majority. Reliability improvement is the main focus of every fault tolerant system design. In this paper, we first present a viable alternative to the voting redundancy concept in order to significantly increase...
Moving scientific experiments that are embarrassingly parallel, long-running and data-intensive into a cloud-based execution environment is a desirable, though complex, undertaking for many researchers. The management of such virtual environments is cumbersome and not necessarily within the core skill set of scientists and engineers. We present here Chiminey, a software platform that enables...
A reliable built-in self-test (Reliable BIST) scheme is equipped to tolerate faults that occur in embedded BIST circuits. To realize reliable BIST, the scheme must be able to recover from transient errors in its embedded BIST circuits. In this paper, we propose a self-error-correctable response analyzer (SECRA) for a reliable BIST scheme. Experimental results show that test-reliability of SECRA is superior...
This paper discusses a methodology for calculating the probability that a redundant system fails within a specified time interval after a first fault has occurred. Four different redundant system configurations of a safety-relevant automotive control system are discussed and compared with respect to their reliability characteristics. The four configurations differ in system topologies as well as the intelligence...
The consensus problem is an important topic in modern networked systems. The goal of the consensus protocol is to make each non-faulty processor agree on a common value without influence from faulty processors. However, if the initial value proposed by a processor is shared by only a small number of other processors, this initial value will not become the consensus value. In order to increase the...
Virtual Coordinate (VC) based algorithms do not use physical coordinates for addressing, and thus possess many advantages for large scale sensor networks. They rely on the validity of VCs of nodes. VCs are affected by events such as node failures which are unpredictable and inevitable in WSNs. This degrades the performance of the algorithms and may even reduce overall life of the network. A distributed...
Graph robustness metrics have been used largely to study the behavior of communication networks in the presence of targeted attacks and random failures. Several researchers have proposed new graph metrics to better predict network resilience and survivability against such attacks. Most of these metrics have been compared to a few established graph metrics for evaluating the effectiveness of measuring...
We propose three fault-tolerant methods for a new lightweight block cipher SIMON, which has the potential to be a hardware-efficient security primitive for embedded systems. As a single fault in the encryption (decryption) process can completely change the ciphertext (received plaintext), it is critical to ensure the reliability of encryption and decryption modules. We explore double-modular redundancy...
Fault Tolerant Controls are highly reliable control systems aimed at increasing system reliability and reducing safety hazards. FTCs are designed to accommodate faults at their early stage of development to prevent those minor faults from developing into failures that might jeopardize system stability and availability. FTC design is a topic of active and ongoing research activities with applications...
Permanent Magnet (PM) machines are increasingly being used in safety-critical systems in aerospace and automotive applications. In such applications, reliability and availability are key concerns. Numerous works have been presented in the literature which take into account both system reliability and availability in the event of a single fault within the PM machine. This paper mainly focuses on a turn Short-Circuit...