When designing a Multi-Processor System-on-Chip (MPSoC), a very large range of design alternatives arises from a huge space of possible design options and component choices. The literature proposes numerous Design-Space Exploration (DSE) approaches that mainly focus on cost optimization. In this paper, we present a DSE approach which focuses on the reliability of the whole design. This approach is based...
Because of hardware faults, processors in large-scale software-intensive systems frequently fail to operate properly. Most traditional fault-tolerant methods do not distinguish the type of hardware failure. In view of this, we propose a self-repairing software architecture for predictable hardware faults. By introducing computational reflection, the software architecture...
These two issues are addressed in this paper: 1) the formal definitions of the concepts relevant to program faults, and 2) the comparison and classification of program fault-tolerant abilities. We first analyze the subtle differences among these basic concepts: faults, errors and failures, and give their formal definitions using the state-based theory of program behavior; we then propose...
This paper discusses SEE effects in an architecture based on commercial-off-the-shelf multicore processors for consolidating mixed-criticality applications in single board computers for space applications. This paper builds on previously proposed system-level architectures for mixed-criticality applications, which are summarized here for convenience together with their previous validation results. The...
The current developments of the software defined networking (SDN) paradigm provide a flexible architecture for network control and management, at the cost of deploying new hardware to replace the existing routing infrastructure. Further, the centralized controller architecture of SDN makes the network prone to a single point of failure and creates a performance bottleneck. To avoid these issues and to support...
Due to the advent of active safety features and automated driving capabilities, the complexity of embedded computing systems within automobiles continues to increase. Such advanced driver assistance systems (ADAS) are inherently safety-critical and must tolerate failures in any subsystem. However, fault tolerance in safety-critical systems has traditionally been supported by hardware replication, which...
The problem of software fault tolerance is described. The fault-tolerance problem is considered in terms of hardware faults and software errors, and a classification of software errors is proposed. The authors describe the computational process as a tree-like directed graph. Errors are introduced into the realisation of the algorithm at the programming stage, causing a “real” algorithm to form instead of its “theoretical”...
This paper presents a theoretical comparison of different existing data error detection techniques. The techniques are compared by fault coverage, memory overhead and performance overhead. For this comparison, ten different data error detection techniques are taken into account. In general, the best error detection technique always has the highest fault coverage with low performance and memory overhead...
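The abstract above weighs data error detection techniques by fault coverage against performance and memory overhead. As a minimal illustrative sketch (not taken from the paper) of one classic point in that trade-off space, duplication with comparison recomputes each result on duplicated data and flags any mismatch; the function name and error handling here are assumptions for illustration:

```python
def duplicated_add(a, b):
    """Duplication-with-comparison sketch: compute the result twice
    and compare at a checking point.

    This buys high fault coverage at roughly 2x performance and
    memory overhead: a transient fault that corrupts only one copy
    is detected when the two results disagree.
    """
    r1 = a + b  # primary computation
    r2 = a + b  # shadow computation on the duplicated operands
    if r1 != r2:  # comparison point
        raise RuntimeError("data error detected: duplicated results diverge")
    return r1
```

In hardened compilers this duplication is inserted automatically at the instruction level; the sketch only shows where the overhead and the coverage come from.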
This work focuses on the development of models, methods, and tools to increase the fault tolerance of high-performance computing systems. The described models and methods are based on automatic diagnostics of the basic software and hardware components of these systems, automatic fault localization and correction, and automatic HPC-system reconfiguration mechanisms. The originality...
The newest integrated circuit fabrication technologies allow billions of transistors to be arranged on a single chip, enabling the implementation of complex parallel systems, which require a highly scalable parallel communication architecture, such as a Network-on-Chip (NoC). These technologies are very close to physical limits, increasing faults both in manufacturing and at runtime. Thus, it is essential to provide...
Systems are expected to evolve during their service life in order to cope with changes of various natures, ranging from fluctuations in available resources to additional features requested by users. For dependable embedded systems, the challenge is even greater, as evolution must not impair dependability attributes. Resilient computing implies maintaining dependability properties when facing changes...
Mobile applications are a part of human life, ranging from simple tasks such as e-mail to critical operations such as security surveillance. Owing to the variety of software and hardware used in mobile devices, failures of mobile applications are unavoidable. Such failures pose a serious threat to the success of mobile software. Also, those failures can result in a great...
The advent of software-based fault tolerance presents a rare opportunity to create a new paradigm for support equipment architecture. This test system must be capable of servicing the development, integration, and test of hardware and software, allowing developers remote access to the units under test (UUT) throughout the integration and test process. Using mainly low-cost commercial off-the-shelf...
As the fault frequency is increasing with the component count in modern and future computer systems, resilience becomes increasingly critical. Existing work on anomaly detection and fault prediction enables failure avoidance techniques to circumvent fault effects proactively. In addition, traditional fault tolerance techniques can be applied to handle faults reactively. Different types of faults may...
As hardware components are expected to become ever more unreliable due to technology scaling, hardware errors have become unavoidable. Dependable systems that rely on correct functionality often use redundancy to detect such hardware faults during operation. However, to design cost-efficient reliable systems, it is crucial to effectively exploit the available redundancy. Thus, researchers have...
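As a minimal sketch (not from the paper above) of how redundancy is exploited to detect and mask runtime faults, a triple-modular-redundancy (TMR) voter compares three redundant results; the function name and the tie-breaking policy are assumptions for illustration:

```python
def tmr_vote(results):
    """Majority vote over three redundant results (TMR sketch).

    With three copies, a single corrupted result is both detected and
    masked by the majority; if all three replicas disagree, the fault
    is detected but cannot be corrected by voting alone.
    """
    a, b, c = results
    if a == b or a == c:
        return a  # a agrees with at least one replica: majority value
    if b == c:
        return b  # a was the corrupted minority replica
    raise RuntimeError("uncorrectable: all three replicas disagree")
```

Duplication (two copies) would detect the same single fault at lower cost but could not mask it, which is exactly the coverage-versus-overhead trade-off such abstracts refer to.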
For the High Performance Linpack (HPL) benchmark at the coming Exascale and beyond, silent errors such as bit flips in memory are expected to become inevitable. However, since bit-flip errors are difficult to detect and locate, their impact on the numerical correctness of HPL has not been evaluated thoroughly and quantitatively, even though systems at Exascale are especially susceptible. In...
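One standard way to catch the silent bit flips the abstract above describes is a checksum kept alongside the data, in the style of algorithm-based fault tolerance (ABFT). A minimal sketch, with hypothetical function names and an integer checksum chosen purely for illustration:

```python
def checksum(vec):
    """Checksum over a data block, computed while the data is known good."""
    return sum(vec)

def has_silent_error(vec, stored_checksum):
    """ABFT-style check: recompute the checksum and compare it with the
    value stored before the data was exposed to faults. A bit flip in
    any single element changes the sum and is therefore flagged."""
    return checksum(vec) != stored_checksum

# Usage: simulate a radiation-induced single-bit flip and detect it.
data = [10, 20, 30, 40]
ck = checksum(data)   # checksum taken while the block is known good
data[2] ^= 1 << 3     # flip bit 3 of one element: 30 becomes 22
corrupted = has_silent_error(data, ck)
```

Full ABFT schemes for linear algebra extend this idea to row and column checksums that survive the factorization itself, so errors can be located and corrected, not merely detected.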
Due to voltage and structure shrinking, the influence of radiation on a circuit's operation increases, resulting in future hardware designs exhibiting much higher rates of soft errors. Software developers have to cope with these effects to ensure functional safety. However, software-based hardware fault tolerance is a holistic property that is tricky to achieve in practice, potentially impaired by...
Virtualization technology has been widely used in today's cloud computing datacenters. With virtualization, each physical machine in a datacenter can be logically divided into several virtual machines, on which different types of software services can be hosted. However, many factors may decrease the availability of the whole system. For example, a failed physical machine automatically...
To achieve better performance, computer designers employ advanced techniques that shrink feature sizes, lower supply voltage, and increase clock rates and memory capacity, but meanwhile modern computers become increasingly vulnerable to soft errors caused by energetic particles, such as alpha particles and neutron strikes. Therefore, fault tolerance evolves into one of the most significant design objectives,...
In safety-critical environments it is no longer sufficient to rely on legacy methodologies. Correctness should be built in all the way through the process. This paper presents a toolchain which allows theorem prover output to be interfaced to fault-tolerant FPGA circuitry. We show a shallow embedding of a lambda calculus executing on a Xilinx platform with the assistance of a choice of fault-tolerance...