Search results for: Zaid Al-Ars

Items from 21 to 40 out of 45 results

chapter

Calculation of worst-case execution time for multicore processors using deterministic execution

Hamid Mushtaq, Zaid Al-Ars, Koen Bertels

2015 25th International Workshop on Power and Timing Modeling, Optimization and Simulation (PATMOS) > 33 - 39

2015 25th International Workshop on Power and Timing Modeling, Optimization and Simulation (PATMOS)

Safety critical real time systems need to meet strict timing deadlines. We use a model checking based approach to calculate the WCET, where we apply optimizations to reduce the number of states stored by the model checker. Furthermore, we used deterministic shared memory accesses to further reduce calculation time, memory and number of states needed for calculating WCET. By optimizing the model checking...

chapter

Using VLIW softcore processors for image processing applications

Joost Hoozemans, Stephan Wong, Zaid Al-Ars

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS) > 315 - 318

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS)

The ever-increasing complexity of advanced high-resolution image processing applications requires innovative solutions to ensure addressing this issue efficiently and cost effectively. This paper discusses the utilization of reconfigurable general-purpose softcore processors in image processing applications such that hardware resources are efficiently utilized and at the same time ensure high image...

chapter

An FPGA-based systolic array to accelerate the BWA-MEM genomic mapping algorithm

Ernst Joachim Houtgast, Vlad-Mihai Sima, Koen Bertels, Zaid Al-Ars

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS) > 221 - 227

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS)

We present the first accelerated implementation of BWA-MEM, a popular genome sequence alignment algorithm widely used in next generation sequencing genomics pipelines. The Smith-Waterman-like sequence alignment kernel requires a significant portion of overall execution time. We propose and evaluate a number of FPGA-based systolic array architectures, presenting optimizations generally applicable to...

chapter

Accelerating complex brain-model simulations on GPU platforms

H.A. Du Nguyen, Zaid Al-Ars, Georgios Smaragdos, Christos Strydis

2015 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 974 - 979

2015 Design, Automation & Test in Europe Conference & Exhibition (DATE)

The Inferior Olive (IO) in the brain, in conjunction with the cerebellum, is responsible for crucial sensorimotor-integration functions in humans. In this paper, we simulate a computationally challenging IO neuron model consisting of three compartments per neuron in a network arrangement on GPU platforms. Several GPU platforms of the two latest NVIDIA GPU architectures (Fermi, Kepler) have been used...

article

Analysis of RNAseq datasets from a comparative infectious disease zebrafish model using GeneTiles bioinformatics

Wouter J. Veneman, Jan Sonneville, Kees-Jan Kolk, Anita Ordas, more

Immunogenetics > 2015 > 67 > 3 > 135-147

We present a RNA deep sequencing (RNAseq) analysis of a comparison of the transcriptome responses to infection of zebrafish larvae with Staphylococcus epidermidis and Mycobacterium marinum bacteria. We show how our developed GeneTiles software can improve RNAseq analysis approaches by more confidently identifying a large set of markers upon infection with these bacteria. For analysis of RNAseq data...

chapter

Automated Hybrid Interconnect Design for FPGA Accelerators Using Data Communication Profiling

Cuong Pham-Quoc, Zaid Al-Ars, Koen Bertels

2014 IEEE International Parallel & Distributed Processing Symposium Workshops > 151 - 160

2014 IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPSW)

In this paper, we introduce an automated interconnect design strategy to create an efficient custom interconnect for kernels in an FPGA-based accelerator system to accelerate their communication behavior. Our custom interconnect includes an NoC, shared local memory solution or both. Depending on the quantitative communication profiling of the application, the interconnect is built using our proposed...

article

Efficent and highly portable deterministic multithreading (DetLock)

Hamid Mushtaq, Zaid Al-Ars, Koen Bertels

Computing > 2014 > 96 > 12 > 1131-1147

In this paper, we present DetLock, a runtime system to ensure deterministic execution of multithreaded programs running on multicore systems. DetLock does not rely on any hardware support or kernel modification to ensure determinism. For tracking the progress of the threads, logical clocks are used. Unlike previous approaches, which rely on non-portable hardware to update the logical clocks, DetLock...

chapter

Accurate and efficient identification of worst-case execution time for multicore processors: A survey

Hamid Mushtaq, Zaid Al-Ars, Koen Bertels

2013 8th IEEE Design and Test Symposium > 1 - 6

2013 Design and Test Symposium (IDT)

Parallel systems were for a long time confined to high-performance computing. However, with the increasing popularity of multicore processors, parallelization has also become important for other computing domains, such as desktops and embedded systems. Mission-critical embedded software, like that used in avionics and automotive industry, also needs to guarantee real time behavior. For that purpose,...

chapter

Fault tolerance on multicore processors using deterministic multithreading

Hamid Mushtaq, Zaid Al-Ars, Koen Bertels

2013 8th IEEE Design and Test Symposium > 1 - 6

2013 Design and Test Symposium (IDT)

This paper describes a software based fault tolerance approach for multithreaded programs running on multicore processors. Redundant multithreaded processes are used to detect soft errors and recover from them. Our scheme makes sure that the execution of the redundant processes is identical even in the presence of non-determinism due to shared memory accesses. This is done by making sure that the...

chapter

Heterogeneous hardware accelerator architecture for streaming image processing

Cuong Pham-Quoc, Zaid Al-Ars, Koen Bertels

2013 International Conference on Advanced Technologies for Communications (ATC 2013) > 374 - 379

2013 International Conference on Advanced Technologies for Communications (ATC 2013)

This paper proposes a heterogeneous hardware accelerator architecture to support streaming image processing. Each image in a data-set is pre-processed on a host processor and sent to hardware kernels. The host processor and the hardware kernels process a stream of images in parallel. The Convey hybrid computing system is used to develop our proposed architecture. We use the Canny edge detection algorithm...

chapter

Heterogeneous hardware accelerators interconnect: An overview

Cuong Pham-Quoc, Zaid Al-Ars, Koen Bertels

2013 NASA/ESA Conference on Adaptive Hardware and Systems (AHS-2013) > 189 - 197

2013 NASA/ESA Conference on Adaptive Hardware and Systems (AHS)

In this paper, we present an overview of interconnect solutions for hardware accelerator systems. A number of solutions are presented: bus-based, DMA, crossbar, NoC, as well as combinations of these. The paper proposes analytical models to predict the performance of these solutions and implements them in practice. The jpeg decoder application is implemented as our case study in different scenarios...

chapter

Rule-based data communication optimization using quantitative communication profiling

Cuong Pham Quoc, Zaid Al-Ars, Koen Bertels

2012 International Conference on Field-Programmable Technology > 104 - 108

2012 International Conference on Field-Programmable Technology (FPT)

Multicore architectures, especially hardware accelerator systems with heterogeneous processing elements, are being increasingly used due to the increasing processing demand of modern digital systems. However, data communication in multicore architectures is one of the main performance bottle-necks. Therefore, reducing data communication overhead is an important method to improve the speed-up of such...

chapter

DetLock: Portable and Efficient Deterministic Execution for Shared Memory Multicore Systems

Hamid Mushtaq, Zaid Al-Ars, Koen Bertels

2012 SC Companion: High Performance Computing, Networking Storage and Analysis > 721 - 730

2012 SC Companion: High-Performance Computing, Networking, Storage and Analysis (SCC)

Multicore systems are not only hard to program but also hard to test, debug and maintain. This is because the traditional way of accessing shared memory in multithreaded applications is to use lock-based synchronization, which is inherently non-deterministic and can cause a multithreaded application to have many different possible execution paths for the same input. This problem can be avoided however...

chapter

A user-level library for fault tolerance on shared memory multicore systems

Hamid Mushtaq, Zaid Al-Ars, Koen Bertels

2012 IEEE 15th International Symposium on Design and Diagnostics of Electronic Circuits & Systems (DDECS) > 266 - 269

2012 IEEE 15th International Symposium on Design and Diagnostics of Electronic Circuits & Systems (DDECS)

The ever decreasing transistor size has made it possible to integrate multiple cores on a single die. On the downside, this has introduced reliability concerns as smaller transistors are more prone to both transient and permanent faults. However, the abundant extra processing resources of a multicore system can be exploited to provide fault tolerance by using redundant execution. We have designed...

chapter

Survey of fault tolerance techniques for shared memory multicore/multiprocessor systems

Hamid Mushtaq, Zaid Al-Ars, Koen Bertels

2011 IEEE 6th International Design and Test Workshop (IDT) > 12 - 17

2011 IEEE 6th International Design and Test Workshop (IDT)

With the advent of modern nano-scale technology, it has become possible to implement multiple processing cores on a single die. The shrinking transistor sizes however have made reliability a concern for such systems as smaller transistors are more prone to permanent as well as transient faults. To reduce the probability of failures of such systems, online fault tolerance techniques can be applied...

chapter

4-D parity codes for soft error correction in aerospace applications

Muhammad Imran, Zaid Al-Ars, Georgi N. Gaydadjiev

2011 IEEE 6th International Design and Test Workshop (IDT) > 104 - 109

2011 IEEE 6th International Design and Test Workshop (IDT)

In order to reduce the overall system cost, the aerospace industry has been increasingly using commercial off the shelf components in their products. The sensitivity of these products to radiation induced soft errors becomes a major concern. In this paper, we propose a method to increase the reliability of a given off the shelf component by manipulating the software-based error correction algorithm...

chapter

A New Test Paradigm for Semiconductor Memories in the Nano-Era

Said Hamdioui, Venkataraman Krishnaswami, Ijeoma Sandra Irobi, Zaid Al-Ars

2011 Asian Test Symposium > 347 - 352

2011 IEEE 20th Asian Test Symposium (ATS)

Due to rapid and continuous technology scaling, faults in semiconductor memories (and ICs in general) are becoming pervasive and weak rather than strong, a weak fault is a fault that escape the test program (because it does not cause an error/system failure). However, multiple weak faults may cause an error during the application. Components with weak faults which fail at board and system level are...

chapter

Testing for Parasitic Memory Effect in SRAMs

Sandra Irobi, Zaid Al-Ars, Said Hamdioui, Claude Thibeault

2011 Asian Test Symposium > 407 - 412

2011 IEEE 20th Asian Test Symposium (ATS)

Parasitic memory effect can occur due to the impact of parasitic node capacitances and faulty node voltages on the electrical behavior of SRAMs. This memory effect can cause detectable faults to become undetectable using existing industrial tests. This paper analyzes, evaluates and identifies the unique detection conditions for faults in SRAMs. It demonstrates the limitation of existing industrial...

chapter

GPU-accelerated protein sequence alignment

Laiq Hasan, Marijn Kentie, Zaid Al-Ars

2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society > 2442 - 2446

2011 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society

Smith-Waterman (S-W) algorithm is an optimal sequence alignment method and is widely used for genetic databases. This paper presents a Graphics Processing Units (CPUs) accelerated S-W implementation for protein sequence alignment. The paper proposes a new sequence database organization and several optimizations to reduce the number of memory accesses. The new implementation achieves a performance...

chapter

Memory Test Optimization for Parasitic Bit Line Coupling in SRAMs

Sandra Irobi, Zaid Al-Ars, Said Hamdioui

2011 Sixteenth IEEE European Test Symposium > 205

2011 16th IEEE European Test Symposium (ETS)

Memory test optimization can significantly reduce test complexity, while retaining the quality of the test. In the presence of parasitic BL coupling, faults may only be detected by writing all possible coupling backgrounds (CBs) in the neighboring cells of the victim [2], [3]. However, using all possible CBs while testing for every fault consumes enormous test time, which can be significantly reduced,...

Publication date

Set your own date range

Publication type

book (34)
article (11)

INFONA - science communication portal

Search results for: Zaid Al-Ars

Calculation of worst-case execution time for multicore processors using deterministic execution

Using VLIW softcore processors for image processing applications

An FPGA-based systolic array to accelerate the BWA-MEM genomic mapping algorithm

Accelerating complex brain-model simulations on GPU platforms

Analysis of RNAseq datasets from a comparative infectious disease zebrafish model using GeneTiles bioinformatics

Automated Hybrid Interconnect Design for FPGA Accelerators Using Data Communication Profiling

Efficent and highly portable deterministic multithreading (DetLock)

Accurate and efficient identification of worst-case execution time for multicore processors: A survey

Fault tolerance on multicore processors using deterministic multithreading

Heterogeneous hardware accelerator architecture for streaming image processing

Heterogeneous hardware accelerators interconnect: An overview

Rule-based data communication optimization using quantitative communication profiling

DetLock: Portable and Efficient Deterministic Execution for Shared Memory Multicore Systems

A user-level library for fault tolerance on shared memory multicore systems

Survey of fault tolerance techniques for shared memory multicore/multiprocessor systems

4-D parity codes for soft error correction in aerospace applications

A New Test Paradigm for Semiconductor Memories in the Nano-Era

Testing for Parasitic Memory Effect in SRAMs

GPU-accelerated protein sequence alignment

Memory Test Optimization for Parasitic Bit Line Coupling in SRAMs

Filter options

Publication date

Publication type

Keywords

Data set

Journal

INFONA - science communication portal

Search results for: Zaid Al-Ars

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Data set

Journal

Reporting an error / abuse

Sending the report failed

Accessibility options