2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

book

IEEE

chapter

On-demand fault-tolerant loop processing on massively parallel processor arrays

Alexandru Tanase, Michael Witterauf, Jurgen Teich, Frank Hannig, et al.

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 194 - 201

We present a compilation-based technique for providing on-demand structural redundancy for massively parallel processor arrays. Thereby, application programmers gain the capability to trade throughput for reliability according to application requirements. To protect parallel loop computations against errors, we propose to apply the well-known fault tolerance schemes dual modular redundancy (DMR) and...

chapter

A scheduling and binding heuristic for high-level synthesis of fault-tolerant FPGA applications

Aniruddha Shastri, Greg Stitt, Eduardo Riccio

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 202 - 209

Space computing systems commonly use field-programmable gate arrays to provide fault tolerance by applying triple modular redundancy (TMR) to existing register-transfer-level (RTL) code. Although effective, this approach has a 3× area overhead that can be prohibitive for many designs that often allocate resources before considering effects of redundancy. Although a designer could modify existing RTL...

chapter

Does arithmetic logic dominate data movement? A systematic comparison of energy-efficiency for FFT accelerators

Tung Thanh-Hoang, Amirali Shambayati, Henry Hoffmann, Andrew A. Chien

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 66 - 67

In this paper, we perform a systematic comparison to study the energy cost of varying data formats and data types with respect to arithmetic logic and data movement in accelerator-based heterogeneous systems that include both compute-intensive (FFT) and data-intensive (DLT) accelerators. Our evaluation covers a wide range of design processes (e.g. 32nm bulk-CMOS and projected...

chapter

Application-set driven exploration for custom processor architectures

Mehmet Ali Arslan, Flavius Gruian, Krzysztof Kuchcinski

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 70 - 71

Custom architectures are often adopted as more efficient alternatives to general-purpose processors in terms of performance and power. However, designing such architectures requires experts in both hardware and the application domain. In this paper we propose a method for speeding up design space exploration. Our method, based on Pareto points, identifies sets of solutions in terms of scalar...

chapter

An IEEE 754 double-precision floating-point multiplier for denormalized and normalized floating-point numbers

Ross Thompson, James E. Stine

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 62 - 63

This paper discusses an optimized double-precision floating-point multiplier that can handle both denormalized and normalized IEEE 754 floating-point numbers. The optimizations are discussed and compared against similar implementations; however, the main objective is to remain compliant for denormalized IEEE 754 floating-point numbers while still maintaining high-performance operation for...

chapter

An efficient real-time data pipeline for the CHIME Pathfinder radio telescope X-engine

Andre Recnik, Kevin Bandura, Nolan Denman, Adam D. Hincks, et al.

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 57 - 61

The CHIME Pathfinder is a new interferometric radio telescope that uses a hybrid FPGA/GPU FX correlator. The GPU-based X-engine of this correlator processes over 819 Gb/s of 4+4-bit complex astronomical data from N=256 inputs across a 400MHz radio band. A software framework is presented to manage this real-time data flow, which allows each of 16 processing servers to handle 51.2 Gb/s of astronomical...

chapter

Accelerating data centers with reconfigurable logic

Derek Chiou

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 1

Data centers are a highly competitive environment that demands high performance and energy efficiency and, in many cases, low latency. Custom hardware can provide significant improvements over conventional microprocessors on those metrics. Microsoft has been investigating the use of reconfigurable logic, in the form of field programmable gate arrays, to accelerate its data centers. In this talk, I...

chapter

Power and performance trade-offs for Space Time Adaptive Processing

Nitin A. Gawande, Joseph B. Manzano, Antonino Tumeo, Nathan R. Tallent, et al.

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 41 - 48

Power efficiency - performance relative to power - is one of the most important concerns when designing RADAR processing systems. This paper analyzes power and performance trade-offs for a typical Space Time Adaptive Processing (STAP) application. We study STAP implementations for CUDA and OpenMP on two architectures, Intel Haswell Core i7-4770TE and NVIDIA Kayla with a GK208 GPU. We analyze the power...

chapter

Automatic frame rate-based DVFS of game

Zhinan Cheng, Xi Li, Beilei Sun, Ce Gao, et al.

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 158 - 159

The rapid development of mobile games highlights the power consumption problem on mobile platforms. Most power-saving techniques use a prediction-based dynamic voltage and frequency scaling (DVFS) scheme. However, the prediction can be inaccurate because of the user's frequent interactions while playing. We have observed that frame rate is nearly linear in CPU frequency, but there...

chapter

Mixed-signal implementation of differential decoding using binary message passing algorithms

Glenn Cowan, Kevin Cushon, Warren Gross

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 116 - 119

This paper presents the mixed-signal circuit implementation of reduced complexity algorithms for decoding low-density parity check (LDPC) codes. Based on modified differential decoding using binary message passing (MDD-BMP), binary addition using discrete-time digital circuits is replaced by continuous-time analog-current summation. Potential degradation due to the mismatch between current sources,...

chapter

Accelerating bootstrapping in FHEW using GPUs

Moon Sung Lee, Yongje Lee, Jung Hee Cheon, Yunheung Paek

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 128 - 135

GPU usage is no longer limited to graphics-related jobs, and a wide variety of applications take advantage of the flexibility of GPUs to accelerate computation. Among them, one of the most prominent emerging applications is the fully homomorphic encryption (FHE) scheme, which enables arbitrary computations on encrypted data. Despite much research effort, it cannot be considered...

chapter

GPU-based multifrontal optimizing method in sparse Cholesky factorization

Ran Zheng, Wei Wang, Hai Jin, Song Wu, et al.

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 90 - 97

In many scientific computing applications, sparse Cholesky factorization is used to solve large sparse systems of linear equations in distributed environments. GPU computing offers a new way to solve the problem. However, sparse Cholesky factorization on GPUs can hardly achieve excellent performance because of the structural irregularity of the matrix and low GPU resource utilization. A hybrid CPU-GPU implementation...

chapter

Multi-task support for security-enabled embedded processors

Tedy Thomas, Arman Pouraghily, Kekai Hu, Russell Tessier, et al.

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 136 - 143

Embedded systems require low overhead security approaches to ensure that they are protected from attacks. In this paper, we propose a hardware-based approach to secure the operation of an embedded processor instruction-by-instruction, where deviations from expected program behavior are detected within the execution of an instruction. These security-enabled embedded processors provide effective defenses...

chapter

Hardware acceleration of Private Information Retrieval protocols using GPUs

Mihai Maruseac, Gabriel Ghinita, Ming Ouyang, Razvan Rughinis

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 120 - 127

Private Information Retrieval (PIR) protocols allow users to search for data items stored at an untrusted server without disclosing the search attributes to the server. Several computational PIR protocols provide cryptographic-strength guarantees for the privacy of users, building upon well-known hard mathematical problems, such as factorisation of large integers. Unfortunately, the computationally intensive...

chapter

Stochastic circuit design and performance evaluation of vector quantization

Ran Wang, Jie Han, Bruce Cockburn, Duncan Elliott

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 111 - 115

Vector quantization (VQ) is a general data compression technique that has a scalable implementation complexity and potentially a high compression ratio. In this paper, a novel implementation of VQ using stochastic circuits is proposed and its performance is evaluated. The stochastic and binary designs are compared for the same compression quality and the circuits are synthesized for an industrial...

chapter

Accelerating persistent scatterer pixel selection for InSAR processing

Tahsin Reza, Aaron Zimmer, Parwant Ghuman, Tanuj kr Aasawat, et al.

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 49 - 56

Interferometric Synthetic Aperture Radar (InSAR) is a remote sensing technology used for estimating displacement of the earth's surface. Phase unwrapping is the most important step in InSAR processing and relies on successful selection of points that appear stable across a set of satellite images taken over time. This paper presents a new algorithm for selecting these points, a problem known as persistent...

chapter

Program committee

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 1 - 3

chapter

Message from the ASAP 2015 chairs

Jason Anderson, Hayden Kwok-Hay So, Deshanand Singh

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 1 - 2

We welcome you to the 26th IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP 2015). This year's event takes place in Toronto, Canada on the campus of the University of Toronto. Prior to this year's visit to Toronto, the conference has been held in many places around the globe including Oxford (1986), San Diego (1988), Killarney (1989), Princeton (1990),...
