2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors

chapter

Publisher's Information

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 242

chapter

Parallelized Architecture of Multiple Classifiers for Face Detection

Junguk Cho, B. Benson, S. Mirzaei, R. Kastner

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 75 - 82

This paper presents a parallelized architecture of multiple classifiers for face detection based on the Viola and Jones object detection method. This method makes use of the AdaBoost algorithm which identifies a sequence of Haar classifiers that indicate the presence of a face. We describe the hardware design techniques including image scaling, integral image generation, pipelined processing of classifiers,...

chapter

Accelerating a Virtual Ecology Model with FPGAs

J. Lamoureux, T. Field, W. Luk

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 67 - 74

This paper describes the acceleration of virtual ecology models using field-programmable gate arrays (FPGAs). Our approach targets models generated by the Virtual Ecology Workbench (VEW), an existing tool used by biological oceanographers to build and analyze models of the plankton ecosystem in the upper ocean. Depending on the plankton study and required level of detail, the logic, memory, and data...

chapter

A System Framework for the Design of Embedded Software Targeting Heterogeneous Multi-core SoCs

X. Guerin, F. Petrot

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 153 - 160

Embedded appliance designers rely on heterogeneous multi-core system-on-chips (HMC-SoC) to provide the computing power required by modern applications. Due to the inherent complexity of this kind of platform, the development of specific system architectures is not considered as an option to provide low-level services to an application. Hence, the software is built either from scratch - when the software's...

chapter

Low-Power ASIP Architecture Exploration and Optimization for Reed-Solomon Processing

A. Genser, C. Bachmann, C. Steger, J. Hulzink, et al.

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 177 - 182

The advent of the mobile age has heavily changed the requirements of today's communication devices. Data transmission over interference-prone wireless channels requires additional steps of data processing, such as forward error correction, to ensure reliable communication. In this work we present RS(63,55) Reed-Solomon encoding and decoding algorithms according to the IEEE 802.15.4a standard executed...

chapter

MSA-CUDA: Multiple Sequence Alignment on Graphics Processing Units with CUDA

Yongchao Liu, B. Schmidt, D.L. Maskell

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 121 - 128

Progressive alignment is a widely used approach for computing multiple sequence alignments (MSAs). However, aligning several hundred or thousand sequences with popular progressive alignment tools such as ClustalW requires hours or even days on state-of-the-art workstations. This paper presents MSA-CUDA, a parallel MSA program, which parallelizes all three stages of the ClustalW processing pipeline...

chapter

A 16-context Optically Reconfigurable Gate Array

M. Nakajima, M. Watanabe

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 227 - 230

Demand for fast dynamic reconfiguration has increased since dynamic reconfiguration can accelerate the performance of processors. Dynamic reconfiguration has two important prerequisites: fast reconfiguration and numerous reconfiguration contexts. Unfortunately, fast reconfigurations and numerous contexts share a tradeoff relation on current VLSIs. Therefore, optically reconfigurable gate arrays were...

chapter

Run-Time Detection of Malwares via Dynamic Control-Flow Inspection

Yong-Joon Park, Zhao Zhang, Songqing Chen

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 223 - 226

The conventional approach to detecting malware relies on static scanning for malware signatures. However, it may not work on malware that uses software protection methods such as encryption and packing with run-time decryption and unpacking. We propose a hardware-assisted malware detection system that detects malware during program run time to complement the conventional approach. It searches for...

chapter

Filtering Global History: Power and Performance Efficient Branch Predictor

R. Ayoub, A. Orailoglu

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 203 - 206

In this paper we present an Application Customizable Branch Predictor, ACBP, that delivers efficiency in energy savings and performance without compromising prediction accuracy. The idea of our technique is to filter unnecessary global history information within the global history register to minimize the predictor size while maintaining prediction accuracy. We suggest in this work an efficient algorithm...

chapter

P3FSM: Portable Predictive Pattern Matching Finite State Machine

L. Vespa, M. Mathew, Ning Weng

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 219 - 222

Signature-based network intrusion detection requires fast and reconfigurable pattern matching for deep packet inspection. In our previous work we addressed this problem with a hardware-based pattern matching engine that utilizes a novel state encoding scheme to allow memory-efficient use of Deterministic Finite Automata. In this work we expand on these concepts to create a completely software-based...

chapter

Acceleration of Multiresolution Imaging Algorithms: A Comparative Study

R. Membarth, P. Kutzer, H. Dutta, F. Hannig, et al.

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 211 - 214

In this paper we consider a multiresolution filter and its realization on the Cell BE and GPUs. We present common and specific optimization strategies undertaken for obtaining maximum performance on these architectures, and show how to obtain speedups of 6.57x and 33.24x compared to an optimized OpenMP baseline implementation. Furthermore, we also undertake automated configuration space...

chapter

Mapping Parallel FFT Algorithm onto SmartCell Coarse-Grained Reconfigurable Architecture

Cao Liang, Xinming Huang

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 231 - 234

This paper presents the implementation of a novel parallel FFT algorithm on SmartCell, a coarse-grained reconfigurable architecture that targets data streaming applications. The proposed FFT algorithm balances workload and memory requirements among the computational units, while maintaining optimized data flow at low configuration and communication cost. The proposed parallel FFT...

chapter

Application Specific Transistor Sizing for Low Power Full Adders

F. Eslami, A. Baniasadi, M. Farahani

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 195 - 198

Previously suggested transistor sizing algorithms assume that all input transitions are equally important. In this work we show that this assumption is not accurate, as input transitions occur with different frequencies. We take advantage of this phenomenon and introduce application specific transistor sizing. In application specific transistor sizing, higher priority is given to more frequent transitions...

chapter

A High-Performance Hardware Architecture for Spectral Hash Algorithm

R.C.C. Cheung, C.K. Koc, J.D. Villasenor

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 215 - 218

The spectral hash algorithm is one of the round 1 candidates for the SHA-3 family, and is based on spectral arithmetic over a finite field, involving multidimensional discrete Fourier transformations over a finite field, data dependent permutations, rubic-type rotations, and affine and nonlinear functions. The underlying mathematical structures and operations pose interesting and challenging tasks...

chapter

Efficient Implementation of Carry-Save Adders in FPGAs

M. Ortiz, F. Quiles, J. Hormigo, F.J. Jaime, et al.

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 207 - 210

Most field programmable gate array (FPGA) devices have special fast carry propagation logic intended to optimize addition operations. Redundant adders do not easily fit into this specialized carry logic and, consequently, they require twice the hardware resources of carry-propagate adders, while showing a similar delay for small operands. Therefore, carry-save adders are not usually implemented...

chapter

Cover Art

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > C1

chapter

Title page i

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > i

The following topics are dealt with: application-specific system; architectures; arithmetic; field programmable gate array; media processing; image processing; cryptography; application-specific integrated circuit; computational biology; and application-specific instruction processor.

chapter

Author Index

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 239 - 240

chapter

A Combined Decimal and Binary Floating-Point Multiplier

C. Tsen, S. Gonzalez-Navarro, M. Schulte, B. Hickmann, et al.

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 8 - 15

In this paper, we describe the first hardware design of a combined binary and decimal floating-point multiplier, based on specifications in the IEEE 754-2008 floating-point standard. The multiplier design operates on either (1) 64-bit binary encoded decimal floating-point (DFP) numbers or (2) 64-bit binary floating-point (BFP) numbers. It returns properly rounded results for the rounding modes specified...

chapter

Organizing Committee

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > ix - x

Keywords

COMPUTER ARCHITECTURE (16)
FIELD PROGRAMMABLE GATE ARRAYS (14)
PROGRAM PROCESSORS (14)
HARDWARE (12)
DATA MINING (10)
FPGA (9)
MICROPROCESSOR CHIPS (9)
ADDERS (7)
MEMORY MANAGEMENT (7)
PARALLEL ARCHITECTURES (7)
DECODING (6)
DELAY (6)
DIGITAL ARITHMETIC (6)
IMAGE PROCESSING (6)
MICROPROCESSORS (6)
OPTIMIZATION (6)
PIXEL (6)
ALGORITHM DESIGN AND ANALYSIS (5)
APPLICATION SPECIFIC INTEGRATED CIRCUITS (5)
KERNEL (5)
REGISTERS (5)
ARRAYS (4)
COMPUTER ARITHMETIC (4)
FIELD PROGRAMMABLE GATE ARRAY (4)
RANDOM ACCESS MEMORY (4)
RECONFIGURABLE ARCHITECTURES (4)
THROUGHPUT (4)
BIOLOGICAL SYSTEM MODELING (3)
CACHE STORAGE (3)
COMPUTATIONAL MODELING (3)
CRYPTOGRAPHY (3)
GPU (3)
INSTRUCTION SETS (3)
INTEGRATED CIRCUIT DESIGN (3)
LOGIC DESIGN (3)
MULTIMEDIA COMMUNICATION (3)
PIPELINES (3)
ACCELERATION (2)
ACCURACY (2)
ASIC (2)
BIOLOGY COMPUTING (2)
BRANCH PREDICTION (2)
CLASSIFICATION ALGORITHMS (2)
CMOS INTEGRATED CIRCUITS (2)
COMPUTER GRAPHICS (2)
CONFERENCES (2)
CORRELATION (2)
DIGITAL SIGNATURES (2)
DSP (2)
EMBEDDED SYSTEMS (2)
ENCODING (2)
FACE (2)
FACE DETECTION (2)
FACE RECOGNITION (2)
FAST FOURIER TRANSFORMS (2)
FPGAS (2)
LOGIC GATES (2)
LOW-POWER ELECTRONICS (2)
MULTICORE (2)
MULTIPLYING CIRCUITS (2)
MULTIPROCESSING SYSTEMS (2)
MULTIPROCESSOR INTERCONNECTION NETWORKS (2)
NEURAL NETS (2)
PARALLEL ALGORITHMS (2)
PARALLEL PROCESSING (2)
PATTERN MATCHING (2)
PIPELINE PROCESSING (2)
PROCESSOR SCHEDULING (2)
READ ONLY MEMORY (2)
SOC (2)
SOFTWARE (2)
SYSTEM-ON-CHIP (2)
TABLE LOOKUP (2)
VIDEO CODING (2)
VLIW (2)
16-CONTEXT OPTICALLY RECONFIGURABLE GATE ARRAY (1)
2D SRAM L2 CACHE (1)
3D DRAM (1)
3D DRAM L2 CACHE (1)
3D DRAM STACKING (1)
ACBP (1)
ACCELERATING VIRTUAL ECOLOGY MODEL (1)
ACCELERATION ENGINES (1)
ACCELERATOR-BASED HETEROGENEOUS MULTIPROCESSOR SYSTEM-ON-CHIP DESIGNS (1)
ALTERA STRATIX II FPGA (1)
ALU CIRCUITS (1)
ALU ENERGY CONSUMPTION (1)
ALWAYS NOT-TAKEN STATIC PREDICTION SCHEME (1)
ALWAYS TAKEN STATIC PREDICTION SCHEME (1)
AMD OPTERON 2200 SERIES (1)
APPLICATION CUSTOMIZABLE BRANCH PREDICTOR (1)
APPLICATION CUSTOMIZATION (1)
APPLICATION DOMAIN (1)
APPLICATION GRAPH (1)
APPLICATION PROGRAM INTERFACES (1)
APPLICATION SPECIFIC TRANSISTOR SIZING (1)
APPLICATION-SPECIFIC ARCHITECTURE (1)
APPLICATION-SPECIFIC INSTRUCTION PROCESSOR (1)
APPLICATION-SPECIFIC INSTRUCTIONS (1)
APPLICATION-SPECIFIC INTEGRATED CIRCUIT (1)