2009 17th IEEE Symposium on Field Programmable Custom Computing Machines (FCCM 2009)

Items from 1 to 20 out of 56 results

chapter

Multi-Core Architecture on FPGA for Large Dictionary String Matching

Qingbo Wang, V.K. Prasanna

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 96 - 103

FPGA has long been considered an attractive platform for high performance implementations of string matching. However, as the size of pattern dictionaries continues to grow, such large dictionaries can be stored in external DRAM only. The increased memory latency and limited bandwidth pose new challenges to FPGA-based designs, and the lack of spatial and temporal locality in data access also leads...

chapter

Memory-Efficient Pipelined Architecture for Large-Scale String Matching

Y.-H.E. Yang, V.K. Prasanna

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 104 - 111

We propose a pipelined field-merge architecture for memory-efficient and high-throughput large-scale string matching (LSSM). Our proposed architecture partitions the (8-bit) character input into several bit-field inputs of smaller (usually 2-bit) widths. Each bit-field input is matched in a partial state machine (PSM) pipeline constructed from the respective bit-field patterns. The matching results...

chapter

Application Specific Customization and Scalability of Soft Multiprocessors

D. Unnikrishnan, Jia Zhao, R. Tessier

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 123 - 130

Although soft microprocessors are widely used in FPGAs, limited work has been performed regarding how to automatically and efficiently generate soft multiprocessors. In this paper, an automated parallel compilation environment for multiple soft processors which incorporates parallel compilation and inter-processor communication structures is described. A total of eight previously-developed parallel...

chapter

CAAD BLASTP: NCBI BLASTP Accelerated with FPGA-Based Accelerated Pre-Filtering

J.H. Park, Yunfei Qiu, M.C. Herbordt

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 81 - 87

NCBI BLAST has become the de facto standard in bioinformatic approximate string matching and so its acceleration is of fundamental importance. The problem is that it uses complex heuristics which make it difficult to simultaneously achieve both substantial speed-up and exact agreement with the original output. Our approach is to prefilter the database. To make this work we have developed a novel heuristic...

chapter

FPGA Floating Point Datapath Compiler

M. Langhammer, T. VanCourt

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 259 - 262

This paper will describe the architecture of a compiler which will convert an untimed C description of a floating point expression into a synthesizable datapath optimized for FPGAs. The concept of floating point fused datapath synthesis will be reviewed, along with the expected functional efficiency gains. The dataflow graph structure used by the compiler will be detailed, followed by the description...

chapter

Non-Preconditioned Conjugate Gradient on Cell and FPGA Based Hybrid Supercomputer Nodes

D. DuBois, A. DuBois, T. Boorman, C. Connor

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 201 - 208

This work presents a detailed implementation of a double precision, non-preconditioned, conjugate gradient algorithm on a Roadrunner heterogeneous supercomputer node. These nodes utilize the Cell Broadband Engine Architecture™ in conjunction with x86 Opteron™ processors from AMD. We implement a common conjugate gradient algorithm, on a variety of systems, to compare and contrast performance...

chapter

Scalable High Throughput and Power Efficient IP-Lookup on FPGA

Hoang Le, V.K. Prasanna

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 167 - 174

Most high-speed Internet Protocol (IP) lookup implementations use tree traversal and pipelining. Due to the available on-chip memory and the number of I/O pins of Field Programmable Gate Arrays (FPGAs), state-of-the-art designs cannot support the current largest routing table (consisting of 257 K prefixes in backbone routers). We propose a novel scalable high-throughput, low-power SRAM-based linear...

chapter

Architectural Comparison of Instruments for Transaction Level Monitoring of FPGA-Based Packet Processing Systems

P.E. McKechnie, M. Blott, W.A. Vanderbauwhede

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 175 - 182

The fine-grained parallelism inherent in FPGAs has encouraged their use in packet processing systems. To facilitate debugging and performance evaluation, designers require on-chip monitors that provide abstractions of low-level details and a system-level perspective. In this paper, we present five architectures that permit transaction-based communication-centric monitoring of packet processing systems...

chapter

Optical Flow on the Ambric Massively Parallel Processor Array (MPPA)

B. Hutchings, B. Nelson, S. West, R. Curtis

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 141 - 148

The Ambric Massively Parallel Processor Array (MPPA) is a device that contains 336 32-bit RISC processors and is appropriate for embedded systems due to its relatively small physical and power footprint. Optical flow is a computationally demanding and highly parallelizable image-processing algorithm with applications in embedded systems such as robotics and autonomous vehicles. An optical flow algorithm...

chapter

FPGA-based Monte Carlo Computation of Light Absorption for Photodynamic Cancer Therapy

J. Luu, K. Redmond, W.C.Y. Lo, P. Chow, et al.

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 157 - 164

Photodynamic therapy (PDT) is a method of treating cancer that combines light and light-sensitive drugs to selectively destroy cancerous tumours without harming the healthy tissue. The success of PDT depends on the accurate computation of light dose distribution. Monte Carlo (MC) simulations can provide an accurate solution for light dose distribution, but have high computation time that prevents...

chapter

Copyright Page

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > iv

chapter

An FPGA Implementation for Solving Least Square Problem

Depeng Yang, G.D. Peterson, Husheng Li, Junqing Sun

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 303 - 306

This paper proposes a high performance least square solver on FPGAs using the Cholesky decomposition method. Our design can be realized by iteratively adopting a single triangular linear equation solver for modified Cholesky decomposition and forward/backward substitutions. Good performance is achieved by optimizing the Cholesky factorization algorithms, reordering the computation and thus alleviating...

chapter

HighEnd Reconfigurable Systems for Fast Windows' Password Cracking

K. Theoharoulis, C. Manifavas, I. Papaefstathiou

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 287 - 290

One of the most efficient methods for cracking passwords is based on “rainbow tables”; these lookup tables offer an almost optimal time-memory tradeoff in recovering the plaintext password from a password hash generated by a cryptographic hash function. In this paper, we demonstrate the first known system, implemented in a state-of-the-art reconfigurable device...

chapter

A Packet Generator on the NetFPGA Platform

G.A. Covington, G. Gibb, J.W. Lockwood, N. McKeown

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 235 - 238

A packet generator and network traffic capture system has been implemented on the NetFPGA. The NetFPGA is an open networking platform accelerator that enables rapid development of hardware-accelerated packet processing applications. The packet generator application allows Internet packets to be transmitted at line rate on up to four gigabit Ethernet ports simultaneously. Data transmitted is specified...

chapter

Efficient Mapping of Hardware Tasks on Reconfigurable Computers Using Libraries of Architecture Variants

Miaoqing Huang, V.K. Narayana, T. El-Ghazawi

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 247 - 250

Scheduling and partitioning of task graphs on reconfigurable hardware needs to be carefully carried out in order to achieve the best possible performance. In this paper, we demonstrate that a significant improvement to the total execution time is possible by incorporating a library of hardware task implementations, which contains multiple architectural variants for each hardware task reflecting tradeoffs...

chapter

Accelerating the Gauss-Seidel Power Flow Solver on a High Performance Reconfigurable Computer

Jong-Ho Byun, A. Ravindran, A. Mukherjee, B. Joshi, et al.

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 227 - 230

The computationally intensive power flow problem determines the voltage magnitude and phase angle at each bus in a power system for hundreds of thousands of buses under balanced three-phase steady-state conditions. We report an FPGA acceleration of the Gauss-Seidel based power flow solver employed in the transmission module of the GridLAB-D power distribution simulator and analysis tool. The prototype...

chapter

AIREN: A Novel Integration of On-Chip and Off-Chip FPGA Networks

A.G. Schmidt, W.V. Kritikos, R.R. Sharma, R. Sass

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 271 - 274

The Reconfigurable Computing Cluster Project at the University of North Carolina at Charlotte is investigating the feasibility of using FPGAs as compute nodes to scale to PetaFLOP computing. To date the Spirit cluster, consisting of 64 FPGAs, has been assembled for the initial analysis. One important question is how to efficiently communicate among compute cores on-chip as well as between nodes. Tight...

chapter

FPGA Implementation of a Single-Precision Floating-Point Multiply-Accumulator with Single-Cycle Accumulation

A. Paidimarri, A. Cevrero, P. Brisk, P. Ienne

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 267 - 270

This paper describes an FPGA implementation of a single-precision floating-point multiply-accumulator (FPMAC) that supports single-cycle accumulation while maintaining high clock frequencies. A non-traditional internal representation reduces the cost of mantissa alignment within the accumulator. The FPMAC is evaluated on an Altera Stratix III FPGA.

chapter

Cover Art

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > C1

chapter

Benchmarking Reconfigurable Architectures in the Mobile Domain

P. Jamieson, T. Becker, W. Luk, P.Y.K. Cheung, et al.

2009 17th IEEE Symposium on Field Programmable Custom Computing Machines > 131 - 138

In this paper, we introduce the GroundHog 2009 benchmarking suite, which can be used to evaluate the power consumption of reconfigurable technology implementing applications targeting the mobile computing domain. This benchmark suite includes seven designs; one design targets fine-grained FPGA fabrics, and six designs are specified at a high level, which allows them to target a range of reconfigurable technologies...

