Search results

Items from 1 to 20 out of 48 results

article

A Reconfigurable and Scalable FPGA Architecture for Bilateral Filtering

Swapnil Deelip Dabhade, G. N. Rathna, Kunal Narayan Chaudhury

IEEE Transactions on Industrial Electronics > 2018 > 65 > 2 > 1459 - 1469

Bilateral filter is an edge-preserving smoother that has applications in image processing, computer vision, and computational photography. In the past, field-programmable gate array (FPGA) implementations of the filter have been proposed that can achieve high throughput using parallelization and pipelining. An inherent limitation with direct implementations is that their complexity scales as

$O(\omega ^2)$

...

chapter

VLSI Realization of Lanczos Interpolation for a Generic Video Scaling Algorithm

S. Safinaz, A. V. Ravi Kumar

2017 International Conference on Recent Advances in Electronics and Communication Technology (ICRAECT) > 17 - 23

2017 International Conference on Recent Advances in Electronics and Communication Technology (ICRAECT)

Video scaling is a process of resizing a digital frame for preferred view-ability without losing the original content of the video, involving a trade-off between efficiency, smoothness and sharpness. In this research paper, a Generic Algorithm is proposed for enhancement of a motion picture with a given scaling factor without compromising on the picture quality. The proposed algorithm has been verified...

chapter

A comprehensive framework for synthesizing stencil algorithms on FPGAs using OpenCL model

Shuo Wang, Yun Liang

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC)

Iterative stencil algorithms find applications in a wide range of domains. FPGAs have long been adopted for computation acceleration due to its advantages of dedicated hardware design. Hence, FPGAs are a compelling alternative for executing iterative stencil algorithms. However, efficient implementation of iterative stencil algorithms on FPGAs is very challenging due to the data dependencies between...

chapter

Exploring heterogeneous algorithms for accelerating deep convolutional neural networks on FPGAs

Qingcheng Xiao, Yun Liang, Liqiang Lu, Shengen Yan, more

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC)

Convolutional neural network (CNN) finds applications in a variety of computer vision applications ranging from object recognition and detection to scene understanding owing to its exceptional accuracy. There exist different algorithms for CNNs computation. In this paper, we explore conventional convolution algorithm with a faster algorithm using Winograd's minimal filtering theory for efficient FPGA...

article

Efficient FPGA Implementation of OpenCL High-Performance Computing Applications via High-Level Synthesis

Fahad Bin Muslim, Liang Ma, Mehdi Roozmeh, Luciano Lavagno

IEEE Access > 2017 > 5 > 2747 - 2762

FPGA-based accelerators have recently evolved as strong competitors to the traditional GPU-based accelerators in modern high-performance computing systems. They offer both high computational capabilities and considerably lower energy consumption. High-level synthesis (HLS) can be used to overcome the main hurdle in the mainstream usage of the FPGA-based accelerators, i.e., the complexity of their...

chapter

Synthesis and evaluation of SHA-1 algorithm using altera SDK for OpenCL

Ian Janik, Mohammed A. S. Khalid

2016 IEEE 59th International Midwest Symposium on Circuits and Systems (MWSCAS) > 1 - 4

2016 IEEE 59th International Midwest Symposium on Circuits and Systems (MWSCAS)

This paper uses the Altera SDK for OpenCL (AOCL) High-Level Synthesis (HLS) tool to accelerate the computation of the SHA-1 hash function. Using FPGAs to increase throughput of this algorithm has been a popular topic in research. The work done thus far, focuses on HDL based design methodologies. The goal of this paper is to determine if the HLS implementation can compare in terms of speed to the HDL...

chapter

Scalable and modularized RTL compilation of Convolutional Neural Networks onto FPGA

Yufei Ma, Naveen Suda, Yu Cao, Jae-sun Seo, more

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 8

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

Despite its popularity, deploying Convolutional Neural Networks (CNNs) on a portable system is still challenging due to large data volume, intensive computation and frequent memory access. Although previous FPGA acceleration schemes generated by high-level synthesis tools (i.e., HLS, OpenCL) have allowed for fast design optimization, hardware inefficiency still exists when allocating FPGA resources...

chapter

GraVF: A vertex-centric distributed graph processing framework on FPGAs

Nina Engelhardt, Hayden Kwok-Hay So

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

FPGAs are promising platforms to efficiently execute distributed graph algorithms. Unfortunately, they are notoriously hard to program, especially when the problem size and system complexity increases. In this paper, we propose GraVF, a high-level design framework for distributed graph processing on FPGAs. It leverages the vertex-centric paradigm, which is naturally distributed and requires the user...

chapter

On the Automation of High Level Synthesis of Convolutional Neural Networks

Emanuele Del Sozzo, Andrea Solazzo, Antonio Miele, Marco D. Santambrogio

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 217 - 224

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

Convolutional Neural Networks (CNNs) are a particular type of Artificial Neural Networks (ANNs) inspired by cells in the primary visual cortex of animals, and represent the state of the art in image recognition and classification. Nowadays, such supervised learning technique is very popular in Big Data analytics. In this context, due to the huge amount of data to be processed, it is crucial to find...

chapter

Accelerating all-pairs shortest path using a message-passing reconfigurable architecture

Osama G. Attia, Alex Grieve, Kevin R. Townsend, Phillip Jones, more

2015 International Conference on ReConFigurable Computing and FPGAs (ReConFig) > 1 - 6

2015 International Conference on ReConFigurable Computing and FPGAs (ReConFig)

In this paper, we study the design and implementation of a reconfigurable architecture for graph processing algorithms. The architecture uses a message-passing model targeting shared-memory multi-FPGA platforms. We take advantage of our architecture to showcase a parallel implementation of the all-pairs shortest path algorithm (APSP) for unweighted directed graphs. Our APSP implementation adopts a...

chapter

OpenCL-based hardware-software co-design methodology for image processing implementation on heterogeneous FPGA platform

Sayed Omid Ayat, Mohamed Khalil-Hani, Rabia Bakhteri

2015 IEEE International Conference on Control System, Computing and Engineering (ICCSCE) > 36 - 41

2015 IEEE International Conference on Control System, Computing and Engineering (ICCSCE)

Recently, the OpenCL hardware-software co-design methodology has gained traction in realizing effective parallel architecture designs in heterogeneous FPGA platforms. In fact, the portability of OpenCL on hardware ready platforms such as GPU or multicore CPU enables ease of design verification. This is true especially for parallel algorithms before implementing them using cumbersome HDL-based RTL...

chapter

Performance and productivity evaluation of hybrid-threading HLS versus HDLs

Gongyu Wang, Herman Lam, Alan George, Glen Edwards

2015 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2015 IEEE High Performance Extreme Computing Conference (HPEC)

FPGA-based reconfigurable computing is finding its way into a wide range of application areas in which high performance and low power consumption are paramount. However, FPGA-application development using hardware-description languages (HDLs) faces many productivity challenges that limit its wide adoption, including a steep learning curve and lengthy compilation. High-level synthesis (HLS) languages...

chapter

Loop coarsening in C-based High-Level Synthesis

Moritz Schmid, Oliver Reiche, Frank Hannig, Jurgen Teich

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 166 - 173

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

Current tools for High-Level Synthesis (HLS) excel at exploiting Instruction-Level Parallelism (ILP), the support for Data-Level Parallelism (DLP), one of the key advantages of Field Programmable Gate Arrays (FPGAs), is in contrast very limited. This work examines the exploitation of DLP on FPGAs using code generation for C-based HLS of image filters and streaming pipelines, consisting of point and...

chapter

Vector processor for online lithium-ion battery capacity prediction

Yeyong Pang, Shaojun Wang, Yu Peng, Philip H.W. Leong

2015 12th IEEE International Conference on Electronic Measurement & Instruments (ICEMI) > 1 > 254 - 259

2015 12th IEEE International Conference on Electronic Measurement & Instruments (ICEMI)

Battery capacity prediction in aerospace systems is a computationally expensive problem. In this paper, we propose a novel field programmable gate array-based (FPGA) vector processor to reduce latency in this application. This processor architecture is optimized for the kernel recursive least squares (KRLS) algorithm, and used to perform online regression. Pipelining is employed to increase performance...

chapter

Reducing FPGA algorithm area by avoiding redundant computation

Brian Axelrod, Michel Laverne

2015 IEEE International Conference on Robotics and Automation (ICRA) > 503 - 508

2015 IEEE International Conference on Robotics and Automation (ICRA)

We develop a new paradigm for designing fully streaming, area-efficient FPGA implementations of common building blocks for vision algorithm. By focusing on avoiding redundant computation we achieve a reduction of one to two orders of magnitude reduction in design area utilization as compared to previous implementations. We demonstrate that our design works in practice by building five 325 frames per...

chapter

Kernel-centric acceleration of high accuracy stereo-matching

Tobias Kenter, Henning Schmitz, Christian Plessl

2014 International Conference on ReConFigurable Computing and FPGAs (ReConFig14) > 1 - 8

2014 International Conference on ReConFigurable Computing and FPGAs (ReConFig)

Stereo-matching algorithms recently received a lot of attention from the FPGA acceleration community. Presented solutions range from simple, very resource efficient systems with modest matching quality for small embedded systems to sophisticated algorithms with several processing steps, implemented on big FPGAs. In order to achieve high throughput, most implementations strongly focus on pipelining...

chapter

High level performance model based design space exploration for energy-efficient designs on FPGAs

Sanmukh R. Kuppannagari, Yusong Hu, Viktor K. Prasanna

International Green Computing Conference > 1 - 6

2014 International Green Computing Conference (IGCC)

Energy efficiency has become a key performance metric in implementing application on FPGA. Several parameters such as parallelism, data layout, data re-usability etc. determine energy efficiency. Therefore a parameterized architecture is required to analyze the trade-offs and select the most energy-efficient design. However, increasing the number of parameters exponentially increases the number of...

chapter

High level programming framework for FPGAs in the data center

Oren Segal, Martin Margala, Sai Rahul Chalamalasetti, Mitch Wright

2014 24th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2014 24th International Conference on Field Programmable Logic and Applications (FPL)

Heterogeneous computing offers a promising solution for energy efficient computing in the data center. FPGA based heterogeneous computing is an especially promising direction since it allows for the creation of custom hardware solutions for data centric parallel applications. One of the main issues delaying wide spread adoption of FPGAs as main stream high performance computing devices is the difficulty...

chapter

Automated Hybrid Interconnect Design for FPGA Accelerators Using Data Communication Profiling

Cuong Pham-Quoc, Zaid Al-Ars, Koen Bertels

2014 IEEE International Parallel & Distributed Processing Symposium Workshops > 151 - 160

2014 IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPSW)

In this paper, we introduce an automated interconnect design strategy to create an efficient custom interconnect for kernels in an FPGA-based accelerator system to accelerate their communication behavior. Our custom interconnect includes an NoC, shared local memory solution or both. Depending on the quantitative communication profiling of the application, the interconnect is built using our proposed...

chapter

Floorplanning for Partially-Reconfigurable FPGA Systems via Mixed-Integer Linear Programming

Marco Rabozzi, John Lillis, Marco D. Santambrogio

2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines > 186 - 193

2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

The aim of this paper is to show a novel floor planner based on Mixed-Integer Linear Programming (MILP), providing a suitable formulation that makes the problem tractable using state-of-the-art solvers. The proposed method takes into account an accurate description of heterogeneous resources and partially reconfigurable constraints of recent FPGAs. A global optimum can be found for small instances...

Data set:
ieee
Keywords:
FIELD PROGRAMMABLE GATE ARRAYS
ALGORITHM DESIGN AND ANALYSIS
KERNEL

Publication date

Set your own date range

INFONA - science communication portal

Search results

A Reconfigurable and Scalable FPGA Architecture for Bilateral Filtering

VLSI Realization of Lanczos Interpolation for a Generic Video Scaling Algorithm

A comprehensive framework for synthesizing stencil algorithms on FPGAs using OpenCL model

Exploring heterogeneous algorithms for accelerating deep convolutional neural networks on FPGAs

Efficient FPGA Implementation of OpenCL High-Performance Computing Applications via High-Level Synthesis

Synthesis and evaluation of SHA-1 algorithm using altera SDK for OpenCL

Scalable and modularized RTL compilation of Convolutional Neural Networks onto FPGA

GraVF: A vertex-centric distributed graph processing framework on FPGAs

On the Automation of High Level Synthesis of Convolutional Neural Networks

Accelerating all-pairs shortest path using a message-passing reconfigurable architecture

OpenCL-based hardware-software co-design methodology for image processing implementation on heterogeneous FPGA platform

Performance and productivity evaluation of hybrid-threading HLS versus HDLs

Loop coarsening in C-based High-Level Synthesis

Vector processor for online lithium-ion battery capacity prediction

Reducing FPGA algorithm area by avoiding redundant computation

Kernel-centric acceleration of high accuracy stereo-matching

High level performance model based design space exploration for energy-efficient designs on FPGAs

High level programming framework for FPGAs in the data center

Automated Hybrid Interconnect Design for FPGA Accelerators Using Data Communication Profiling

Floorplanning for Partially-Reconfigurable FPGA Systems via Mixed-Integer Linear Programming

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options