Search results

Items from 1 to 19 out of 19 results

article

A Resource-Limited Hardware Accelerator for Convolutional Neural Networks in Embedded Vision Applications

Shayan Moini, Bijan Alizadeh, Mohammad Emad, Reza Ebrahimpour

IEEE Transactions on Circuits and Systems II: Express Briefs > 2017 > 64 > 10 > 1217 - 1221

In this brief, we introduce an architecture for accelerating convolution stages in convolutional neural networks (CNNs) implemented in embedded vision systems. The purpose of the architecture is to exploit the inherent parallelism in CNNs to reduce the required bandwidth, resource usage, and power consumption of highly computationally complex convolution operations as required by real-time embedded...

chapter

Hardware architecture for 2D Gaussian filtering of HD images on resource constrained platforms

Carmine Cappetta, Gian Domenico Licciardo, Luigi Di Benedetto

2017 International Symposium on Signals, Circuits and Systems (ISSCS) > 1 - 4

A bi-dimensional filter for high accuracy image processing is implemented by using a novel partitioning method. The method is based on a number theory theorem, which makes it possible to reduce the complexity of the operation to that of an adder chain and also to reduce the number of coefficients stored in memory, improving the memory organization. To show the advantage of this method, we implemented a Floating Point...

chapter

Digital architecture for real-time CNN-based face detection for video processing

Smrity Bhattarai, Arjuna Madanayake, Renato J. Cintra, Stefan Duffner, et al.

2017 Cognitive Communications for Aerospace Applications Workshop (CCAA) > 1 - 6

In this paper, we propose a hardware computing architecture for face detection that classifies an image as a face or non-face. The computing architecture is first designed, modeled and tested in MATLAB Simulink using Xilinx block set and was later tested using a Virtex-6 FPGA ML605 Evaluation Kit. The system uses learned filters which were previously extracted by training on a set of face and non-face...

chapter

End-to-end scalable FPGA accelerator for deep residual networks

Yufei Ma, Minkyu Kim, Yu Cao, Sarma Vrudhula, et al.

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

This work presents an efficient hardware accelerator design of deep residual learning algorithms, which have shown superior image recognition accuracy (>90% top-5 accuracy on ImageNet database). Two key objectives of the acceleration strategy are to (1) maximize resource utilization and minimize data movements, and (2) employ scalable and reusable computing primitives to optimize physical design...

chapter

Scalable and modularized RTL compilation of Convolutional Neural Networks onto FPGA

Yufei Ma, Naveen Suda, Yu Cao, Jae-sun Seo, et al.

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 8

Despite its popularity, deploying Convolutional Neural Networks (CNNs) on a portable system is still challenging due to large data volume, intensive computation and frequent memory access. Although previous FPGA acceleration schemes generated by high-level synthesis tools (i.e., HLS, OpenCL) have allowed for fast design optimization, hardware inefficiency still exists when allocating FPGA resources...

chapter

An optimized FPGA implementation based on scale invariant feature transform feature points detection

Yue Gu, Xiujie Qu, Yue Sun, Liwen Gao, et al.

2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD) > 734 - 739

2016 12th International Conference on Natural Computation and 13th Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)

Because the SIFT (Scale Invariant Feature Transform) algorithm is computationally intensive and highly parallelizable, we propose an optimized FPGA implementation to accelerate it in hardware. In this method, we first simplify the process of filtering the image and generating Gaussian pyramids by selecting appropriate parameters and hardware structure,...

chapter

OpenCL-based hardware-software co-design methodology for image processing implementation on heterogeneous FPGA platform

Sayed Omid Ayat, Mohamed Khalil-Hani, Rabia Bakhteri

2015 IEEE International Conference on Control System, Computing and Engineering (ICCSCE) > 36 - 41

Recently, the OpenCL hardware-software co-design methodology has gained traction in realizing effective parallel architecture designs in heterogeneous FPGA platforms. In fact, the portability of OpenCL on hardware ready platforms such as GPU or multicore CPU enables ease of design verification. This is true especially for parallel algorithms before implementing them using cumbersome HDL-based RTL...

chapter

Reducing FPGA algorithm area by avoiding redundant computation

Brian Axelrod, Michel Laverne

2015 IEEE International Conference on Robotics and Automation (ICRA) > 503 - 508

We develop a new paradigm for designing fully streaming, area-efficient FPGA implementations of common building blocks for vision algorithms. By focusing on avoiding redundant computation, we achieve a reduction of one to two orders of magnitude in design area utilization compared to previous implementations. We demonstrate that our design works in practice by building five 325 frames per...

chapter

Image convolution processing: A GPU versus FPGA comparison

Lucas M. Russo, Emerson C. Pedrino, Edilson Kato, Valentin Obac Roda

2012 VIII Southern Conference on Programmable Logic > 1 - 6

2012 VIII Southern Conference on Programmable Logic (SPL)

Convolution is one of the most important operators used in image processing. With the constant need to increase the performance in high-end applications and the rise and popularity of parallel architectures, such as GPUs and the ones implemented in FPGAs, comes the necessity to compare these architectures in order to determine which of them performs better and in what scenario. In this article, convolution...

chapter

Feature extraction of Digital Aerial Images by FPGA based implementation of edge detection algorithms

R Harinarayan, R Pannerselvam, M M Ali, D K Tripathi

2011 International Conference on Emerging Trends in Electrical and Computer Technology > 631 - 635

2011 International Conference on Emerging Trends in Electrical and Computer Technology (ICETECT 2011)

Edges are among the most fundamental and significant features of an image. Edge detection is a classical research topic in computer vision and image processing, and it is the first step of image analysis and understanding. With the continuous improvement of remote sensing imagery, especially the appearance of digital aerial images, edge detection is a necessary step to extract information...

chapter

AER spike-processing filter simulator: Implementation of an AER simulator based on cellular automata

Manuel Rivas-Perez, A. Linares-Barranco, A. Jimenez-Fernandez, A. Civit, et al.

Proceedings of the International Conference on Signal Processing and Multimedia Applications > 1 - 6

2011 International Conference on Signal Processing and Multimedia Applications (SIGMAP)

Spike-based systems are neuro-inspired circuit implementations traditionally used for sensory systems or sensor signal processing. Address-Event-Representation (AER) is a neuromorphic communication protocol for transferring asynchronous events between VLSI spike-based chips. These neuro-inspired implementations allow developing complex, multilayer, multichip neuromorphic systems and have been used...

chapter

Image Edge Detection Based on FPGA

Zhengyang Guo, Wenbo Xu, Zhilei Chai

2010 Ninth International Symposium on Distributed Computing and Applications to Business, Engineering and Science > 169 - 171

2010 Ninth International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES 2010)

A Field Programmable Gate Array (FPGA) is an effective device for real-time parallel processing of vast amounts of video data because of its fine-grained reconfigurable structure. This paper presents a parallel processing structure for a Sobel edge detection enhancement algorithm, which can produce the result for one pixel in only one clock period. The algorithm is designed with a...

chapter

On the AER convolution processors for FPGA

A Linares-Barranco, R Paz-Vicente, F Gómez-Rodriguez, A Jiménez, et al.

Proceedings of 2010 IEEE International Symposium on Circuits and Systems > 4237 - 4240

2010 IEEE International Symposium on Circuits and Systems. ISCAS 2010

Image convolution operations in digital computer systems are usually very expensive in terms of resource consumption (processor resources and processing time) for an efficient real-time application. In these scenarios the visual information is divided into frames, and each one has to be completely processed before the next frame arrives in order to guarantee real-time operation. A spike-based philosophy...

chapter

Hardware accelerated convolutional neural networks for synthetic vision systems

C Farabet, B Martini, P Akselrod, S Talay, et al.

Proceedings of 2010 IEEE International Symposium on Circuits and Systems > 257 - 260

2010 IEEE International Symposium on Circuits and Systems. ISCAS 2010

In this paper we present a scalable hardware architecture to implement large-scale convolutional neural networks and state-of-the-art multi-layered artificial vision systems. This system is fully digital and is a modular vision engine with the goal of performing real-time detection, recognition and segmentation of mega-pixel images. We present a performance comparison between a software, FPGA and...

chapter

A Massively Parallel Coprocessor for Convolutional Neural Networks

M. Sankaradas, V. Jakkula, S. Cadambi, S. Chakradhar, et al.

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 53 - 60

We present a massively parallel coprocessor for accelerating Convolutional Neural Networks (CNNs), a class of important machine learning algorithms. The coprocessor functional units, consisting of parallel 2D convolution primitives and programmable units performing sub-sampling and non-linear functions specific to CNNs, implement a "meta-operator" to which a CNN may be compiled to. The coprocessor...

chapter

Neighborhood dependent approach for low power 2D convolution in video processing applications

Hau Ngo, V. Asari

2009 4th IEEE Conference on Industrial Electronics and Applications > 656 - 661

Window-based operations such as two dimensional (2-D) convolution operations are commonly used in image and video processing applications. In this paper, a new design technique that considers the neighboring pixels within the window to detect and eliminate redundant or unnecessary computations for power reduction is presented. A novel on-chip detection technique is developed for the proposed neighborhood...

chapter

Design of a Logarithmic Domain 2-D Convolver for Low Power Video Processing Applications

H.T. Ngo, V.K. Asari

2009 Sixth International Conference on Information Technology: New Generations > 1280 - 1285

2009 Sixth International Conference on Information Technology: New Generations (ITNG 2009)

In this paper, the design and implementation of an efficient, low power log-based 2D convolution unit (convolver) for video processing applications is proposed. The design of the proposed convolver utilizes an approximation method with an error correction technique to transform data to the logarithmic domain for reduced power consumption. A novel design and implementation of a modular approach for leading bit...

chapter

Hardware accelerated aerial image simulation by FPGA

H. Jamleh, C.C.-P. Chen

2009 5th Southern Conference on Programmable Logic (SPL) > 39 - 44

This paper describes a hardware implementation of aerial image simulation in lithography using an FPGA. At present, such simulators are implemented mainly using software-based techniques on dedicated computers. The Hopkins partially coherent imaging equation is decomposed numerically using singular value decomposition (SVD). The data input is a function consisting of rectangles as Manhattan...

chapter

Implementing Gabor Filter for fingerprint recognition using Verilog HDL

A.H.A. Razak, R.H. Taharim

2009 5th International Colloquium on Signal Processing & Its Applications > 423 - 427

2009 5th International Colloquium on Signal Processing & its Applications (CSPA 2009)

This paper presents an implementation of a Gabor filter for fingerprint recognition using Verilog HDL. This work demonstrates the application of the Gabor filter technique to enhance the fingerprint image. The incoming signal, in the form of image pixels, is filtered (convolved) by the Gabor filter to define the ridge and valley regions of the fingerprint. This is done with the application of a real time...

Filter options

Data set: ieee
Keywords: KERNEL, CONVOLUTION, FPGA

Publication type

book (18)
article (1)

Keywords

FIELD PROGRAMMABLE GATE ARRAYS (16)
IMAGE PROCESSING (9)
HARDWARE (7)
PIXEL (6)
ACCELERATION (3)
ALGORITHM DESIGN AND ANALYSIS (3)
COMPUTER VISION (3)
CONVOLUTIONAL NEURAL NETWORK (3)
FIELD PROGRAMMABLE GATE ARRAY (3)
RANDOM ACCESS MEMORY (3)
ADDRESS-EVENT-REPRESENTATION (2)
CLOCKS (2)
COMPUTER ARCHITECTURE (2)
CONVOLUTIONAL NEURAL NETWORKS (2)
CONVOLVERS (2)
EDGE DETECTION (2)
FEATURE EXTRACTION (2)
FILTERING (2)
GPU (2)
HARDWARE ACCELERATION (2)
IMAGE EDGE DETECTION (2)
IMAGE FILTERING (2)
LOW POWER DESIGN (2)
MEMORY MANAGEMENT (2)
NEURAL NETS (2)
NEURAL NETWORKS (2)
NEURONS (2)
PARALLEL PROCESSING (2)
PROGRAM PROCESSORS (2)
VIDEO SIGNAL PROCESSING (2)
2-D CONVOLVER (1)
2D CONVOLUTION (1)
2D FILTER (1)
ACCELERATE (1)
ACCELERATOR (1)
ACCURACY (1)
ADDERS (1)
AER CONVOLUTION PROCESSORS (1)
APPLICATION SOFTWARE (1)
APPLICATION SPECIFIC INTEGRATED CIRCUITS (1)
APPROXIMATION METHOD (1)
APPROXIMATION METHODS (1)
APPROXIMATION THEORY (1)
ARTIFICIAL NEURAL NETWORKS (1)
ASIC (1)
BANDWIDTH (1)
BIOMETRIC (1)
BUFFER STORAGE (1)
CELLULAR AUTOMATA (1)
CNN (1)
CNNS (1)
COMPUTATIONAL MODELING (1)
CONVOLUTION KERNELS (1)
CONVOLUTION MATRIX (1)
CONVOLUTION OPERATION (1)
COPROCESSORS (1)
CUDA (1)
DATA ACCESS PATTERN (1)
DATA BANDWIDTH (1)
DATA MINING (1)
DATA PARTITIONING METHODOLOGY (1)
DDR2 MEMORY BANK (1)
DEEP LEARNING (1)
DEEP RESIDUAL NETWORKS (1)
DELAY LINES (1)
DIGITAL AERIAL IMAGES (1)
DIGITAL DESIGN (1)
DIGITAL FILTER (1)
DIGITAL FILTERS (1)
DIGITAL SIGNAL PROCESSORS (1)
DISTRIBUTED ARITHMETIC (1)
DISTRIBUTED OFF-CHIP MEMORY BANK (1)
DOGS (1)
EDGE DETECTION ALGORITHMS (1)
EMBEDDED SYSTEMS (1)
ENGINES (1)
ERROR CORRECTION TECHNIQUE (1)
FACE (1)
FACE DETECTION (1)
FACE RECOGNITION (1)
FINGERPRINT (1)
FINGERPRINT IDENTIFICATION (1)
FINGERPRINT IMAGE ENHANCEMENT (1)
FINGERPRINT RECOGNITION (1)
FINITE IMPULSE RESPONSE FILTERS (1)
FIRST GRADIENT BASED OPERATORS (1)
FPGA IMPLEMENTATION (1)
GABOR COEFFICIENT (1)
GABOR FILTER (1)
GABOR FILTERS (1)
GATING TECHNIQUE (1)
GRAPHICS PROCESSING UNIT (1)
GRAPHICS PROCESSING UNITS (1)
HARDWARE ACCELERATED AERIAL IMAGE SIMULATION (1)
HARDWARE ACCELERATOR (1)
HARDWARE DESCRIPTION LANGUAGES (1)
HARRIS CORNER (1)

INFONA - science communication portal
