Search results

Items from 1 to 20 out of 2,200 results

chapter

Maximizing CNN accelerator efficiency through resource partitioning

Yongming Shen, Michael Ferdman, Peter Milder

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) > 535 - 547

Convolutional neural networks (CNNs) are revolutionizing machine learning, but they present significant computational challenges. Recently, many FPGA-based accelerators have been proposed to improve the performance and efficiency of CNNs. Current approaches construct a single processor that computes the CNN layers one at a time; the processor is optimized to maximize the throughput at which the collection...

chapter

Aggressive pipelining of irregular applications on reconfigurable hardware

Zhaoshi Li, Leibo Liu, Yangdong Deng, Shouyi Yin, et al.

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) > 575 - 586

CPU-FPGA heterogeneous platforms offer a promising solution for high-performance and energy-efficient computing systems by providing specialized accelerators with post-silicon reconfigurability. To unleash the power of FPGA, however, the programmability gap has to be filled so that applications specified in high-level programming languages can be efficiently mapped and scheduled on FPGA. The above...

chapter

Understanding and optimizing asynchronous low-precision stochastic gradient descent

Christopher De Sa, Matthew Feldman, Christopher Re, Kunle Olukotun

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) > 561 - 574

Stochastic gradient descent (SGD) is one of the most popular numerical algorithms used in machine learning and other domains. Since this is likely to continue for the foreseeable future, it is important to study techniques that can make it run fast on parallel hardware. In this paper, we provide the first analysis of a technique called BUCKWILD! that uses both asynchronous execution and low-precision...

chapter

Deep neural network accelerator based on FPGA

Thang Viet Huynh

2017 4th NAFOSTED Conference on Information and Computer Science > 254 - 257

In this work, we propose an efficient architecture for the hardware realization of deep neural networks on reconfigurable computing platforms such as FPGAs. The proposed neural network architecture employs only a single physical computing layer to perform the whole computational fabric of fully-connected feedforward deep neural networks with a customizable number of layers, number of neurons per layer...

chapter

Reusability is FIRRTL ground: Hardware construction languages, compiler frameworks, and transformations

Adam Izraelevitz, Jack Koenig, Patrick Li, Richard Lin, et al.

2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) > 209 - 216

Enabled by modern languages and retargetable compilers, software development is in a virtual “Cambrian explosion” driven by a critical mass of powerfully parameterized libraries; but hardware development practices lag far behind. We hypothesize that existing hardware construction languages (HCLs) and novel hardware compiler frameworks (HCFs) can put hardware development on a similar evolutionary path...

chapter

Cyclist: Accelerating hardware development

Jonathan Bachrach, Albert Magyar, Palmer Dabbelt, Patrick Li, et al.

2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) > 1011 - 1018

The end of Dennard scaling has led to an increase in demand for energy-efficient custom hardware accelerators, but current hardware design is slow and laborious, partly because each iteration of the compile-run-debug cycle can take hours or even days with existing simulation and emulation platforms. Cyclist is a new emulation platform designed specifically to shorten the total compile-run-debug cycle...

chapter

An efficient runtime adaptable floating-point Gaussian filtering core

Cuong Pham-Quoc, Tran Ngoc Thinh

2017 4th NAFOSTED Conference on Information and Computer Science > 183 - 188

With the rapidly increasing use of image and video processing in many areas, the requirements for high-performance, high-quality systems have led to the use of reconfigurable computing to accelerate traditional image processing platforms. In this work, an efficient runtime adaptable floating-point Gaussian filtering core is proposed to achieve not only high performance and quality but also kernel...

chapter

Efficient FDWT/IDWT hardware implementation with line-based and dual-scan image memory accesses

Laila Ahmed Saad, Mohd Fadzli Mohd Salleh

TENCON 2017 - 2017 IEEE Region 10 Conference > 2576 - 2581

Image compression is of great importance in multimedia system applications because it drastically reduces the bandwidth for transmission and the memory for storage. Image compression algorithms, like JPEG2000, utilize the Forward Discrete Wavelet Transform (FDWT) and Inverse Discrete Wavelet Transform (IDWT). The main problems faced by researchers in the hardware implementation of the FDWT/IDWT are storage memory,...

chapter

Hardware and software infrastructure to implement many-core systems in modern FPGAs

Felipe T. Bortolon, Fernando G. Moraes

2017 30th Symposium on Integrated Circuits and Systems Design (SBCCI) > 79 - 83

Many-core systems are increasingly popular in embedded systems due to their high performance and flexibility to execute different workloads. These many-core systems provide a rich processing fabric but lack the flexibility to accelerate critical operations with dedicated hardware cores. Modern Field-Programmable Gate Arrays (FPGAs) have evolved into more than reconfigurable devices, providing embedded hard-core...

chapter

A SVM optimization tool and FPGA system architecture applied to NMPC

Carlos Eduardo Santos, Renato Coral Sampaio, Helon Ayala, Leandro dos S. Coelho, et al.

2017 30th Symposium on Integrated Circuits and Systems Design (SBCCI) > 96 - 102

Support Vector Machines (SVMs) are supervised learning models from the machine learning field whose performance depends strongly on their hyperparameters. The Bio-inspired Optimization Tool for SVM (BIOTS) uses a Multi-Objective Particle Swarm Algorithm (MOPSO) to tune the hyperparameters of SVMs. In this work, BIOTS is proposed along with a custom hardware design generator (VHDL) that implements...

chapter

Efficient hardware implementation of morphological reconstruction based on sequential reconstruction algorithm

Oscar Anacona-Mosquera, Gustavo Vinhal, Renato C. Sampaio, George Teodoro, et al.

2017 30th Symposium on Integrated Circuits and Systems Design (SBCCI) > 162 - 167

This work presents a hardware implementation of the morphological reconstruction algorithm for biomedical image analysis. The morphological reconstruction algorithm is based on the Sequential Reconstruction (SR). In this case, a hardware architecture has been developed and implemented by mapping the SR algorithm onto an Altera Cyclone IV E FPGA-based platform, including a NIOS II processor. The developed...

chapter

Real-time multi-scale pedestrian detection for driver assistance systems

Maryam Hemmati, Smail Niar, Morteza Biglari-Abhari, Stevan Berber

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

Pedestrian detection is one of the most challenging and vital tasks of driver assistance systems (DAS). Among several algorithms developed for human detection, histogram of oriented gradients (HOG) followed by support vector machine (SVM) has shown the most promising results. This paper presents a hardware accelerator for real-time pedestrian detection at different scales to fulfill the real-time...

chapter

An embedded system for acoustic pattern recognition

Constanze Tschope, Frank Duckhorn, Christian Richter, Peter Bluthgen, et al.

2017 IEEE SENSORS > 1 - 3

We present a miniaturized universal hardware module for acoustic pattern recognition in various types of multichannel sensor signals. The module implements configurable signal analysis (signal transforms, filter banks, statistical transforms) and a GMM-HMM recognizer. The main hardware components are an XC7A75T FPGA performing almost all the computations, a TMS320C6746 digital signal processor organizing...

chapter

Efficient scalable hardware architecture for highly performant encoded neural networks

Hugues Wouafo, Cyrille Chavet, Philippe Coussy, Robin Danilo

2017 IEEE International Workshop on Signal Processing Systems (SiPS) > 1 - 6

Different neural network models have been proposed to design efficient associative memories like Hopfield networks, Boltzmann machines or Cogent confabulation. Compared to the classical models, Encoded Neural Network (ENN) is a recently introduced formalism with a proven higher efficiency. This model has been improved through different contributions like Clone-based ENN (CbNNs) or Sparse ENNs (S-ENNs)...

chapter

Reduced complexity FPGA implementation for UF-OFDM frequency domain transmitter

Said Medjkouh, Jeremy Nadal, Charbel Abdel Nour, Amer Baghdadi

2017 IEEE International Workshop on Signal Processing Systems (SiPS) > 1 - 6

Universal Filtered Orthogonal Frequency Division Multiplexing (UF-OFDM) is considered one of the main waveform candidates to overcome the challenges facing the next generation of mobile communication systems. Due to its spectral properties, it can support relaxed synchronization, low-latency communications and flexible time transmission interval. Nevertheless, the available recent literature addresses...

chapter

System-on-chip implementation of embedded real-time simulator for modular multilevel converters

Mattia Ricco, Marius Gheorghe, Laszlo Mathe, Remus Teodorescu

2017 IEEE Energy Conversion Congress and Exposition (ECCE) > 1500 - 1505

The aim of this paper is to present the implementation of an Embedded Real-Time Simulator (ERTS) for Modular Multilevel Converters (MMCs), using a low-cost System-on-Chip (SoC) platform. In order to achieve new functionalities such as sensor-less control, monitoring, diagnostic and fault detection, the MMC plant model can be implemented along with the controller. In MMC applications, the implementation...

chapter

FPGA-Based Collaborative Hardware Sorting Unit for Embedded Data Processing System

Zou Long, Zhenrong Zhang

2017 10th International Conference on Intelligent Computation Technology and Automation (ICICTA) > 260 - 264

In recent years there has been a growing interest in the Internet of Things, Big Data and Mobile Internet. With the rapid growth of the amount of data in the embedded environment, it is hard for a traditional embedded processor to satisfy the requirements of big data processing. Sorting is one of the fundamental operations in data processing and is also frequently used for search, filter, feature analysis...

chapter

FPGA-based system for heart rate calculation based on PPG signal

Karim Meddah, Malika Kedir-Talha, Hadjer Zairi

2017 5th International Conference on Electrical Engineering - Boumerdes (ICEE-B) > 1 - 5

Recent technological advances have allowed the world to construct several wearable products that can capture and process the human body's bio-signals. The PPG signal has become one of the leading contenders in heart rate monitoring due to its prominent features, flexibility, effectiveness and low cost. This paper presents a novel system for PPG heart rate calculation based on FPGA, using the Pan and Tompkins as...

chapter

FPGA real-time implementation of a vector control scheme for a PMSM used to propel an electric scooter

Ioana-Cornelia Gros, Daniel Fodorean, Ignat-Calin Marginean

2017 5th International Symposium on Electrical and Electronics Engineering (ISEEE) > 1 - 5

This paper reports on the Field-Programmable Gate Array (FPGA) real-time implementation of a vector control scheme, by means of hardware-in-the-loop simulation. This approach will be applied to a PMSM used to propel an electric scooter, preceding its integration in a more complex experimental setup. The emerging need for powerful, flexible system-on-a-chip (SoC) platforms for developing complex drive...

chapter

Implementation of the LZMA compression algorithm on FPGA

Xia Zhao, Bing Li

2017 International Conference on Electron Devices and Solid-State Circuits (EDSSC) > 1 - 2

Data compression is a necessary technology in the age of big data. Compared with software compression techniques, hardware compression techniques can improve speed and reduce power consumption. LZMA is a lossless compression technology, and its hardware implementation has broad application prospects. This paper proposes a novel high-performance implementation of the LZMA compression algorithm...

Data set: ieee
Keywords: HARDWARE, FPGA
Publication type: book

Content availability

Available (2,164)
None (36)

Keywords

FIELD PROGRAMMABLE GATE ARRAYS (1,876)
COMPUTER ARCHITECTURE (511)
SOFTWARE (383)
ALGORITHM DESIGN AND ANALYSIS (333)
CLOCKS (257)
RANDOM ACCESS MEMORY (230)
REGISTERS (194)
FIELD PROGRAMMABLE GATE ARRAY (156)
EMBEDDED SYSTEMS (144)
CRYPTOGRAPHY (136)
RECONFIGURABLE ARCHITECTURES (136)
THROUGHPUT (133)
MATHEMATICAL MODEL (128)
HARDWARE DESCRIPTION LANGUAGES (121)
SYSTEM-ON-CHIP (117)
VHDL (117)
DIGITAL SIGNAL PROCESSING (114)
REAL-TIME SYSTEMS (110)
TABLE LOOKUP (107)
IP NETWORKS (101)
PROGRAM PROCESSORS (100)
LOGIC DESIGN (98)
KERNEL (97)
IMAGE PROCESSING (95)
HARDWARE DESIGN LANGUAGES (94)
PIXEL (92)
PROTOCOLS (87)
COMPUTATIONAL MODELING (86)
SIGNAL PROCESSING ALGORITHMS (86)
PIPELINES (84)
REAL TIME SYSTEMS (81)
PARALLEL PROCESSING (79)
DATA MINING (78)
LOGIC GATES (78)
ARRAYS (77)
DECODING (76)
COMPUTERS (75)
SYNCHRONIZATION (74)
ENCRYPTION (73)
GENERATORS (73)
HARDWARE IMPLEMENTATION (73)
MICROPROCESSOR CHIPS (73)
ADDERS (70)
ACCELERATION (67)
OPTIMIZATION (66)
STREAMING MEDIA (65)
HARDWARE-SOFTWARE CODESIGN (64)
RADIATION DETECTORS (63)
SYSTEM-ON-A-CHIP (63)
ASIC (59)
SOFTWARE ALGORITHMS (59)
FEATURE EXTRACTION (58)
RECONFIGURABLE COMPUTING (58)
ENCODING (56)
PIPELINE PROCESSING (54)
SWITCHES (54)
MATLAB (52)
MULTIPROCESSING SYSTEMS (52)
NEURONS (52)
SECURITY (51)
CAMERAS (49)
DIGITAL SIGNAL PROCESSING CHIPS (49)
EQUATIONS (49)
IMAGE EDGE DETECTION (49)
MEMORY MANAGEMENT (49)
APPLICATION SPECIFIC INTEGRATED CIRCUITS (48)
HARDWARE ARCHITECTURE (47)
STANDARDS (47)
DELAY (45)
MONITORING (45)
TRANSFORMS (45)
ARTIFICIAL NEURAL NETWORKS (44)
DSP (44)
EMBEDDED SYSTEM (44)
IMAGE COLOR ANALYSIS (44)
INTEGRATED CIRCUIT MODELING (44)
SOPC (44)
ENGINES (42)
CONTROL SYSTEMS (41)
SIGNAL PROCESSING (41)
VLSI (41)
DETECTORS (40)
LIBRARIES (40)
HARDWARE ACCELERATION (39)
LINUX (39)
WIRELESS COMMUNICATION (39)
COMPLEXITY THEORY (38)
PERFORMANCE EVALUATION (38)
PROCESS CONTROL (38)
AES (37)
IMAGE CODING (37)
ELLIPTIC CURVE CRYPTOGRAPHY (36)
HARDWARE DESIGN (36)
MULTIPLEXING (36)
PARALLEL ARCHITECTURES (36)
POWER DEMAND (36)
RECEIVERS (36)
COPROCESSORS (35)

INFONA - science communication portal
