Search results

Items from 1 to 20 out of 476 results

chapter

An asynchronous loop structure based on the click element

Yuxuan Liu, Hong Chen, Dengjie Wang, Anping He

2017 International Conference on Electron Devices and Solid-State Circuits (EDSSC) > 1 - 2

2017 International Conference on Electron Devices and Solid-State Circuits (EDSSC)

We present a click-element-based asynchronous loop structure for control path of asynchronous micro-control unit (MCU). The loop, which has one-stage control circuit instead of cascade circuits, can be triggered by only one trigger signal and stopped by a preset number. To verify the loop structure, we design an asynchronous MCU simulated in FPGA. The experimental results show that the MCU can be...

chapter

Fast RNS implementation of elliptic curve point multiplication in GF(p) with selected base pairs

Yifeng Mo, Shuguo Li

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 6

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

Implementing elliptic curve point multiplication (ECPM) based on residue number system (RNS) can efficiently use FPGA resources. In this paper, we propose a modular reduction method, where a kind of RNS pair is selected to achieve fast reduction. Our reduction method mainly needs several parallel additions while the reduction unit of previous designs require two multiplications which are computed...

chapter

An implementation method of poisson image editing on FPGA

Ryouhei Maeda, Tsutomu Maruyama

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 6

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

In this paper, we describe an FPGA system for the real-time processing of Poisson image Editing. Poisson Image Editing is a powerful method to overlay an image on another image seamlessly. In this method, however, a simple equation is repeatedly applied to each pixel, and this repetition makes its computational complexity very high. In our system, a very deep pipeline is used to apply the equation...

chapter

Transparent memory encryption and authentication

Mario Werner, Thomas Unterluggauer, Robert Schilling, David Schaffenrath, more

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 6

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

Security features of modern (SoC) FPGAs permit to protect the confidentiality of hard- and software IP when the devices are powered off as well as to validate the authenticity of IP when being loaded at startup. However, these approaches are insufficient since attackers with physical access can also perform attacks during runtime, demanding for additional security measures. In particular, RAM used...

chapter

Broken-Karatsuba multiplication and its application to Montgomery modular multiplication

Jinnan Ding, Shuguo Li

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

Large number multiplication has always been an essential operation in cryptographic algorithms. In this paper, we propose Broken-Karatsuba multiplication by applying the non-least-positive form to represent large numbers and dig the parallelism hidden in conventional Karatsuba multiplication. Further, we modify Montgomery modular multiplication algorithm with Broken-Karatsuba multiplication to make...

chapter

Parallel RRT∗ architecture design for motion planning

Size Xiao, Neil Bergmann, Adam Postula

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

A motion planning algorithm aims to calculate one obstacle-free trajectory which meets the dynamical constraints of a vehicle and leads the vehicle from the start state to the target state. RRT∗ (RRT star) is one sampling-based algorithm which is widely used in many applications because of its speed in quickly finding a trajectory. In contrast with basic RRT (Rapidly-exploring Random Trees) algorithm,...

chapter

Rapid implementation of a partially reconfigurable video system with PYNQ

Brad Hutchings, Mike Wirthlin

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 8

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

Undergraduate students rapidly implement a partially-reconfigured, real-time video processor on the Xilinx PYNQ board. The video processor performs various real-time operations including Sobel edge detection, embossing, averaging, an interactive Pong game, etc., using a separate partially-reconfigurable bit-stream for each distinct function. Selection of image-processing functions is accomplished...

chapter

Flexible FPGA design for FDTD using OpenCL

Tobias Kenter, Jens Forstner, Christian Plessl

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 7

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

Compared to classical HDL designs, generating FPGA with high-level synthesis from an OpenCL specification promises easier exploration of different design alternatives and, through ready-to-use infrastructure and common abstractions for host and memory interfaces, easier portability between different FPGA families. In this work, we evaluate the extent of this promise. To this end, we present a parameterized...

chapter

Optimizing streaming stencil time-step designs via FPGA floorplanning

Marco Rabozzi, Giuseppe Natale, Biagio Festa, Antonio Miele, more

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

Stencil computations represent a highly recurrent class of algorithms in various high performance computing scenarios. The Streaming Stencil Time-step (SST) architecture is a recent implementation of stencil computations on Field Programmable Gate Array (FPGA). In this paper, we propose an automated framework for SST-based architectures capable of achieving the maximum performance level for a given...

chapter

TAIGA: A new RISC-V soft-processor framework enabling high performance CPU architectural features

Eric Matthews, Lesley Shannon

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

Recently, there has been an increased focus on integration of reconfigurable fabric with modern processors. However, existing soft-processors are optimized to leverage older FPGA fabrics, focus primarily on resource minimization and have fixed-pipeline designs that limit the scope for tightly integrated hardware accelerators. In this work, we present Taiga: a RISC-V, 32-bit, soft-processor architecture...

chapter

Area-optimized montgomery multiplication on IGLOO 2 FPGAs

Pedro Maat C. Massolino, Lejla Batina, Ricardo Chaves, Nele Mentens

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

This paper presents the first area-optimized Montgomery modular multiplication module on low-power reconfigurable IGLOO® 2 FPGAs, from Microsemi. In order to obtain a good response time with few resources, the FPGA pipelined Math blocks and the embedded memory blocks are fully leveraged. As a result, 256-bit modular multiplications can be done in 2.33 μs, at a cost of 505 LUT4 cells, 257 Flip Flops,...

chapter

OpenCL for HPC with FPGAs: Case study in molecular electrostatics

Chen Yang, Jiayi Sheng, Rushi Patel, Ahmed Sanaullah, more

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 8

2017 IEEE High Performance Extreme Computing Conference (HPEC)

FPGAs have emerged as a cost-effective accelerator alternative in clouds and clusters. Programmability remains a challenge, however, with OpenCL being generally recognized as a likely part of the solution. In this work we seek to advance the use of OpenCL for HPC on FPGAs in two ways. The first is by examining a core HPC application, Molecular Dynamics. The second is by examining a fundamental design...

chapter

Auto-SI: An adaptive reconfigurable processor with run-time loop detection and acceleration

Tanja Harbaum, Christoph Schade, Marvin Damschen, Carsten Tradowsky, more

2017 30th IEEE International System-on-Chip Conference (SOCC) > 153 - 158

2017 30th IEEE International System-on-Chip Conference (SOCC)

Modern computer architectures have an ever-increasing demand for performance, but are constrained in power dissipation and chip area. To tackle these demands, architectures with application-specific accelerators have gained traction in research and industry. While this is a very promising direction, hard-wired accelerators fall short when too many applications need to be supported or flexibility is...

chapter

Implementation of application specific instruction-set processor for the artificial neural network acceleration using LISA ADL

Damjan Rakanovic, Rastislav Struharik

2017 IEEE East-West Design & Test Symposium (EWDTS) > 1 - 6

2017 IEEE East-West Design & Test Symposium (EWDTS)

In fields like embedded vision, where algorithms are computationally expensive, hardware accelerators play a major role in high throughput applications. These accelerators could be implemented as hardwired IP cores or Application Specific Instruction-set Processors (ASIPs). While hardwired solutions often provide the best possible performance, they are less flexible then ASIP implementation. In this...

chapter

Packet Classification with Limited Memory Resources

Michal Kekely, Jan Korenek

2017 Euromicro Conference on Digital System Design (DSD) > 179 - 183

2017 Euromicro Conference on Digital System Design (DSD)

Network security and monitoring devices use packet classification to match packet header fields in a set of rules. Many hardware architectures have been designed to accelerate packet classification and achieve wire-speed throughput for 100 Gbps networks. The architectures are designed for high throughput even for the shortest packets. However, FPGA SoC and Intel Xeon with FPGA have limited resources...

chapter

Optimizing compiler for a specialized real-time floating point softcore processor

Michael Kirchhoff, Natalia Kaptsova, Detlef Streitpferdt, Wolfgang Fengler

2017 8th Annual Industrial Automation and Electromechanical Engineering Conference (IEMECON) > 181 - 188

2017 8th Annual Industrial Automation and Electromechanical Engineering Conference (IEMECON)

This paper presents the authors' research work in the fields of embedded real-time softcore systems on FPGAs and specialized optimizing assembly language compiler. With this softcore processor, we are targeting a highly specialized field of applications that require a large floating point precision and other unique characteristics. Therefore, a specialized optimizing assembly language compiler is...

chapter

On How to Design Dataflow FPGA-Based Accelerators for Convolutional Neural Networks

Giuseppe Natale, Marco Bacis, Marco Domenico Santambrogio

2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) > 639 - 644

2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)

In the past few years we have experienced an extremely rapid growth of modern applications based on deep learning algorithms such as Convolutional Neural Network (CNN), and consequently, an intensification of academic and industrial research focused on the optimization of their imple- mentation. Among the different alternatives that have been ex- plored, FPGAs seems to be one of the most attractive,...

chapter

Efficient FPGA Implementation of the SHA-3 Hash Function

Magnus Sundal, Ricardo Chaves

2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) > 86 - 91

2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)

In this paper, three different approaches are considered for FPGA based implementations of the SHA-3 hash functions. While the performance of proposed unfolded and pipelined structures just match the state of the art, the dependencies of the structures which are folded slice-wise allow to further improve the efficiency of the existing state of the art. By solving the intra-round dependencies caused...

chapter

Real-time scene flow on COTS embedded systems by coarse-grained software pipeline

Long Chen, Mingyue Cui, Kai Huang, Zhe Xuanyuan

2017 IEEE Intelligent Vehicles Symposium (IV) > 1164 - 1169

2017 IEEE Intelligent Vehicles Symposium (IV)

Scene flow is a key function of stereo-based environment perception system for mobile robotics and autonomous vehicle. Due to the heavy computing requirement and the limited computing resource, parallelized and embedded algorithms become quite important for the application of the mobile robotics. This paper develops a cross-platform embedded scene flow algorithm by using a coarse-grained software...

chapter

Architecture of a synchronized low-latency network node targeted to research and education

Christian Liss, Marian Ulbricht, Umar Farooq Zia, Hartmut Muller

2017 IEEE 18th International Conference on High Performance Switching and Routing (HPSR) > 1 - 7

2017 IEEE 18th International Conference on High Performance Switching and Routing (HPSR)

As line-speeds and packet losses are sufficient well for most applications, reduction of latency and jitter are gaining in importance. We introduce and discuss the architecture of a novel networking device that provides low-latency switching and routing. It integrates an up-to-date FPGA with a standard ×86-64 processor and targets Time-Sensitive Networking (TSN) and machine-to-machine communication...

Keywords:
FIELD PROGRAMMABLE GATE ARRAYS
PIPELINES

Publication date

Set your own date range

Content availability

Available (472)
None (4)

Keywords

FPGA (192)
HARDWARE (181)
COMPUTER ARCHITECTURE (120)
CLOCKS (105)
REGISTERS (93)
RANDOM ACCESS MEMORY (87)
PIPELINE PROCESSING (83)
ALGORITHM DESIGN AND ANALYSIS (72)
THROUGHPUT (71)
TABLE LOOKUP (45)
PARALLEL PROCESSING (44)
ADDERS (37)
SOFTWARE (37)
PROGRAM PROCESSORS (33)
LOGIC DESIGN (31)
PIPELINE (30)
HARDWARE DESCRIPTION LANGUAGES (29)
IP NETWORKS (27)
ARRAYS (26)
COMPUTATIONAL MODELING (26)
DIGITAL SIGNAL PROCESSING (26)
FIELD PROGRAMMABLE GATE ARRAY (26)
DELAY (25)
LOGIC GATES (25)
MICROPROCESSOR CHIPS (24)
RECONFIGURABLE ARCHITECTURES (24)
SYSTEM-ON-CHIP (24)
ACCELERATION (23)
HARDWARE DESIGN LANGUAGES (23)
KERNEL (23)
MATHEMATICAL MODEL (23)
DATA MINING (22)
DECODING (22)
EQUATIONS (22)
PARALLEL ARCHITECTURES (22)
EMBEDDED SYSTEMS (21)
MEMORY MANAGEMENT (21)
FAST FOURIER TRANSFORMS (20)
FLOATING POINT ARITHMETIC (19)
OPTIMIZATION (19)
SYNCHRONIZATION (19)
BANDWIDTH (18)
PIXEL (18)
ENGINES (17)
COMPUTERS (16)
INSTRUCTION SETS (16)
MULTIPROCESSING SYSTEMS (16)
CRYPTOGRAPHY (15)
GENERATORS (15)
REAL TIME SYSTEMS (15)
ROUTING (15)
SIGNAL PROCESSING ALGORITHMS (15)
SWITCHES (15)
TIMING (15)
FPGA IMPLEMENTATION (14)
IMAGE PROCESSING (14)
MULTIPLEXING (14)
PIPELINE ARITHMETIC (14)
VHDL (14)
APPLICATION SPECIFIC INTEGRATED CIRCUITS (13)
ENCODING (13)
FFT (13)
DELAYS (12)
PROTOCOLS (12)
REAL-TIME SYSTEMS (12)
RECONFIGURABLE COMPUTING (12)
SYSTEM-ON-A-CHIP (12)
VECTORS (12)
VIDEO CODING (12)
SIGNAL PROCESSING (11)
STREAMING MEDIA (11)
ASYNCHRONOUS CIRCUITS (10)
CAMERAS (10)
COMPLEXITY THEORY (10)
COPROCESSORS (10)
FORCE (10)
GRAPHICS PROCESSING UNITS (10)
INDEXES (10)
INTERNET (10)
OFDM (10)
RECONFIGURABLE HARDWARE (10)
BENCHMARK TESTING (9)
DSP (9)
FINITE IMPULSE RESPONSE FILTER (9)
IMAGE CODING (9)
INTERPOLATION (9)
POWER CONSUMPTION (9)
PROCESS CONTROL (9)
REDUCED INSTRUCTION SET COMPUTING (9)
CLOCK FREQUENCY (8)
DETECTORS (8)
DIGITAL SIGNAL PROCESSING CHIPS (8)
OFDM MODULATION (8)
PIPELINE ARCHITECTURE (8)
POWER DEMAND (8)
SRAM CHIPS (8)
STANDARDS (8)
VLSI (8)
more

INFONA - science communication portal

Search results

An asynchronous loop structure based on the click element

Fast RNS implementation of elliptic curve point multiplication in GF(p) with selected base pairs

An implementation method of poisson image editing on FPGA

Transparent memory encryption and authentication

Broken-Karatsuba multiplication and its application to Montgomery modular multiplication

Parallel RRT∗ architecture design for motion planning

Rapid implementation of a partially reconfigurable video system with PYNQ

Flexible FPGA design for FDTD using OpenCL

Optimizing streaming stencil time-step designs via FPGA floorplanning

TAIGA: A new RISC-V soft-processor framework enabling high performance CPU architectural features

Area-optimized montgomery multiplication on IGLOO 2 FPGAs

OpenCL for HPC with FPGAs: Case study in molecular electrostatics

Auto-SI: An adaptive reconfigurable processor with run-time loop detection and acceleration

Implementation of application specific instruction-set processor for the artificial neural network acceleration using LISA ADL

Packet Classification with Limited Memory Resources

Optimizing compiler for a specialized real-time floating point softcore processor

On How to Design Dataflow FPGA-Based Accelerators for Convolutional Neural Networks

Efficient FPGA Implementation of the SHA-3 Hash Function

Real-time scene flow on COTS embedded systems by coarse-grained software pipeline

Architecture of a synchronized low-latency network node targeted to research and education

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options