Search results

Items from 81 to 100 out of 691 results

chapter

Dynamic FPGA-accelerator sharing among concurrently running virtual machines

Hamid Nasiri, Maziar Goudarzi

2016 IEEE East-West Design & Test Symposium (EWDTS) > 1 - 4

2016 IEEE East-West Design & Test Symposium (EWDTS)

Using an FPGA as a hardware accelerator has been prevalent, to speed up compute intensive workloads. However, employing an accelerator in virtualized environment enhances complexity, because accessing the accelerator from virtual machines has significant overhead and sharing it needs some considerations. We have implemented adequate infrastructure to share an FPGA-based accelerator between multiple...

chapter

Low cost resilient regular expression matching on FPGAs

Marcos T. Leipnitz, Eduardo Nunes de Souza, Gabriel L. Nazar

2016 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT) > 75 - 80

2016 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)

The Network Function Virtualization (NFV) paradigm promises to make networks more scalable and flexible by decoupling the network functions (NFs) from dedicated and vendor-specific hardware. However, network and compute intensive NFs may be difficult to virtualize without performance degradation. In this context, Field-Programmable Gate Arrays (FPGAs) have been shown to be a good option for hardware...

chapter

Design of Media Access Control in visible light communication system and a simple way to avoid dual transmit over dual Access Point

Baohua Xu, Minglun Zhang, Yi Sha

2016 15th International Conference on Optical Communications and Networks (ICOCN) > 1 - 3

2016 15th International Conference on Optical Communications and Networks (ICOCN)

Visible light communication (VLC) has won much attention in recent years. In this work, an experimental visible light communication system of its' Media Access Control (MAC) layer on the digital signal process and a simple method to avoid one packet dual transferred between two or more Access Point (AP) is presented. The work is implemented in FPGA (Field Programmable Gate Array), which are based...

chapter

Efficient implementation of the AES algorithm for security applications

Shady Mohamed Soliman, Baher Magdy, Mohamed A. Abd El Ghany

2016 29th IEEE International System-on-Chip Conference (SOCC) > 206 - 210

2016 29th IEEE International System-on-Chip Conference (SOCC)

Throughput, area and power optimized designs for the advanced encryption standard algorithm are proposed in this paper. The presented designs are suitable for the encrypt-only AES-128 algorithm. Both designs integrate pipelining and iterative architectures in one design. This is achieved through applying the concept of partial loop unrolling where iterations and multistage pipelining are used to optimize...

chapter

Overcoming resource underutilization in spatial CNN accelerators

Yongming Shen, Michael Ferdman, Peter Milder

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

Convolutional neural networks (CNNs) are revolutionizing a variety of machine learning tasks, but they present significant computational challenges. Recently, FPGA-based accelerators have been proposed to improve the speed and efficiency of CNNs. Current approaches construct an accelerator optimized to maximize the overall throughput of iteratively computing the CNN layers. However, this approach...

chapter

LYNX: CAD for FPGA-based networks-on-chip

Mohamed S. Abdelfattah, Vaughn Betz

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 10

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

We present a computer-aided design (CAD) tool that automatically connects an FPGA application using an embedded network-on-chip (NoC). After discussing the CAD flow steps, we delve into the details of implementing transaction communication using our CAD tool. This request-reply type of communication requires special consideration on FPGAs, for example: low round-trip latency, fair arbitration and...

chapter

Memory efficient and high performance key-value store on FPGA using Cuckoo hashing

Wei Liang, Wenbo Yin, Ping Kang, Lingli Wang

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

Key-value stores (KVS) become critical in many applications because of the data explosion recently. There is a strong demand to improve the throughput and reduce the latency for KVS. FPGA-based parallel architecture can bring excellent performance and power efficiency. Cuckoo hashing has proven to be an efficient approach to implement KVS with good memory utilization and constant worst case access...

chapter

JetStream: An open-source high-performance PCI Express 3 streaming library for FPGA-to-Host and FPGA-to-FPGA communication

Malte Vesper, Dirk Koch, Kizheppatt Vipin, Suhaib A. Fahmy

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 9

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

Many FPGA-based accelerators are constrained by the available resources and multi-FPGA solutions can be necessary for building more capable systems. Available PCIe solutions provide only FPGA-to-Host communication. In this paper we present JetStream, an open-source¹ modular PCIe 3 library, supporting not only fast FPGA-to-Host communication, but also allowing direct FPGA-to-FPGA communication which...

chapter

Improved resource sharing for FPGA DSP blocks

Bajaj Ronak, Suhaib A. Fahmy

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

Sharing multi-cycle hardware blocks like the DSP48E1 primitive in Xilinx FPGAs can result in significant resource savings, but complicates scheduling. For high-throughput, DSP blocks must be pipelined, which results in a high initiation interval (II) for resource shared implementations. In this paper, we propose a resource reduction technique that minimises DSP block usage while also offering improved...

chapter

Exploring the use of shift register lookup tables for Keccak implementations on Xilinx FPGAs

Jori Winderickx, Joan Daemen, Nele Mentens

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

We explore the possibility of using shift register lookup tables (SRLs) for the implementation of Keccak on Xilinx FPGAs. The approach originates from the observation that the ρ step in combination with the state storage can be implemented as a collection of shift registers. This way, we achieve a slice-wise implementation using 25 shift registers of various lengths, resulting in 75 32-bit and 6 16-bit...

chapter

Packet processing on FPGA SoC with DPDK

Jan Viktorin, Jan Korenek

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 2

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

One of the most important topics of today is a packet processing in data centers with respect to the power consumption and efficient utilization of computational resources. The ARM architecture has proved to be an energy efficient computational system. Together with an integrated FPGA on a single die, it offers potentially a high performance with respect to the power consumption. DPDK - a set of libraries...

chapter

Design and implementation of embedded DAQ using spatial parallelism on FPGA for better throughput

Janice Jia Min, Muataz H. Salih, Zheng Ng, Torry Kho, more

2016 3rd International Conference on Electronic Design (ICED) > 275 - 280

2016 3rd International Conference on Electronic Design (ICED)

Data acquisition (DAQ) is the process of acquire analog signals from different types of sources and further process the acquired signals through personal computer (PC) in digital form. Compared to traditional measurement system, PC-based DAQ system provides a more flexible and cost-effective measurement solution to the industry and utilizes the efficiency, processing power and connectivity capabilities...

chapter

OpenCL-based erasure coding on heterogeneous architectures

Guoyang Chen, Huiyang Zhou, Xipeng Shen, Josh Gahm, more

2016 IEEE 27th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 33 - 40

2016 IEEE 27th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

Erasure coding, Reed-Solomon coding in particular, is a key technique to deal with failures in scale-out storage systems. However, due to the algorithmic complexity, the performance overhead of erasure coding can become a significant bottleneck in storage systems attempting to meet service level agreements (SLAs). Previous work has mainly leveraged SIMD (single-instruction multiple-data) instruction...

chapter

Architecture for quadruple precision floating point division with multi-precision support

Manish Kumar Jaiswal, Hayden K.-H So

2016 IEEE 27th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 239 - 240

2016 IEEE 27th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

This paper proposes a FPGA based hardware architecture for quadruple precision (QP) division arithmetic which can also process a single, a double and a double-extended precision (SP, DP, DPE) computations. The mantissa division employs a series expansion methodology of division, integrated with a wide integer multiplier further optimized for FPGA implementations facilitating the built-in DSP blocks...

chapter

QR decomposition using FPGAs

Michael Parker, Volker Mauer, Dan Pritsker

2016 IEEE National Aerospace and Electronics Conference (NAECON) and Ohio Innovation Summit (OIS) > 416 - 421

2016 IEEE National Aerospace and Electronics Conference (NAECON) and Ohio Innovation Summit (OIS)

This paper describes the architecture and implementation of a high performance QR decomposition IEEE754 single precision floating point core, using a modified Gram-Schmidt algorithm. Using Intel's new floating point Arria 10 FPGAs, synthesis is used to generate column high functional units, giving O(n²) processing times. The modified Gram-Schmidt algorithm is expressed in a different order to combine...

chapter

Multi-GSPS FFTs using FPGAs

Michael Parker, Simon Finn, Hong Shan Neoh

2016 IEEE National Aerospace and Electronics Conference (NAECON) and Ohio Innovation Summit (OIS) > 430 - 436

2016 IEEE National Aerospace and Electronics Conference (NAECON) and Ohio Innovation Summit (OIS)

This paper describes the implementation of a high throughput FFTs implemented on FPGAs, using a modified version of the Radix 2^N architecture. The implementation uses a synthesis method which supports “super-sampling” to provide very high throughput. Special vector structures in the tools and hardware architecture are supported where complex vectors form the input on each clock cycle, and multiple...

chapter

Review on realization of AES encryption and decryption with power and area optimization

Mohini Mohurle, Vishal V. Panchbhai

2016 IEEE 1st International Conference on Power Electronics, Intelligent Control and Energy Systems (ICPEICES) > 1 - 3

2016 IEEE 1st International Conference on Power Electronics, Intelligent Control and Energy Systems (ICPEICES)

In this project, a hardware implementation of the AES-256 encryption and decryption algorithm is proposed. The AES cryptography algorithm can be used to encryption and decryption blocks of 128 bits and is capable of using cipher keys of 256 bits. Feature of the proposed pipeline design is depending on the round keys, which are consumed different round of encryption, are generated in parallel way with...

chapter

RCA on FPGAs designed by the RTL design methodology and wave-pipelined operation

Tomoaki Sato, Sorawat Chivapreecha, Phichet Moungnoul, Kohji Higuchi

2016 13th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON) > 1 - 6

2016 13th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)

Field-programmable gate arrays (FPGAs) are used in various systems that use reconfigurable function. Conventional FPGAs have been developed by a transistor-level description for minimizing routing delay. Although FPGAs developed by the register transfer level (RTL) design methodology provide various benefits to the designers of a system-on-a-chip (SoC), they have not been realized. Therefore, the...

chapter

FPGA based area optimized parallel pipelined radix-2² feed forward FFT architecture

S A Ajmal, S L Gangadharaiah

2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT) > 1302 - 1307

2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT)

The design of pipelined Fast Fourier transform (PFFT) in modern communication systems provides an efficient way for computation of FFT with better area utilizing hardware architecture. Previously, the radix-2² had been used only for single path delay feedback architectures. Later with many types of research works the radix 2² was extended to multi-path delay commutator (MDC) architectures. This paper...

chapter

FPGA kernels for classification rule induction

P. Skoda, B. Medved Rogina

2016 39th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) > 337 - 342

2016 39th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)

Classification is one of the core tasks in machine learning data mining. One of several models of classification are classification rules, which use a set of if-then rules to describe a classification model. In this paper we present a set of FPGA-based compute kernels for accelerating classification rule induction. The kernels can be combined to perform specific procedures in rule induction process,...

Keywords:
THROUGHPUT
FIELD PROGRAMMABLE GATE ARRAYS

Publication date

Set your own date range

Content availability

Available (682)
None (9)

Keywords

HARDWARE (277)
FPGA (253)
COMPUTER ARCHITECTURE (186)
CLOCKS (140)
ALGORITHM DESIGN AND ANALYSIS (108)
RANDOM ACCESS MEMORY (101)
CRYPTOGRAPHY (86)
PIPELINES (71)
REGISTERS (68)
PIPELINE PROCESSING (65)
TABLE LOOKUP (65)
DECODING (55)
ENCRYPTION (55)
MEMORY MANAGEMENT (49)
PARALLEL PROCESSING (40)
SOFTWARE (39)
DIGITAL SIGNAL PROCESSING (36)
IP NETWORKS (35)
PROTOCOLS (35)
DELAY (34)
MIMO (32)
BANDWIDTH (31)
FIELD PROGRAMMABLE GATE ARRAY (31)
LOGIC DESIGN (30)
OPTIMIZATION (30)
ENGINES (29)
ADDERS (28)
ENCODING (28)
KERNEL (28)
RECONFIGURABLE ARCHITECTURES (28)
COMPLEXITY THEORY (26)
PARALLEL ARCHITECTURES (26)
POWER DEMAND (26)
PROGRAM PROCESSORS (26)
DATA MINING (25)
FPGA IMPLEMENTATION (25)
GENERATORS (24)
SIGNAL PROCESSING ALGORITHMS (24)
APPLICATION SPECIFIC INTEGRATED CIRCUITS (23)
PARITY CHECK CODES (23)
STANDARDS (23)
ARRAYS (22)
LOGIC GATES (22)
SECURITY (22)
ROUTING (21)
NIST (19)
SYSTEM-ON-CHIP (19)
AES (18)
MATHEMATICAL MODEL (18)
SWITCHES (18)
SYNCHRONIZATION (18)
VLSI (18)
DETECTORS (17)
MICROPROCESSOR CHIPS (17)
RESOURCE MANAGEMENT (17)
SHA-3 (17)
CIPHERS (16)
REAL TIME SYSTEMS (16)
EQUATIONS (15)
MIMO COMMUNICATION (15)
PERFORMANCE EVALUATION (15)
POLYNOMIALS (15)
TELECOMMUNICATION NETWORK ROUTING (15)
ACCELERATION (14)
INTERNET (14)
NETWORK-ON-CHIP (14)
PATTERN MATCHING (14)
RADIATION DETECTORS (14)
SYSTEM-ON-A-CHIP (14)
ADVANCED ENCRYPTION STANDARD (13)
COMPUTATIONAL MODELING (13)
DELAYS (13)
HARDWARE DESCRIPTION LANGUAGES (13)
HARDWARE IMPLEMENTATION (13)
MULTIPLEXING (13)
POWER CONSUMPTION (13)
WIRELESS COMMUNICATION (13)
FFT (12)
INDEXES (12)
INTEGRATED CIRCUIT DESIGN (12)
PACKET CLASSIFICATION (12)
PIPELINING (12)
REAL-TIME SYSTEMS (12)
SHIFT REGISTERS (12)
VHDL (12)
BENCHMARK TESTING (11)
MAGNETIC CORES (11)
PIPELINE (11)
SIGNAL PROCESSING (11)
SRAM CHIPS (11)
ASIC (10)
BIT ERROR RATE (10)
DYNAMIC PARTIAL RECONFIGURATION (10)
ETHERNET NETWORKS (10)
FPGAS (10)
HASH FUNCTION (10)
IMAGE CODING (10)
ITERATIVE DECODING (10)
more

INFONA - science communication portal

Search results

Dynamic FPGA-accelerator sharing among concurrently running virtual machines

Low cost resilient regular expression matching on FPGAs

Design of Media Access Control in visible light communication system and a simple way to avoid dual transmit over dual Access Point

Efficient implementation of the AES algorithm for security applications

Overcoming resource underutilization in spatial CNN accelerators

LYNX: CAD for FPGA-based networks-on-chip

Memory efficient and high performance key-value store on FPGA using Cuckoo hashing

JetStream: An open-source high-performance PCI Express 3 streaming library for FPGA-to-Host and FPGA-to-FPGA communication

Improved resource sharing for FPGA DSP blocks

Exploring the use of shift register lookup tables for Keccak implementations on Xilinx FPGAs

Packet processing on FPGA SoC with DPDK

Design and implementation of embedded DAQ using spatial parallelism on FPGA for better throughput

OpenCL-based erasure coding on heterogeneous architectures

Architecture for quadruple precision floating point division with multi-precision support

QR decomposition using FPGAs

Multi-GSPS FFTs using FPGAs

Review on realization of AES encryption and decryption with power and area optimization

RCA on FPGAs designed by the RTL design methodology and wave-pipelined operation

FPGA based area optimized parallel pipelined radix-2² feed forward FFT architecture

FPGA kernels for classification rule induction

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options