Search results

Items from 101 to 120 out of 782 results

1 ...
3
4
5
6
7
8
9

chapter

Exploring the use of shift register lookup tables for Keccak implementations on Xilinx FPGAs

Jori Winderickx, Joan Daemen, Nele Mentens

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

We explore the possibility of using shift register lookup tables (SRLs) for the implementation of Keccak on Xilinx FPGAs. The approach originates from the observation that the ρ step in combination with the state storage can be implemented as a collection of shift registers. This way, we achieve a slice-wise implementation using 25 shift registers of various lengths, resulting in 75 32-bit and 6 16-bit...

chapter

Packet processing on FPGA SoC with DPDK

Jan Viktorin, Jan Korenek

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 2

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

One of the most important topics of today is a packet processing in data centers with respect to the power consumption and efficient utilization of computational resources. The ARM architecture has proved to be an energy efficient computational system. Together with an integrated FPGA on a single die, it offers potentially a high performance with respect to the power consumption. DPDK - a set of libraries...

chapter

Design and implementation of embedded DAQ using spatial parallelism on FPGA for better throughput

Janice Jia Min, Muataz H. Salih, Zheng Ng, Torry Kho, more

2016 3rd International Conference on Electronic Design (ICED) > 275 - 280

2016 3rd International Conference on Electronic Design (ICED)

Data acquisition (DAQ) is the process of acquire analog signals from different types of sources and further process the acquired signals through personal computer (PC) in digital form. Compared to traditional measurement system, PC-based DAQ system provides a more flexible and cost-effective measurement solution to the industry and utilizes the efficiency, processing power and connectivity capabilities...

chapter

OpenCL-based erasure coding on heterogeneous architectures

Guoyang Chen, Huiyang Zhou, Xipeng Shen, Josh Gahm, more

2016 IEEE 27th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 33 - 40

2016 IEEE 27th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

Erasure coding, Reed-Solomon coding in particular, is a key technique to deal with failures in scale-out storage systems. However, due to the algorithmic complexity, the performance overhead of erasure coding can become a significant bottleneck in storage systems attempting to meet service level agreements (SLAs). Previous work has mainly leveraged SIMD (single-instruction multiple-data) instruction...

chapter

Architecture for quadruple precision floating point division with multi-precision support

Manish Kumar Jaiswal, Hayden K.-H So

2016 IEEE 27th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 239 - 240

2016 IEEE 27th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

This paper proposes a FPGA based hardware architecture for quadruple precision (QP) division arithmetic which can also process a single, a double and a double-extended precision (SP, DP, DPE) computations. The mantissa division employs a series expansion methodology of division, integrated with a wide integer multiplier further optimized for FPGA implementations facilitating the built-in DSP blocks...

chapter

QR decomposition using FPGAs

Michael Parker, Volker Mauer, Dan Pritsker

2016 IEEE National Aerospace and Electronics Conference (NAECON) and Ohio Innovation Summit (OIS) > 416 - 421

2016 IEEE National Aerospace and Electronics Conference (NAECON) and Ohio Innovation Summit (OIS)

This paper describes the architecture and implementation of a high performance QR decomposition IEEE754 single precision floating point core, using a modified Gram-Schmidt algorithm. Using Intel's new floating point Arria 10 FPGAs, synthesis is used to generate column high functional units, giving O(n²) processing times. The modified Gram-Schmidt algorithm is expressed in a different order to combine...

chapter

Multi-GSPS FFTs using FPGAs

Michael Parker, Simon Finn, Hong Shan Neoh

2016 IEEE National Aerospace and Electronics Conference (NAECON) and Ohio Innovation Summit (OIS) > 430 - 436

2016 IEEE National Aerospace and Electronics Conference (NAECON) and Ohio Innovation Summit (OIS)

This paper describes the implementation of a high throughput FFTs implemented on FPGAs, using a modified version of the Radix 2^N architecture. The implementation uses a synthesis method which supports “super-sampling” to provide very high throughput. Special vector structures in the tools and hardware architecture are supported where complex vectors form the input on each clock cycle, and multiple...

chapter

Review on realization of AES encryption and decryption with power and area optimization

Mohini Mohurle, Vishal V. Panchbhai

2016 IEEE 1st International Conference on Power Electronics, Intelligent Control and Energy Systems (ICPEICES) > 1 - 3

2016 IEEE 1st International Conference on Power Electronics, Intelligent Control and Energy Systems (ICPEICES)

In this project, a hardware implementation of the AES-256 encryption and decryption algorithm is proposed. The AES cryptography algorithm can be used to encryption and decryption blocks of 128 bits and is capable of using cipher keys of 256 bits. Feature of the proposed pipeline design is depending on the round keys, which are consumed different round of encryption, are generated in parallel way with...

article

Cuckoo Cache: A Technique to Improve Flow Monitoring Throughput

Salvatore Pontarelli, Pedro Reviriego

IEEE Internet Computing > 2016 > 20 > 4 > 46 - 53

By leveraging the uneven distribution of traffic among network flows, the authors improve the query throughput of Cuckoo hashing. They achieve this by placing the most frequently used items in the table that's accessed first during the Cuckoo query operation. Their scheme is named Cuckoo cache, as it's conceptually similar to a cache but implemented inside the Cuckoo hash with little additional cost...

chapter

RCA on FPGAs designed by the RTL design methodology and wave-pipelined operation

Tomoaki Sato, Sorawat Chivapreecha, Phichet Moungnoul, Kohji Higuchi

2016 13th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON) > 1 - 6

2016 13th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)

Field-programmable gate arrays (FPGAs) are used in various systems that use reconfigurable function. Conventional FPGAs have been developed by a transistor-level description for minimizing routing delay. Although FPGAs developed by the register transfer level (RTL) design methodology provide various benefits to the designers of a system-on-a-chip (SoC), they have not been realized. Therefore, the...

chapter

FPGA based area optimized parallel pipelined radix-2² feed forward FFT architecture

S A Ajmal, S L Gangadharaiah

2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT) > 1302 - 1307

2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT)

The design of pipelined Fast Fourier transform (PFFT) in modern communication systems provides an efficient way for computation of FFT with better area utilizing hardware architecture. Previously, the radix-2² had been used only for single path delay feedback architectures. Later with many types of research works the radix 2² was extended to multi-path delay commutator (MDC) architectures. This paper...

chapter

FPGA kernels for classification rule induction

P. Skoda, B. Medved Rogina

2016 39th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) > 337 - 342

2016 39th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)

Classification is one of the core tasks in machine learning data mining. One of several models of classification are classification rules, which use a set of if-then rules to describe a classification model. In this paper we present a set of FPGA-based compute kernels for accelerating classification rule induction. The kernels can be combined to perform specific procedures in rule induction process,...

chapter

FPGA design of approximate semidefinite relaxation for data detection in large MIMO wireless systems

Oscar Castaneda, Tom Goldstein, Christoph Studer

2016 IEEE International Symposium on Circuits and Systems (ISCAS) > 2659 - 2662

2016 IEEE International Symposium on Circuits and Systems (ISCAS)

We propose a novel, near-optimal data detection algorithm and a corresponding FPGA design for large multiple-input multiple-output (MIMO) wireless systems. Our algorithm, referred to as TASER (short for triangular approximate semidefinite relaxation), relaxes the maximum-likelihood (ML) detection problem to a semidefinite program and solves a non-convex approximation using a preconditioned forward-backward...

chapter

Stochastic image processing and simultaneous dewarping for aerial vehicles

Jamal Lottier Molin, John Rattray, Ralph Etienne-Cummings

2016 IEEE International Symposium on Circuits and Systems (ISCAS) > 2086 - 2089

2016 IEEE International Symposium on Circuits and Systems (ISCAS)

There is increasing interest for aerial vehicles to perform image processing tasks (i.e. object recognition and detection) in real-time. Such systems systems should have minimal data throughput, low computational complexity, and low-power. Traditional frame-based digital cameras are not ideal for meeting such specifications. More recent cameras, inspired by biology, drastically reduce data throughput...

chapter

High-Throughput and Energy-Efficient Graph Processing on FPGA

Shijie Zhou, Charalampos Chelmis, Viktor K. Prasanna

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 103 - 110

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

In this paper, we propose a novel design for large-scale graph processing on FPGA. Our design uses large external memory for storing massive graph data and FPGA for acceleration, and leverages edge-centric computing principles. We propose a data layout which optimizes the external memory performance and leads to an efficient memory activation schedule to reduce on-chip memory power consumption. Further,...

chapter

Initiation Interval Aware Resource Sharing for FPGA DSP Blocks

Ronak Bajaj, Suhaib A. Fahmy

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 135

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

Resource sharing attempts to minimise usage of hardware blocks by mapping multiple operations onto same block at the cost of an increase in schedule length and initiation interval (II). Sharing multi-cycle high-throughput DSP blocks using traditional approaches results in significantly high II, determined by structure of dataflow graph of the design, thus limiting achievable throughput. We have developed...

chapter

A LUT-Based Approximate Adder

Andreas Becher, Jorge Echavarria, Daniel Ziener, Stefan Wildermann, more

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 27

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

In this paper, we propose a novel approximate adder structure for LUT-based FPGA technology. Compared with a full featured accurate carry-ripple adder, the longest path is significantly shortened which enables the clocking with an increased clock frequency. By using the proposed adder structure, the throughput of an FPGA-based implementation can be significantly increased. On the other hand, the resulting...

chapter

VLSI implementation of incremental fixed-complexity LLL lattice reduction for MIMO detection

Qingsong Wen, Xiaoli Ma

2016 IEEE International Symposium on Circuits and Systems (ISCAS) > 1898 - 1901

2016 IEEE International Symposium on Circuits and Systems (ISCAS)

Lenstra-Lenstra-Lovász (LLL) algorithm is a common technique for lattice reduction (LR) aided multiple-input multiple-output (MIMO) detectors. This paper presents the first VLSI implementation of a recently published Incremental fixed-complexity LLL algorithm (Incremental fcLLL) with fewer iterations than other existing fcLLL algorithms. We propose a modified Incremental fcLLL algorithm with simplified...

chapter

Increasing Network Size and Training Throughput of FPGA Restricted Boltzmann Machines Using Dropout

Jiang Su, David B. Thomas, Peter Y.K. Cheung

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 48 - 51

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

Restricted Boltzmann Machines (RBMs) are widely used in modern machine learning tasks. Existing implementations are limited in network size and training throughput by available DSP resources. In this work we propose a new algorithm and architecture for FPGAs called dropout-RBM (dRBM) system. Compared to the state-of-art design methods on the same FPGA, dRBM with a dropout rate 0.5 doubles the maximum...

chapter

Efficient error detection architectures for CORDIC through recomputing with encoded operands

Mehran Mozaffari Kermani, Rajkumar Ramadoss, Reza Azarderakhsh

2016 IEEE International Symposium on Circuits and Systems (ISCAS) > 2154 - 2157

2016 IEEE International Symposium on Circuits and Systems (ISCAS)

Various optimized coordinate rotation digital computer (CORDIC) designs have been proposed to date. Nonetheless, in the presence of natural faults, such architectures could lead to erroneous outputs. In this paper, we propose error detection schemes for CORDIC architectures used vastly in applications such as complex number multiplication, and singular value decomposition for signal and image processing...

1 ...
3
4
5
6
7
8
9

Data set:
ieee
Keywords:
FIELD PROGRAMMABLE GATE ARRAYS
THROUGHPUT

Publication date

Set your own date range

Content availability

Available (773)
None (9)

Publication type

book (691)
article (91)

Keywords

HARDWARE (313)
FPGA (276)
COMPUTER ARCHITECTURE (206)
CLOCKS (157)
ALGORITHM DESIGN AND ANALYSIS (114)
RANDOM ACCESS MEMORY (106)
CRYPTOGRAPHY (93)
PIPELINES (84)
PIPELINE PROCESSING (76)
REGISTERS (76)
TABLE LOOKUP (74)
DECODING (67)
ENCRYPTION (58)
MEMORY MANAGEMENT (55)
PARALLEL PROCESSING (44)
DIGITAL SIGNAL PROCESSING (43)
SOFTWARE (42)
DELAY (40)
IP NETWORKS (37)
FIELD PROGRAMMABLE GATE ARRAY (36)
LOGIC DESIGN (36)
PROTOCOLS (36)
MIMO (35)
BANDWIDTH (34)
PARITY CHECK CODES (33)
ENGINES (32)
OPTIMIZATION (32)
RECONFIGURABLE ARCHITECTURES (32)
KERNEL (31)
PROGRAM PROCESSORS (31)
ADDERS (29)
APPLICATION SPECIFIC INTEGRATED CIRCUITS (29)
ENCODING (29)
PARALLEL ARCHITECTURES (29)
ARRAYS (28)
POWER DEMAND (28)
FPGA IMPLEMENTATION (27)
COMPLEXITY THEORY (26)
DATA MINING (26)
GENERATORS (26)
LOGIC GATES (26)
STANDARDS (26)
SIGNAL PROCESSING ALGORITHMS (25)
SWITCHES (24)
ROUTING (23)
SECURITY (23)
VLSI (23)
NIST (22)
DETECTORS (21)
EQUATIONS (21)
ITERATIVE DECODING (20)
SYSTEM-ON-CHIP (20)
AES (19)
MATHEMATICAL MODEL (19)
RESOURCE MANAGEMENT (19)
SYNCHRONIZATION (19)
MICROPROCESSOR CHIPS (18)
MIMO COMMUNICATION (18)
ACCELERATION (17)
CIPHERS (17)
DELAYS (17)
PACKET CLASSIFICATION (17)
PERFORMANCE EVALUATION (17)
REAL TIME SYSTEMS (17)
SHA-3 (17)
POLYNOMIALS (16)
RADIATION DETECTORS (16)
SIGNAL PROCESSING (16)
ADVANCED ENCRYPTION STANDARD (15)
FIELD-PROGRAMMABLE GATE ARRAY (FPGA) (15)
HARDWARE DESCRIPTION LANGUAGES (15)
HARDWARE IMPLEMENTATION (15)
INTEGRATED CIRCUIT DESIGN (15)
INTERNET (15)
NETWORK-ON-CHIP (15)
PATTERN MATCHING (15)
POWER CONSUMPTION (15)
TELECOMMUNICATION NETWORK ROUTING (15)
WIRELESS COMMUNICATION (15)
ASIC (14)
FIELD PROGRAMMABLE GATE ARRAY (FPGA) (14)
IMAGE CODING (14)
INDEXES (14)
PIPELINE (14)
SHIFT REGISTERS (14)
SYSTEM-ON-A-CHIP (14)
BENCHMARK TESTING (13)
COMPUTATIONAL MODELING (13)
MULTIPLEXING (13)
PIPELINING (13)
REAL-TIME SYSTEMS (13)
BIT ERROR RATE (12)
FFT (12)
FIELD-PROGRAMMABLE GATE ARRAY (12)
FPGAS (12)
HASH FUNCTION (12)
SCHEDULES (12)
VHDL (12)
more

INFONA - science communication portal

Search results

Exploring the use of shift register lookup tables for Keccak implementations on Xilinx FPGAs

Packet processing on FPGA SoC with DPDK

Design and implementation of embedded DAQ using spatial parallelism on FPGA for better throughput

OpenCL-based erasure coding on heterogeneous architectures

Architecture for quadruple precision floating point division with multi-precision support

QR decomposition using FPGAs

Multi-GSPS FFTs using FPGAs

Review on realization of AES encryption and decryption with power and area optimization

Cuckoo Cache: A Technique to Improve Flow Monitoring Throughput

RCA on FPGAs designed by the RTL design methodology and wave-pipelined operation

FPGA based area optimized parallel pipelined radix-2² feed forward FFT architecture

FPGA kernels for classification rule induction

FPGA design of approximate semidefinite relaxation for data detection in large MIMO wireless systems

Stochastic image processing and simultaneous dewarping for aerial vehicles

High-Throughput and Energy-Efficient Graph Processing on FPGA

Initiation Interval Aware Resource Sharing for FPGA DSP Blocks

A LUT-Based Approximate Adder

VLSI implementation of incremental fixed-complexity LLL lattice reduction for MIMO detection

Increasing Network Size and Training Throughput of FPGA Restricted Boltzmann Machines Using Dropout

Efficient error detection architectures for CORDIC through recomputing with encoded operands

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options