Advanced search

Advanced search in people

From:

To:

Items from 1 to 20 out of 68 results

chapter

A 0.3mm² 280MHz GF(3^m) η_T pairing accelerator for lightweight system

Xusheng Wang, Xiangyu Li

2017 International Conference on Electron Devices and Solid-State Circuits (EDSSC) > 1 - 2

2017 International Conference on Electron Devices and Solid-State Circuits (EDSSC)

In this paper, a low-cost accelerator for the η_T pairing in characteristic three over the super-singular elliptic curves is designed. As the critical operations of η_T pairing, the cubing and sparse multiplications over GF(3^6m) in the Miller's algorithm are merged and their arithmetic are modified and scheduled to reduce the intermediate data related overhead. With these optimizations, the Miller's...

chapter

High level synthesis using vivado HLS for optimizations of SHA-3

H S. Jacinto, Luka Daoud, Nader Rafla

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS) > 563 - 566

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS)

Hash functions represent a fundamental building block of many network security protocols. The SHA-3 hashing algorithm is the most recently developed hash function, and the most secure. Implementation of the SHA-3 hashing algorithm in Hardware Description Language (HDL) is time demanding and tedious to debug. On the other hand, High-Level Synthesis (HLS) tools offer potential solutions to the hardware...

chapter

DeepPump: Multi-pumping deep Neural Networks

Ruizhe Zhao, Tim Todman, Wayne Luk, Xinyu Niu

2017 IEEE 28th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 206

2017 IEEE 28th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

This paper presents DeepPump, an approach that generates CNN hardware designs with multi-pumping, which have competitive performance when compared with previous designs. Future work includes integrating DeepPump with other optimisations, and providing further evaluations on various FPGA platforms.

chapter

High-level synthesis of approximate hardware under joint precision and voltage scaling

Seogoo Lee, Lizy K. John, Andreas Gerstlauer

Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017 > 187 - 192

2017 Design, Automation & Test in Europe Conference & Exhibition (DATE)

In recent years, approximate computing has emerged as a promising approach to trade off quality of computed outputs for energy savings. In this paper, we present an approximate high-level synthesis (AHLS) approach that outputs a quality-energy optimized register-transfer-level implementation from an accurate high-level C description. Existing AHLS work only considers switching activity for energy...

chapter

Greybox design methodology: A program driven hardware co-optimization with ultra-dynamic clock management

Tianyu Jia, Russ Joseph, Jie Gu

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC)

In this paper, a novel Greybox design methodology is proposed to establish a design and co-optimization flow across the boundary of conventional software and hardware design. The dynamic timing of each software instruction is simulated and associated with processor hardware design, which provides the basis of ultra-dynamic clock management. The proposed scheme effectively implements the instruction-based...

chapter

Precise digital implementations of hyperbolic tanh and sigmoid function

Shaghayegh Gomar, Mitra Mirhassani, Majid Ahmadi

2016 50th Asilomar Conference on Signals, Systems and Computers > 1586 - 1589

2016 50th Asilomar Conference on Signals, Systems and Computers

Sigmoid and Hyperbolic Tangent are widely used as activation functions in artificial neural networks. Exponential term and division are basic building blocks of these functions. This paper proposes precise and efficient hardware implementations for sigmoid and hyperbolic tangent functions using exponential function approximation. Performance of both functions has been verified which shows that the...

chapter

High Level Synthesis based FPGA Implementation of H.264/AVC Sub-Pixel Luma Interpolation Filters

Waqar Ahmad, Javed Iqbal, Maurizio Martina, Guido Masera

2016 European Modelling Symposium (EMS) > 79 - 82

2016 European Modelling Symposium (EMS)

In High Efficiency Video Coding (HEVC) and H.264/AVC video coding standards, Interpolation filtering used for sub-pixel interpolation is one of the most computational intensive parts of the standards. Video processing systems are becoming more complex thus decreasing the productivity of the hardware designers and the software programmers, producing design productivity gap. To fill this productivity...

chapter

An ultra-low power AES encryption core in 65nm SOTB CMOS process

Van-Phuc Hoang, Van-Lan Dao, Cong-Kha Pham

2016 International SoC Design Conference (ISOCC) > 89 - 90

2016 International SoC Design Conference (ISOCC)

This paper presents an efficient ASIC implementation of the low area and ultra-low power AES encryption core with an optimized S-box, Rcon and control blocks optimization, combined with a simple clock gating technique using an ultra-low power 65nm SOTB CMOS technology. The ASIC implementation results show that the proposed AES encryption core requires a small number of clock cycles with ultra-low...

chapter

FFT/IFFT implementation using Vivado™ HLS

Amit Salaskar, Nitin Chandrachoodan

2016 20th International Symposium on VLSI Design and Test (VDAT) > 1 - 2

2016 20th International Symposium on VLSI Design and Test (VDAT)

High level synthesis tools are an attractive option for rapid prototyping and implementation of hardware designs. In this paper we present a case study of using such a tool for the design and implementation of an FFT core for use in a wireless modem. The optimizations used for directing the conversion of C code to hardware are discussed and the impact of the different directives is analyzed. The resulting...

chapter

Loop Splitting for Efficient Pipelining in High-Level Synthesis

Junyi Liu, John Wickerson, George A. Constantinides

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 72 - 79

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

Loop pipelining is widely adopted as a key optimization method in high-level synthesis (HLS). However, when complex memory dependencies appear in a loop, commercial HLS tools are still not able to maximize pipeline performance. In this paper, we leverage parametric polyhedral analysis to reason about memory dependence patterns that are uncertain (i.e., parameterised by an undetermined variable) and/or...

chapter

Design and verification using high-level synthesis

Andres Takach

2016 21st Asia and South Pacific Design Automation Conference (ASP-DAC) > 198 - 203

2016 21st Asia and South Pacific Design Automation Conference (ASP-DAC)

The adoption of HLS has been driven by the need to tackle growing verification costs in traditional RTL design flows. This paper presents an overview of design, optimization and verification using HLS. It also outlines some of the requirements for HLS design to fit into existing design and verification flows and ways in which such flows might be adapted as HLS is more widely deployed.

chapter

Hardware and software performance of image processing applications on reconfigurable systems

Ashish Mishra, Mohit Agarwal, Kota Solomon Raju

2015 Annual IEEE India Conference (INDICON) > 1 - 5

2015 Annual IEEE India Conference (INDICON)

Field Programmable Gate Arrays (FPGAs) have been extensively used in accelerating applications in many digital domains, examples include image and signal processing. These applications have been abundantly tested in high level languages like C, C++ and Matlab programming. Many standard libraries exist for image processing applications like OpenCV for end to end solutions. Applications centered around...

chapter

Hardware optimization of complex multiplication scheme for DSP application

Monika Hemnani, Sangeeta Palekar, Preeti Dixit, Pankaj Joshi

2015 International Conference on Computer, Communication and Control (IC4) > 1 - 4

2015 International Conference on Computer, Communication and Control (IC4)

Complex multiplications are the backbones of almost all Digital Signal Processing (DSP) algorithms and several other scientific applications. Complexity Reduction of these operations at architectural level or algorithmic level can certainly save the chip area, which ultimately can be a driver parameter for selection of power or speed optimized architectures. Improvement in these performance parameters...

chapter

Hardware Design Space Exploration with a New Dimension -- IP Protection Robustness

Qiang Liu, Haie Li

2015 Euromicro Conference on Digital System Design > 599 - 605

2015 Euromicro Conference on Digital System Design (DSD)

Design space exploration (DSE) is now an important phase of the SoC design process, in order to realize high-efficiency design. In conventional DSE, design metrics such as speed, power and area are extensively used to evaluate various design options. As IP-reuse is widely adopted, protection of hardware IPs has been paid more and more attention at advanced design processes. This paper considers IP...

chapter

FPGA autonomous logic analyzer using innovative BERC filter optimization

Aleodor Daniel Ioan, Mihael Cristian Ignat

2015 7th International Conference on Electronics, Computers and Artificial Intelligence (ECAI) > E-39 - E-46

2015 7th International Conference on Electronics, Computers and Artificial Intelligence (ECAI)

In this work is presented a new hardware implementation of a high speed logic analyzer inside FPGA (Field Programmable Gate Array) chips that is fully autonomous by directly driving a VGA compatible computer monitor for multiple signals display. It can be used as a very low cost and real time testing instrument for both external hardware and internal FPGA designs. The implementation is optimized at...

chapter

LFSR Reseeding Based Test Compression Respecting Different Controllability of Decompressor Outputs

Ondrej Novak, Jiri Jenicek, Martin Rozkovec

2015 IEEE 18th International Symposium on Design and Diagnostics of Electronic Circuits & Systems > 9 - 14

2015 IEEE 18th International Symposium on Design and Diagnostics of Electronic Circuits & Systems (DDECS)

The paper discusses possibilities of rearranging test decompress or internal structure and linking its outputs with the parallel scan chain inputs in order to obtain better compression efficiency while the hardware overhead is not increased. We have experimentally verified that the controllability of decompress or outputs can be used as a simple and easily computable measure of the decompress or efficiency...

chapter

Compact and low power AES block cipher using lightweight key expansion mechanism and optimal number of S-Boxes

J. J. Tay, M. M. Wong, I. Hijazin

2014 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) > 108 - 114

2014 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)

In the past decade, we observed the trend of technological advancement towards the field of portable electronics. As electronic devices shrink in size, constraints emerge in the form of limited power supply and area for the implementation of information security mechanisms. In this work, our goal is to produce a complete AES block cipher for data encryption and perform optimization in terms of power...

chapter

High throughput channel tracking for JTRS wireless channel emulation

Dajung Lee, Janarbek Matai, Brad Weals, Ryan Kastner

2014 24th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2014 24th International Conference on Field Programmable Logic and Applications (FPL)

Testing and verifying wireless systems in a real world environments is a challenging but an important problem. This is particular true for the Joint Tactical Radio System (JTRS) where the modulation techniques are optimized towards environments that are difficult to reproduce (e.g., ship to plane, plane to satellite communications). Such cases necessitate a wireless channel emulator to facilitate...

chapter

Efficient speed (ES): Adaptive DVFS and clock modulation for energy efficiency

Pietro Cicotti, Ananta Tiwari, Laura Carrington

2014 IEEE International Conference on Cluster Computing (CLUSTER) > 158 - 166

2014 IEEE International Conference On Cluster Computing (CLUSTER)

Meeting the 20MW power envelope sought for exascale is one of the greatest challenges in designing those class of systems. Addressing this challenge requires over-provisioned and dynamically reconfigurable system with fine-grained control on power and speed of the individual cores. In this paper, we present EfficientSpeed (ES), a library that improves energy efficiency in scientific computing by carefully...

chapter

Optimization of a dot product accelerator

Amitava Biswas

2014 IEEE 57th International Midwest Symposium on Circuits and Systems (MWSCAS) > 619 - 622

2014 IEEE 57th International Midwest Symposium on Circuits and Systems (MWSCAS)

Vector dot product is an important computation which needs hardware accelerators. We present an optimized accelerator chip that has larger capacity than our prior designs. This design can compute product for 10000 component vectors within 1000 clock cycles, with average being 80 cycles. Our design has superior speed compared to other accelerators.

Keywords:
HARDWARE
OPTIMIZATION
CLOCKS

Publication date

Set your own date range

INFONA - science communication portal

Advanced search

Advanced search in people

A 0.3mm² 280MHz GF(3^m) η_T pairing accelerator for lightweight system

High level synthesis using vivado HLS for optimizations of SHA-3

DeepPump: Multi-pumping deep Neural Networks

High-level synthesis of approximate hardware under joint precision and voltage scaling

Greybox design methodology: A program driven hardware co-optimization with ultra-dynamic clock management

Precise digital implementations of hyperbolic tanh and sigmoid function

High Level Synthesis based FPGA Implementation of H.264/AVC Sub-Pixel Luma Interpolation Filters

An ultra-low power AES encryption core in 65nm SOTB CMOS process

FFT/IFFT implementation using Vivado™ HLS

Loop Splitting for Efficient Pipelining in High-Level Synthesis

Design and verification using high-level synthesis

Hardware and software performance of image processing applications on reconfigurable systems

Hardware optimization of complex multiplication scheme for DSP application

Hardware Design Space Exploration with a New Dimension -- IP Protection Robustness

FPGA autonomous logic analyzer using innovative BERC filter optimization

LFSR Reseeding Based Test Compression Respecting Different Controllability of Decompressor Outputs

Compact and low power AES block cipher using lightweight key expansion mechanism and optimal number of S-Boxes

High throughput channel tracking for JTRS wireless channel emulation

Efficient speed (ES): Adaptive DVFS and clock modulation for energy efficiency

Optimization of a dot product accelerator

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Advanced search

Advanced search in people

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options