Search results

Items from 61 to 80 out of 978 results

chapter

Building Fast but Flexible Software Routers

Sebastian Gallenmuller, Paul Emmerich, Rainer Schonberger, Daniel Raumer, more

2017 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS) > 101 - 102

2017 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS)

Creating quick and dirty prototypes is a simple and effective way to demonstrate the feasibility of new ideas in network research. Though, small scale proof-of-concepts may lack the performance needed to apply them to real world test cases. Thanks to powerful packet processing frameworks such as netmap and DPDK, high-performance packet forwarding systems can be implemented in software today.We present...

chapter

Mind the Gap - A Comparison of Software Packet Generators

Paul Emmerich, Sebastian Gallenmuller, Gianni Antichi, Andrew W. Moore, more

2017 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS) > 191 - 203

2017 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS)

Network research relies on packet generators to assess performance and correctness of new ideas. Software-based generators in particular are widely used by academic researchers because of their flexibility, affordability, and open-source nature. The rise of new frameworks for fast IO on commodity hardware is making them even more attractive. Longstanding performance differences of software generation...

chapter

Empirical investigation of IEEE 802.11ad network

Kien Nguyen, Mirza Golam Kibria, Kentaro Ishizu, Fumihide Kojima

2017 IEEE International Conference on Communications Workshops (ICC Workshops) > 192 - 197

2017 IEEE International Conference on Communications Workshops (ICC Workshops)

The IEEE 802.11ad standard allows wireless devices to operate in the unlicensed spectrum band of 60 GHz. By utilizing the channel with 2.16 GHz width, the devices are able to transmit at multi-Gigabit data rates that potentially satisfy demanding requirements of quality of services. Additionally, the advent of off-the-shelf IEEE 802.11ad device motivates research efforts to exploit this 60 GHz opportunity...

chapter

Performance analysis of virtualized VPN endpoints

D. Lackovic, M. Tomic

2017 40th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) > 466 - 471

2017 40th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)

Virtual Private Networks (VPN) are an established technology that provides users a way to achieve secure communication over an insecure communication channel, such as the public Internet. It has been widely accepted due to its flexibility and availability on many platforms. It is often used as an alternative to expensive leased lines. In traditional setups, VPN endpoints are set up in hardware appliances,...

chapter

Automatic generation of high-performance modular multipliers for arbitrary mersenne primes on FPGAs

Philipp Koppermann, Fabrizio De Santis, Johann Heyszl, Georg Sigl

2017 IEEE International Symposium on Hardware Oriented Security and Trust (HOST) > 35 - 40

2017 IEEE International Symposium on Hardware Oriented Security and Trust (HOST)

Modular multiplication is a fundamental and performance determining operation in various public-key cryptosystems. High-performance modular multipliers on FPGAs are commonly realized by several small-sized multipliers, an adder tree for summing up the digit-products, and a reduction circuit. While small-sized multipliers are available in pre-fabricated high-speed DSP slices, the adder tree and the...

chapter

Designing Virtual Network Functions for 100 GbE Network Using Multicore Processors

Peilong Li, Xiaoban Wu, Yongyi Ran, Yan Luo

2017 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS) > 49 - 59

2017 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS)

Network function virtualization (NFV) introduces great flexibility in designing software-based network appliances to reduce cost and accelerate service deployment for network operators. However, with the fast development of high speed network of 100 GbE and beyond, how to efficiently design virtual network functions (VNF) on commodity servers has become a challenging problem. Although the advances...

chapter

High Throughput FPGA Implementation for regular Non-Surjective Finite Alphabet Iterative Decoders

Thien Truong Nguyen-Ly, Valentin Savin, Xavier Popon, David Declercq

2017 IEEE International Conference on Communications Workshops (ICC Workshops) > 961 - 966

2017 IEEE International Conference on Communications Workshops (ICC Workshops)

This paper deals with the recently introduced class of Non-Surjective Finite Alphabet Iterative Decoders (NS-FAIDs). First, optimization results for an extended class of regular NS-FAIDs are presented. They reveal different possible trade-offs between decoding performance and hardware implementation efficiency. To validate the promises of optimized NS-FAIDs in terms of hardware implementation benefits,...

chapter

Application Level Reordering of Remote Direct Memory Access Operations

Wim Lavrijsen, Costin Iancu

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 988 - 997

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

We present methods for the effective application level reordering of non-blocking RDMA operations. We supplement out-of-order hardware delivery mechanisms with heuristics to account for the CPU side overhead of communication and for differences in network latency: a runtime scheduler takes into account message sizes, destination and concurrency and reorders operations to improve overall communication...

chapter

Sharing the instruction cache among lean cores on an asymmetric CMP for HPC applications

Ugljesa Milic, Alejandro Rico, Paul Carpenter, Alex Ramirez

2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) > 3 - 12

2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)

High performance computing (HPC) applications have parallel code sections that must scale to large numbers of cores, which makes them sensitive to serial regions. Current supercomputing systems with heterogeneous or asymmetric CMPs (ACMP) combine few high-performance big cores for serial regions, together with many low-power lean cores for throughput computing. The low requirements of HPC applications...

chapter

High-Performance Hardware Merge Sorter

Susumu Mashimo, Thiem Van Chu, Kenji Kise

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 1 - 8

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

State-of-the-art studies show that FPGA-based hardware merge sorters (HMSs) can achieve superior performance compared with optimized algorithms on CPUs and GPUs. The performance of any HMS is proportional to its operating frequency (F) and the number of records that can be output each cycle (E). However, all existing HMSs have a problem that F drops significantly with increasing E due to the increase...

chapter

Microarchitecture level reliability comparison of modern GPU designs: First findings

Alessandro Vallero, Stefano Di Carlo, Sotiris Tselonis, Dimitris Gizopoulos

2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) > 129 - 130

2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)

State-of-the-art GPU chips are designed to deliver extreme throughput for graphics as well as for data-parallel general purpose computing workloads (GPGPU computing). Unlike graphics computing, GPGPU computing requires highly reliable operation. The performance-oriented design of GPUs requires to jointly evaluate the vulnerability of GPU workloads to soft-errors with the performance of GPU chips....

chapter

A Scalable FPGA-Based Accelerator for High-Throughput MCMC Algorithms

Morteza Hosseini, Rashidul Islam, Amey Kulkarni, Tinoosh Mohsenin

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 201

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

Markov Chain Monte Carlo (MCMC) algorithms are used to obtain samples from any target probability distribution and are widely used in stochastic processing techniques. Stochastic processing techniques such as machine learning and image processing need to compute large amounts of data in real-time, thus high throughput MCMC samplers are of utmost importance. Parallel Tempering (PT) MCMC has proven...

chapter

Low-complexity high-speed soft-hard decoding for turbo-product codes

Yaroslav Krainyk, Vladislav Perov, Maksym Musiyenko

2017 IEEE 37th International Conference on Electronics and Nanotechnology (ELNANO) > 471 - 474

2017 IEEE 37th International Conference on Electronics and Nanotechnology (ELNANO)

Combined (soft-hard) method for decoding block turbo-product codes is proposed in the paper. The method allows leveraging advantages of soft input data usage with the speed of hard-decoding procedure. The main peculiarity of the method is rule-based decoding stage. The proposed approach simplifies calculation procedure and reaches better correction ability than hard-decision decoder. Mathematical...

chapter

High-level low-power system design optimization

David Pursley, Tung-Hua Yeh

2017 International Symposium on VLSI Design, Automation and Test (VLSI-DAT) > 1 - 4

2017 International Symposium on VLSI Design, Automation and Test (VLSI-DAT)

High-level decisions have the most impact on power consumption, but the effect of those decisions cannot be known until the hardware is implemented. This paper walks the reader through an industrial high-level low-power design methodology that enables the designer to consider and quantitatively evaluate a broad range of hardware implementations to find the most power-efficient architecture. This paper...

chapter

Selective In-Place Appends for Real: Reducing Erases on Wear-prone DBMS Storage

Sergey Hardock, Ilia Petrovy, Robert Gottstein, Alejandro Buchmann

2017 IEEE 33rd International Conference on Data Engineering (ICDE) > 1375 - 1376

2017 IEEE 33rd International Conference on Data Engineering (ICDE)

Abstract-In the present paper we demonstrate the novel technique to apply the recently proposed approach of In-Place Appends - overwrites on Flash without a prior erase operation. IPA can be applied selectively: only to DB-objects that have frequent and relatively small updates. To do so we couple IPA to the concept of NoFTL regions, allowing the DBA to place update-intensive DB-objects into special...

chapter

Syntax Element Partitioning for high-throughput HEVC CABAC decoding

Philipp Habermann, Chi Ching Chi, Mauricio Alvarez-Mesa, Ben Juurlink

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1308 - 1312

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Encoder and decoder implementations of the High Efficiency Video Coding (HEVC) standard have been subject to many optimization approaches since the release in 2013. However, the real-time decoding of high quality and ultra high resolution videos is still a very challenging task. Especially entropy decoding (CABAC) is most often the throughput bottleneck for very high bitrates. Syntax Element Partitioning...

chapter

Multiple parallel branch with folding architecture for multichannel filtered-x least mean square algorithm

Dongyuan Shi, Jianjun He, Chuang Shi, Tatsuya Murao, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1188 - 1192

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Multichannel active noise control (MCANC) systems are commonly used in acoustic noise or vibration control, such as large-dimension ventilation ducts, open windows and mechanical structures. However, its computational load far exceeds the capabilities of digital signal processors (DSPs) and microcontrollers. Even the field programmable gate array (FPGA) cannot straightforwardly cope with the exponential...

chapter

Energy efficient stochastic computing with Sobol sequences

Siting Liu, Jie Han

Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017 > 650 - 653

2017 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Energy efficiency presents a significant challenge for stochastic computing (SC) due to the long random binary bit streams required for accurate computation. In this paper, a type of low discrepancy (LD) sequences, the Sobol sequence, is considered for energy-efficient implementations of SC circuits. The use of Sobol sequences improves the output accuracy of a stochastic circuit with a reduced sequence...

chapter

Real-time anomaly detection for streaming data using burst code on a neurosynaptic processor

Qiuwen Chen, Qinru Qiu

Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017 > 205 - 207

2017 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Real-time anomaly detection for streaming data is a desirable feature for mobile devices or unmanned systems. The key challenge is how to deliver required performance under the stringent power constraint. To address the paradox between performance and power consumption, brain-inspired hardware, such as the IBM Neurosynaptic System, has been developed to enable low power implementation of large-scale...

chapter

A coordinated multi-agent reinforcement learning approach to multi-level cache co-partitioning

Rahul Jain, Preeti Ranjan Panda, Sreenivas Subramoney

Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017 > 800 - 805

2017 Design, Automation & Test in Europe Conference & Exhibition (DATE)

The widening gap between the processor and memory performance has led to the inclusion of multiple levels of caches in the modern multi-core systems. Processors with simultaneous multithreading (SMT) support multiple hardware threads on the same physical core, which results in shared private caches. Any inefficiency in the cache hierarchy can negatively impact the system performance and motivates...

Keywords:
THROUGHPUT
HARDWARE

Publication date

Set your own date range

INFONA - science communication portal

Search results

Building Fast but Flexible Software Routers

Mind the Gap - A Comparison of Software Packet Generators

Empirical investigation of IEEE 802.11ad network

Performance analysis of virtualized VPN endpoints

Automatic generation of high-performance modular multipliers for arbitrary mersenne primes on FPGAs

Designing Virtual Network Functions for 100 GbE Network Using Multicore Processors

High Throughput FPGA Implementation for regular Non-Surjective Finite Alphabet Iterative Decoders

Application Level Reordering of Remote Direct Memory Access Operations

Sharing the instruction cache among lean cores on an asymmetric CMP for HPC applications

High-Performance Hardware Merge Sorter

Microarchitecture level reliability comparison of modern GPU designs: First findings

A Scalable FPGA-Based Accelerator for High-Throughput MCMC Algorithms

Low-complexity high-speed soft-hard decoding for turbo-product codes

High-level low-power system design optimization

Selective In-Place Appends for Real: Reducing Erases on Wear-prone DBMS Storage

Syntax Element Partitioning for high-throughput HEVC CABAC decoding

Multiple parallel branch with folding architecture for multichannel filtered-x least mean square algorithm

Energy efficient stochastic computing with Sobol sequences

Real-time anomaly detection for streaming data using burst code on a neurosynaptic processor

A coordinated multi-agent reinforcement learning approach to multi-level cache co-partitioning

Filter options

Publication date

Content availability

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options