Today's network traffic is dynamic and fast. Conventional network traffic classification based on flow features and data mining cannot process traffic efficiently. A hardware-based network traffic classifier is needed that adapts to dynamic network states and provides accurate, up-to-date classification at high speed. In this paper, a hardware architecture of online incremental semi-supervised...
Exploiting resource reusability and low precision in neural networks is a promising approach to achieving energy-efficient computational platforms. This research presents two generalizable approaches to reusing resources in feed-forward neural networks and demonstrates them on extreme learning machines. In the first approach, coalescing, a single stack of neuronal units performs both feature extraction and...
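For readers unfamiliar with the network family targeted here: an extreme learning machine trains only its output layer, while the hidden layer stays random and fixed — which is what makes reusing a single neuronal stack in hardware attractive. A minimal Python sketch (all sizes and the test function are illustrative, not the paper's architecture):

```python
import numpy as np

def elm_train(X, y, n_hidden=64, seed=0):
    """Minimal extreme learning machine: a fixed random hidden layer,
    with only the output weights solved by least squares."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((X.shape[1], n_hidden))  # random input weights (never trained)
    b = rng.standard_normal(n_hidden)                # random biases (never trained)
    H = np.tanh(X @ W + b)                           # hidden-layer activations
    beta, *_ = np.linalg.lstsq(H, y, rcond=None)     # solve H @ beta ≈ y in one shot
    return W, b, beta

def elm_predict(X, W, b, beta):
    return np.tanh(X @ W + b) @ beta

# Sanity check: fit a smooth 1-D function.
X = np.linspace(-1, 1, 200).reshape(-1, 1)
y = np.sin(3 * X[:, 0])
W, b, beta = elm_train(X, y)
err = np.max(np.abs(elm_predict(X, W, b, beta) - y))
```

Because there is no iterative backpropagation, the only trained parameters are `beta`, obtained from a single linear solve.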
Hash functions are a fundamental building block of many network security protocols. The SHA-3 hashing algorithm is the most recently developed hash function, and the most secure. Implementing the SHA-3 hashing algorithm in a Hardware Description Language (HDL) is time-consuming and tedious to debug. On the other hand, High-Level Synthesis (HLS) tools offer potential solutions to the hardware...
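Since SHA-3 is standardized (FIPS 202), a quick software golden model is handy when debugging an HDL or HLS implementation against known digests — a common verification practice, not something specific to this paper:

```python
import hashlib

# Software golden model: compute a SHA3-256 digest to compare against
# the output of an HDL/HLS implementation of the core.
digest = hashlib.sha3_256(b"").hexdigest()
print(digest)  # 256-bit digest as 64 hex characters
```

Hashing a suite of fixed test vectors this way and comparing against the simulated hardware output catches most datapath bugs early.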
Network security and monitoring devices use packet classification to match packet header fields against a set of rules. Many hardware architectures have been designed to accelerate packet classification and achieve wire-speed throughput for 100 Gbps networks. These architectures are designed for high throughput even for the shortest packets. However, FPGA SoCs and Intel Xeon with FPGA platforms have limited resources...
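Functionally, packet classification reduces to matching header fields against an ordered rule set; hardware architectures parallelize exactly this lookup. A first-match linear search in Python (the rule fields and actions below are hypothetical examples):

```python
from dataclasses import dataclass
from ipaddress import ip_address, ip_network

@dataclass
class Rule:
    src: str      # source prefix, e.g. "10.0.0.0/8"
    dst: str      # destination prefix
    proto: int    # IP protocol number (6 = TCP, 17 = UDP)
    action: str   # what to do on a match

def classify(pkt, rules):
    """Return the action of the first matching rule — the sequential
    search that hardware classifiers evaluate in parallel."""
    for r in rules:
        if (ip_address(pkt["src"]) in ip_network(r.src)
                and ip_address(pkt["dst"]) in ip_network(r.dst)
                and pkt["proto"] == r.proto):
            return r.action
    return "default-deny"

rules = [Rule("10.0.0.0/8", "0.0.0.0/0", 6, "permit")]
action = classify({"src": "10.1.2.3", "dst": "8.8.8.8", "proto": 6}, rules)
```

The hardware challenge is doing this for every rule at once, on every packet, within the per-packet time budget of a 100 Gbps link.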
A number of critical design decisions, such as network topology, buffer sizes, flow control mechanism, and so forth, have to be evaluated in any NoC design. Design and verification of NoCs are based on either software simulations, which are extremely slow and inaccurate for complex models, or hardware emulations using low/mid-class FPGAs, where the scalability of the NoC system is intensively...
The belief propagation (BP) polar code decoder has been well studied from many aspects. This study proposes a hardware optimization that improves the performance of the polar BP decoder by modifying both the processing element (PE) and the early stopping criterion (ESC). The PE is optimized by using a high-speed parallel-prefix Ling adder instead of a carry-ripple adder, and the WIB ESC introduced in the literature is optimized by removing unnecessary...
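The parallel-prefix idea behind adders such as the Ling adder can be illustrated in software: carries are computed with generate/propagate prefix operations in O(log n) stages instead of rippling bit by bit. A bitwise Kogge-Stone sketch (a related parallel-prefix adder, not the paper's exact circuit):

```python
def kogge_stone_add(a, b, width=8):
    """Add two `width`-bit integers using Kogge-Stone parallel-prefix
    carry computation: log2(width) stages instead of a ripple chain."""
    g = a & b            # generate: positions that create a carry
    p = a ^ b            # propagate: positions that pass a carry along
    dist = 1
    while dist < width:  # prefix-combine (g, p) over doubling distances
        g |= p & (g << dist)
        p &= p << dist
        dist <<= 1
    mask = (1 << width) - 1
    return ((a ^ b) ^ (g << 1)) & mask  # sum bit = propagate XOR carry-in
```

A carry-ripple adder needs `width` sequential steps; the loop above finishes in log2(`width`) stages, which is the source of the speedup in the PE.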
Nowadays, many emerging technologies, such as augmented and virtual reality, require extremely high-rate data transmissions. This imposes an increasing demand on network throughput, which currently surpasses the capabilities of commercially available wireless communication systems. To address this constraint, some companies are considering the implementation of high-throughput wired technologies,...
There are many available NAT64 implementations, but we cannot measure their performance per the standards due to the lack of compliant testers. The aim of our effort is to design and write the first implementation of a test program that can provide the first answer to these needs. For benchmarking network interconnect devices we can use the recommendations of RFC 2544 (IP version independent)...
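RFC 2544 determines throughput by binary-searching for the highest offered rate at which the device under test forwards every frame without loss over a trial of fixed duration. A sketch of that search loop, with a simulated device standing in for real traffic generation (the 7.3 Gbit/s loss threshold is made up):

```python
def rfc2544_throughput(trial, lo=0.0, hi=10e9, tol=1e6):
    """Binary-search the zero-loss throughput (bits/s) per the RFC 2544
    method. `trial(rate)` must run one trial at `rate` and return True
    iff no frames were lost."""
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if trial(mid):
            lo = mid   # no loss: try a higher rate
        else:
            hi = mid   # loss observed: back off
    return lo

# Hypothetical DUT that starts dropping frames above 7.3 Gbit/s.
measured = rfc2544_throughput(lambda rate: rate <= 7.3e9)
```

A compliant tester wraps this loop around real frame generation and counting for each standard frame size.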
Datacenters should provide bandwidth guarantees to tenants for performance predictability. Ideally, this process should attain three important characteristics: work conservation, fairness, and simplicity. The first indicates that tenants can utilize unused bandwidth effectively without harming the bandwidth guarantees. The second means that tenants share the unused bandwidth following a certain...
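The fairness property described here is commonly formalized as max-min fairness, computed by progressive filling: satisfy the smallest demands first, then split what remains equally among the still-unsatisfied tenants. A sketch (tenant names and demands are illustrative):

```python
def max_min_share(capacity, demands):
    """Max-min fair split of `capacity` (e.g. Gbit/s) among tenants.
    Tenants whose demand fits under the current fair share get exactly
    their demand; everyone left is capped at an equal share."""
    alloc = {}
    remaining = capacity
    pending = sorted(demands, key=demands.get)  # ascending by demand
    while pending:
        fair = remaining / len(pending)
        tenant = pending[0]
        if demands[tenant] <= fair:     # demand fits: fully satisfy it
            alloc[tenant] = demands[tenant]
            remaining -= demands[tenant]
            pending.pop(0)
        else:                           # everyone left is capped equally
            for t in pending:
                alloc[t] = fair
            break
    return alloc

shares = max_min_share(10.0, {"A": 2.0, "B": 5.0, "C": 8.0})
```

Here tenant A's small demand is met in full, and the remaining 8 units are split evenly between B and C — no tenant can gain without another losing its fair share.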
A new VLSI algorithm for the discrete sine transform (DST) is presented that uses a new input restructuring sequence, an appropriate reordering of the elements involved, and short-length pseudo-cycle convolution structures. It relies on a new parallel decomposition of the DST that leads to a high-throughput VLSI implementation with low hardware cost. The proposed...
Providing the optimal configuration for a software router poses many technical challenges that are not present in dedicated hardware routers. One of them is how to characterize the performance variation caused by different configurations on commodity hardware. This paper addresses the problem of configuring a software router to minimize average packet latency. Since trying all combinations...
Network Function Virtualization promises to reduce the overall operational and capital expenses experienced by the network operators. Running multiple network functions on top of a standard x86 server instead of dedicated appliances can increase the utilization of the underlying hardware and reduce the maintenance and management costs. However, total cost of ownership calculations are typically a...
Today's data center servers are equipped with high-speed, complex network adaptors featuring an array of functions, e.g. hardware TX/RX queues, packet filters, rate limiters, etc. Recent work such as IX, Arrakis, and MultiStack has rekindled innovation in user-level network stacks that utilize these commodity network adaptors. In this paper, we revisit the idea of moving the stack's design from in-kernel...
Field Programmable Gate Arrays (FPGAs) excel at the implementation of local operators in terms of throughput per energy, since off-chip communication can be reduced with an application-specific on-chip memory configuration. Furthermore, data-level parallelism can be exploited efficiently through so-called loop coarsening, which processes multiple horizontal pixels simultaneously. Moreover, existing...
Many scientific applications rely on the evaluation of elementary functions. Nowadays, high-level programming languages provide their own elementary function libraries in software, using lookup tables and/or polynomial approximation. However, one downside is speed: lookup tables can cause cache thrashing, and polynomial approximations require a number of iterations to converge. Thus, elementary functions...
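To make the polynomial-approximation trade-off concrete, here is a fixed-degree kernel for sin(x) with no lookup table and no data-dependent iteration count — the shape of computation that maps well to a pipelined hardware function unit (the degree and interval are chosen for illustration):

```python
import math

def sin_poly(x):
    """Approximate sin(x) on [-pi/2, pi/2] with the degree-7 Taylor
    polynomial, evaluated by Horner's rule: a fixed sequence of
    multiply-adds, identical for every input."""
    x2 = x * x
    # sin(x) ≈ x - x^3/3! + x^5/5! - x^7/7!
    return x * (1 + x2 * (-1/6 + x2 * (1/120 + x2 * (-1/5040))))

# Worst-case error over the interval, sampled at 0.01 steps.
err = max(abs(sin_poly(x / 100) - math.sin(x / 100))
          for x in range(-157, 158))
```

Because the operation count is constant, latency is deterministic — unlike table lookups (cache-dependent) or iterative refinement (convergence-dependent).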
RISC-V is a new open-source general-purpose instruction set architecture (ISA) developed by the University of California, Berkeley. It allows everyone to design their own hardware circuits based on application characteristics, and it can be used in embedded devices, desktop computers, and high-performance servers. In this paper, we use the RISC-V processor to design a fast network packet processing system....
The screen content coding (SCC) extension to High Efficiency Video Coding (HEVC) offers substantial compression efficiency over the existing HEVC standard for computer-generated content. However, this gain in compression efficiency is achieved at the expense of further computational complexity, with several resource-hungry coding tools. Hence, extending SCC to HEVC hardware encoders can be challenging...
Cloud storage services are associated with high latency variance and degraded throughput, which is problematic when users are fetching and storing content for interactive applications. This can be attributed to performance hotspots created by slow nodes in a storage cluster, and to performance interference caused by multi-tenancy and background tasks such as data scrubbing, backfilling, recovery, etc...
In this paper we present a complete, open-source GZIP compressor implementation for FPGAs based on a systolic array architecture. GZIP is one of the most widely used compression algorithms. Besides the usual use case of compression for data storage, distributed computing systems such as Hadoop utilize compression to reduce the amount of data transferred between computing nodes in a cluster. However,...
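As a software reference point, GZIP round-tripping is a one-liner in most languages; a hardware compressor must emit bitstreams that a stock decompressor like the one below accepts (the payload is illustrative):

```python
import gzip

# Round-trip a payload through GZIP (DEFLATE plus header and CRC),
# the same format an FPGA compressor must produce.
payload = b"hardware " * 1000           # highly repetitive, compresses well
blob = gzip.compress(payload, compresslevel=6)
restored = gzip.decompress(blob)
ratio = len(payload) / len(blob)
```

Feeding a hardware compressor's output through `gzip.decompress` (or `gunzip`) is a simple end-to-end conformance check.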
This paper presents an ultra-high-speed/area-efficient Polar encoder design with very high system throughput for emerging next-generation 5G applications. In a demonstrated design example, the proposed hardware architecture is mainly based on 16-parallel radix-2 processing engines. An 8192-point Polar encoder is designed and synthesized with TSMC 40-nm CMOS technology, operating at clock frequency...
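Polar encoding itself is a network of XOR butterflies: x = u · F^⊗n over GF(2) with F = [[1,0],[1,1]], which is exactly what parallel processing engines unroll in hardware. A bit-level Python sketch (non-systematic, without the bit-reversal permutation; not the paper's exact architecture):

```python
def polar_encode(u):
    """Encode a bit list of power-of-two length with the polar kernel
    F = [[1,0],[1,1]]: log2(n) stages of XOR butterflies."""
    x = list(u)
    n = len(x)
    step = 1
    while step < n:                       # one stage per power of two
        for i in range(0, n, 2 * step):   # each butterfly block
            for j in range(i, i + step):
                x[j] ^= x[j + step]       # upper branch XORs the lower
        step *= 2
    return x
```

Each stage is fully parallel across blocks, so an N-point encoder needs only log2(N) pipeline stages of XOR gates; the 16-parallel engines mentioned above trade area for throughput along this structure. Since F is an involution over GF(2), encoding twice recovers the input, which makes a convenient self-test.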