Search results

Items from 1 to 20 out of 188 results

chapter

Parallel automata processor

Arun Subramaniyan, Reetuparna Das

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) > 600 - 612

Finite State Machines (FSM) are widely used computation models for many application domains. These embarrassingly sequential applications with irregular memory access patterns perform poorly on conventional von-Neumann architectures. The Micron Automata Processor (AP) is an in-situ memory-based computational architecture that accelerates non-deterministic finite automata (NFA) processing in hardware...

chapter

Hybrid trie based approach for longest prefix matching in IP packet processing

Surajeet Ghosh, Suraj Kesharwani, Vipul Mishra, Sanchita Saha Ray

TENCON 2017 - 2017 IEEE Region 10 Conference > 1532 - 1537

A hybrid trie based approach for longest prefix match (LPM) search scheme is proposed in this paper to handle the current prefix growth in an efficient manner. The proposed scheme is built around two sub-schemes: the first is the tree bitmap structure and the second is the trie based data structure. The main idea of this approach is to simplify the required prefix operations, viz., insertion,...

chapter

Toward a programmable FIB caching architecture

Garegin Grigoryan, Yaoqing Liu

2017 IEEE 25th International Conference on Network Protocols (ICNP) > 1 - 2

The current Internet routing ecosystem is neither sustainable nor economical. More than 711K IPv4 routes and more than 41K IPv6 routes exist in current global Forwarding Information Bases (FIBs), with growth rates increasing. This rapid growth has serious consequences, such as creating the need for costly FIB memory upgrades and increased potential for Internet service outages. And while FIB memories...

chapter

Quantifying and mitigating the costs of FPGA virtualization

Sadegh Yazdanshenas, Vaughn Betz

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 7

FPGAs are being incorporated into contemporary datacenters in order to improve computational capacity, power consumption, and processing latency. Efficiently integrating FPGAs in datacenters is, however, quite challenging. Ideally, smaller tasks could share a device and the cloud management layer would be able to partially reconfigure the device to allocate its free resources to incoming tasks. Moreover,...

chapter

HPC on FPGA clouds: 3D FFTs and implications for molecular dynamics

Jiayi Sheng, Chen Yang, Ahmed Sanaullah, Michael Papamichael, et al.

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

The architecture of the Microsoft Catapult II cloud places the accelerator (FPGA) as a bump-in-the-wire on the way to the network and thus promises a dramatic reduction in latency as layers of hardware and software are avoided. We demonstrate this capability with an implementation of the 3D FFT. Next we examine phased application elasticity, i.e., the use of a reduced set of nodes for some phases...

chapter

The onion routing performance using shadow-plugin-TOR

Hartanto Kusuma Wardana, Liauw Frediczen Handianto, Banu Wirawan Yohanes

2017 4th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI) > 1 - 5

Anonymous networks provide user privacy by protecting identity. The onion routing (TOR) project is one kind of Internet anonymous network that attracts many researchers and clients nowadays because of its simplicity and scalability. However, TOR performance is difficult to analyze within live TOR networks because of their distributed nature and security properties. This paper presents a TOR network...

chapter

Automotive sip LPDDR4 design space exploration for achieving system level SI performance

Wang Yao, Lakshmi Baskaran, Jaemin Shin, Tim Michalka, et al.

2017 IEEE International Symposium on Electromagnetic Compatibility & Signal/Power Integrity (EMCSI) > 203 - 208

A novel multi-chip System-in-Package (SiP) was designed specifically for automotive applications. This paper discussed the challenges and approaches of enabling the dual x32 LPDDR4 channels with external DRAMs running at 1866 MHz. System level design space was explored to achieve better SI performance. Several key design parameters were studied separately to investigate their impacts on the SI performance...

chapter

An IPv6 routing lookup algorithm based on subsection intensive compression and multi-branch tree

Yan Pan, Zhonghe Wei, Jianxiu Zhao, Min Guo

2017 IEEE International Conference on Information and Automation (ICIA) > 1168 - 1172

The balance between search time and storage space is a problem in routing lookup, and the proposed algorithm addresses it to some extent. It is based on the IPv6 prefix distribution and adopts different approaches to divide and compress different prefixes. Prefixes that can be divided exactly are compressed intensively. Other prefixes that can't be divided exactly are handled with a multi-branch tree method. According...

chapter

Architecting large-scale SRAM arrays with monolithic 3D integration

Joonho Kong, Young-Ho Gong, Sung Woo Chung

2017 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED) > 1 - 6

In this paper, we architect large-scale SRAM arrays with monolithic 3D (M3D) integration technology. We introduce M3D-based SRAM arrays with three different ways of integration: M3D-R (vertical routing-only), M3D-VBL (vertical bitline), and M3D-VWL (vertical wordline). We also apply M3D-based SRAM arrays to last-level caches: tag arrays for eDRAM LLCs and data arrays for SRAM LLCs. The proposed LLCs...

chapter

Exploring DDR4 Address Bus Design for High Speed Memory Interface

Nanju Na, Juan Wang, Sean Long, Changyi Su, et al.

2017 IEEE 67th Electronic Components and Technology Conference (ECTC) > 1843 - 1848

This paper discusses multi-point address channel design in fly-by topology for high speed memory interface. Waveform behaviors at DRAM locations along the channel are examined in depth with eye opening data in various channel design factors and device termination settings. Eye opening is exacerbated on the front DRAM from the controller more prominently due to ring-backs from high frequency reflections...

chapter

Implementing FPGA Overlay NoCs Using the Xilinx UltraScale Memory Cascades

Nachiket Kapre

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 40 - 47

We can enhance the performance and efficiency of deflection-routed FPGA overlay NoCs by exploiting the cascading feature of the Xilinx UltraScale BlockRAMs. This allows us to (1) harden the multiplexers in the NoC switch crossbars, and (2) efficiently add buffering support to deflection-routing. While buffering is not required for correct operation of a deflection routed NoC, it can boost network throughputs...

chapter

Optimization opportunities in RRAM-based FPGA architectures

Xifan Tang, Giovanni De Micheli, Pierre-Emmanuel Gaillardon

2017 IEEE 8th Latin American Symposium on Circuits & Systems (LASCAS) > 1 - 4

Static Random Access Memory (SRAM)-based routing multiplexers, whatever structure is employed, share a common limitation: their area, delay and power increase linearly with the input size. This property results in most SRAM-based FPGA architectures typically avoiding the use of large multiplexers. Resistive Random Access Memory (RRAM)-based multiplexers, built with one-level structure, have a unique...

chapter

High density, low energy, magnetic tunnel junction based block RAMs for memory-rich FPGAs

Kosuke Tatsumura, Sadegh Yazdanshenas, Vaughn Betz

2016 International Conference on Field-Programmable Technology (FPT) > 4 - 11

Many important applications demand large amounts of on-chip memory both to fully utilize an FPGA's computational capacity and to minimize energy-consuming off-chip memory accesses, leading some recent commercial FPGAs to add higher-capacity on-chip block RAMs (BRAMs). While memory is becoming more important to FPGA designs, SRAM scaling is becoming more difficult because of increasing device variation...

chapter

Effect of different design stages on the SEU failure rate of FPGA systems

Igor Villalta, Unai Bidarte, Julen Gomez-Cornejo, Jaime Jimenez, et al.

2016 Conference on Design of Circuits and Integrated Systems (DCIS) > 1 - 6

This work analyzes the effect of the different design stages on the failure rate of circuits implemented in FPGAs. A bitstream-based SEU emulation platform is used to inject faults in order to analyze the critical bits of the circuit. Experiments are done on two different testbenches, an FIR filter and a CORDIC chain. Tests consist of loading different variations of the designs in order to estimate...

chapter

On the Break-Even Point between Cloud-Assisted and Legacy Routing (Short Paper)

Prasun Kanti Dey, Murat Yuksel

2016 5th IEEE International Conference on Cloud Networking (Cloudnet) > 154 - 157

As more than 40K service providers are advertising 600K or more IP prefixes, scalability of routing has emerged as a matter of great concern. In this paper, to explore a spectrum of designs, we consider a Cloud-Assisted Routing (CAR) framework which follows a hybrid and opportunistic approach by keeping the high priority tasks at the router and using an adaptive router-cloud integration when beneficial...

chapter

High throughput neural network based embedded streaming multicore processors

Raqibul Hasan, Tarek M. Taha, Chris Yakopcic, David J. Mountain

2016 IEEE International Conference on Rebooting Computing (ICRC) > 1 - 8

With power consumption becoming a critical processor design issue, specialized architectures for low power processing are becoming popular. Several studies have shown that neural networks can be used for signal processing and pattern recognition applications. This study examines the design of memristor based multicore neural processors that would be used primarily to process data directly from sensors...

chapter

FPGA testing points optimization method based on important analysis

Guochang Zhou, Xiang Gao, Xiaoling Lai, Qi Zhu, et al.

2016 Prognostics and System Health Management Conference (PHM-Chengdu) > 1 - 5

By analyzing the characteristics of the FPGA circuit in the space and time dimensions, the circuit is divided into levels of “computing unit + memory/register”. An importance analysis model is proposed that combines the location importance, the connection degree among the nodes, and each node's own soft error probability. The testing points are then optimized based on the importance of each...

chapter

Heterogeneous memory assembly exploration using a floorplan and interconnect aware framework

Prakhar Raj Gupta, G.S. Visweswaran, Gaurav Narang, Anuj Grover

2016 29th IEEE International System-on-Chip Conference (SOCC) > 290 - 295

Embedded SRAM based memory sub-systems are an integral part of SoCs and have a large area footprint in modern SoCs today. Huge memory requirements are typically met by using an array of SRAM instances and optimal selection of these memory instances becomes imperative for SoC designers. We propose a framework based on the following approach: pre-sort a list of most suitable SRAM instances; create a...

chapter

Hoplite-DSP: Harnessing the Xilinx DSP48 multiplexers to efficiently support NoCs on FPGAs

Kumar H B Chethan, Nachiket Kapre

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 10

We can embed the crossbar functionality of NoC (network-on-chip) routers onto the hard multiplexers of Xilinx DSP48E primitives to support resource efficient mapping of FPGA overlay NoCs. This embedding also permits the use of dedicated hard wiring resources of the DSP cascade links to support vertical NoC channels. This unique mapping allows us to significantly reduce soft logic (LUTs+FFs) utilization...

chapter

Design of efficient NOC router for chip multiprocessor

Kiran, Kamna Solanki

2016 International Conference on Inventive Computation Technologies (ICICT) > 3 > 1 - 4

Router architecture plays an important role in a Network-on-chip design for achieving high throughput and low latency. In this paper, an output-buffered router has been emulated using the concept of a Distributed Shared Buffer Router. The main focus of the design was to increase throughput and lower latency with minimum area and power overhead.

Keywords:
ROUTING
RANDOM ACCESS MEMORY

Content availability

Available (185)
None (3)

Keywords

FIELD PROGRAMMABLE GATE ARRAYS (65)
COMPUTER ARCHITECTURE (30)
IP NETWORKS (30)
TABLE LOOKUP (29)
MEMORY MANAGEMENT (28)
HARDWARE (25)
SWITCHES (25)
ROUTING PROTOCOLS (23)
BANDWIDTH (22)
CLOCKS (22)
INTERNET (21)
TOPOLOGY (21)
FPGA (20)
LOGIC GATES (17)
SRAM CHIPS (17)
REGISTERS (16)
ALGORITHM DESIGN AND ANALYSIS (15)
THROUGHPUT (15)
INDEXES (14)
PROTOCOLS (14)
NETWORK TOPOLOGY (13)
WIRELESS SENSOR NETWORKS (13)
DELAY (12)
OPTIMIZATION (12)
SYSTEM-ON-A-CHIP (12)
TELECOMMUNICATION NETWORK ROUTING (12)
ARRAYS (11)
COMPUTERS (11)
MULTIPLEXING (11)
POWER DEMAND (11)
TRANSISTORS (11)
WIRES (11)
INTEGRATED CIRCUIT INTERCONNECTIONS (10)
SOFTWARE (10)
CIRCUIT FAULTS (9)
CMOS INTEGRATED CIRCUITS (9)
COMPLEXITY THEORY (9)
FAULT TOLERANCE (9)
NETWORK ROUTING (9)
SYSTEM-ON-CHIP (9)
TESTING (9)
COMPUTER AIDED MANUFACTURING (8)
LAYOUT (8)
MEMORY ARCHITECTURE (8)
RESOURCE MANAGEMENT (8)
AD HOC NETWORKS (7)
INTEGRATED CIRCUIT DESIGN (7)
LOGIC DESIGN (7)
MICROPROCESSORS (7)
PERFORMANCE EVALUATION (7)
PIPELINE PROCESSING (7)
PIPELINES (7)
REDUNDANCY (7)
RELIABILITY (7)
SILICON (7)
TILES (7)
WIRE (7)
WIRELESS COMMUNICATION (7)
BUILT-IN SELF-TEST (6)
COMPUTATIONAL MODELING (6)
CRYPTOGRAPHY (6)
DECODING (6)
EMBEDDED SYSTEMS (6)
IMPEDANCE (6)
MOBILE COMPUTING (6)
NETWORK-ON-CHIP (6)
PORTS (COMPUTERS) (6)
PROGRAMMING (6)
PROTOTYPES (6)
SUBSTRATES (6)
TRANSIENT ANALYSIS (6)
APPLICATION SPECIFIC INTEGRATED CIRCUITS (5)
BENCHMARK TESTING (5)
COMPUTER SCIENCE (5)
DATA MINING (5)
DELAYS (5)
DESIGN AUTOMATION (5)
DIGITAL SIGNAL PROCESSING (5)
ENGINES (5)
FAULT TOLERANT SYSTEMS (5)
IP LOOKUP (5)
LATCHES (5)
MONITORING (5)
NICKEL (5)
NONVOLATILE MEMORY (5)
PACKET FORWARDING (5)
PINS (5)
RADIATION DETECTORS (5)
RECEIVERS (5)
ROBUSTNESS (5)
ROUTING TABLE (5)
RUNTIME (5)
SECURITY (5)
SERVERS (5)
SYNCHRONIZATION (5)
TELECOMMUNICATION NETWORK TOPOLOGY (5)
TRANSPORT PROTOCOLS (5)
AEROSPACE ELECTRONICS (4)