Search results

Items from 21 to 40 out of 529 results

chapter

Investigating TI KeyStone II and quad-core ARM Cortex-A53 architectures for on-board space processing

Benjamin Schwaller, Barath Ramesh, Alan D. George

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

Future space missions require reliable architectures with higher performance and lower power consumption. Exploring new architectures worthy of undergoing the expensive and time-consuming process of radiation hardening is critical for this endeavor. Two such architectures are the Texas Instruments KeyStone II octal-core processor and the ARM® Cortex®-A53 (ARMv8) quad-core CPU. DSPs have been proven...

chapter

Algorithm and hardware co-optimized solution for large SpMV problems

Fazle Sadi, Larry Pileggi, Franz Franchetti

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

Sparse Matrix-Vector multiplication (SpMV) is a fundamental kernel for many scientific and engineering applications. However, SpMV performance and efficiency are poor on commercial off-the-shelf (COTS) architectures, especially when the data size exceeds on-chip memory or last level cache (LLC). In this work we present an algorithm co-optimized hardware accelerator for large SpMV problems. We start...

chapter

Toward Managing HPC Burst Buffers Effectively: Draining Strategy to Regulate Bursty I/O Behavior

Kun Tang, Ping Huang, Xubin He, Tao Lu, et al.

2017 IEEE 25th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS) > 87 - 98

HPC (high-performance computing) applications usually show bursty I/O behaviors. In order to expedite the applications, permanent storage systems are usually provisioned to serve such I/O bursts. As the era of exascale computing approaches, non-volatile RAM is introduced as burst buffers to absorb the bursty bulk data and relax the I/O provisioning requirement of the permanent storage systems. However,...

chapter

The memory challenge in computing systems: A survey

Norbert Wehn

2017 30th IEEE International System-on-Chip Conference (SOCC) > 1 - 2

It is well known that DRAM memory performance cannot keep pace with the performance of today's multicore compute systems. In addition to the memory bandwidth problem, there is another major challenge, namely, the power/energy challenge. DRAMs contribute substantially to the overall power consumption. Thus, there is a need for power and bandwidth optimization of the DRAM memory subsystems. Moreover,...

chapter

The onion routing performance using shadow-plugin-TOR

Hartanto Kusuma Wardana, Liauw Frediczen Handianto, Banu Wirawan Yohanes

2017 4th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI) > 1 - 5

Anonymous networks provide user privacy by protecting identity. The onion routing (TOR) project is one kind of anonymous Internet network that attracts many researchers and clients nowadays because of its simplicity and scalability. However, it is difficult to analyze TOR performance within live TOR networks because of its distributed and secure nature. This paper presents a TOR network...

chapter

A novel ReRAM-based processing-in-memory architecture for graph computing

Lei Han, Zhaoyan Shen, Zili Shao, H. Howie Huang, et al.

2017 IEEE 6th Non-Volatile Memory Systems and Applications Symposium (NVMSA) > 1 - 6

Graph algorithms such as breadth-first search (BFS) have been gaining ever-increasing importance in the era of Big Data. However, the memory bandwidth remains the key performance bottleneck for graph processing. To address this problem, we utilize processing-in-memory (PIM), combined with non-volatile metal-oxide resistive random access memory (ReRAM), to improve the performance of both computation...

chapter

A Staged Memory Resource Management Method for CMP systems

Yangguo Liu, Junlin Lu, Dong Tong, Xu Cheng

2017 IEEE 28th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 91 - 98

Memory interference is a critical impediment to system performance in CMP systems. To address this problem, we first propose a Dynamically Proportional Bandwidth Throttling policy (DPBT), which dynamically throttles back memory-intensive applications based on their memory access behavior. DPBT achieves a more balanced memory bandwidth partitioning. Moreover, we improve the previous memory channel partitioning...

chapter

Sub-megahertz linewidth single photon source suitable for quantum memories

Markus Rambach, Wing Yung Sarah Lau, Aleksandrina Nikolova, Till Weinhold, et al.

2017 Conference on Lasers and Electro-Optics Europe & European Quantum Electronics Conference (CLEO/Europe-EQEC) > 1

Hybrid quantum technologies seek to combine the advantages of two individual quantum architectures by transferring the information between the two systems. We want to benefit from the high mobility and ease of transmission of photons for quantum communication and exploit the excellent readout and storage capabilities of atomic qubits as a quantum memory, which is essential to build up quantum repeater...

chapter

SRAM: A State-Aware Risk Assessment Model for Intrusion Response

Fenghua Li, Fangxin Xiong, Chao Li, Lihua Yin, et al.

2017 IEEE Second International Conference on Data Science in Cyberspace (DSC) > 232 - 237

Recent advances in Intrusion Risk Assessment (IRA) have brought promising solutions to enhance Intrusion Response Systems (IRS). However, current research lacks reasonable solutions to exploit system state information. Without the system state, the IRA results may suffer from the high false rate of Intrusion Detection Systems (IDS). To address this limitation, we propose a novel State-Aware Risk...

chapter

A fully-integrated energy-efficient H.265/HEVC decoder with eDRAM for wearable devices

Mehul Tikekar, Vivienne Sze, Anantha Chandrakasan

2017 Symposium on VLSI Circuits > C230 - C231

Data movement to and from off-chip memory dominates energy consumption in most video decoders, with DRAM accesses consuming 2.8x–6x more energy than the processing itself. We present an H.265/HEVC video decoder with embedded DRAM (eDRAM) as main memory. We propose the following techniques to optimize data movement and reduce the power consumption of eDRAM: 1) lossless compression is used to store reference...

chapter

Augmenting Amdahl's Second Law: A Theoretical Model to Build Cost-Effective Balanced HPC Infrastructure for Data-Driven Science

Arghya Kusum Das, Jaeki Hong, Sayan Goswami, Richard Platania, et al.

2017 IEEE 10th International Conference on Cloud Computing (CLOUD) > 147 - 154

High-performance analysis of big data demands more computing resources, forcing similar growth in computation cost. The challenge for HPC system designers is therefore to provide high performance at lower cost. For high-performance yet cost-effective cyberinfrastructure, we propose a new system model augmenting Amdahl's second law for balanced systems to optimize price-performance-ratio...

chapter

CacheDOCS: A Dynamic Key-Value Object Caching Service

Julien Gascon-Samson, Michael Coppinger, Fan Jin, Jorg Kienzle, et al.

2017 IEEE 37th International Conference on Distributed Computing Systems Workshops (ICDCSW) > 383 - 388

Caching plays an important role in many domains, as it can lead to important performance improvements. A key-value based caching system typically stores the results of popular queries in efficient storage locations. While caching enjoys widespread usage in the context of dynamic web applications, most mainstream caching systems store static binary items, which makes them impractical for many real-world...

chapter

CubeX: Leveraging glocality of cube-based networks for RAM-based key-value store

Yiming Zhang, Dongsheng Li, Tian Tian, Ping Zhong

IEEE INFOCOM 2017 - IEEE Conference on Computer Communications > 1 - 9

RAM-based storage aggregates the RAM of servers in data center networks (DCN) to provide extremely high storage performance. For quick recovery of storage server failures, Mem-Cube [1] exploits the proximity of the BCube network to limit the recovery traffic to the recovery servers' 1-hop neighborhood. However, the previous design is applicable only to BCube and has suboptimal recovery performance due...

chapter

Energy-efficient scheduling method with cross-loop model for resource-limited CNN accelerator designs

Kaiyi Yang, Shihao Wang, Jianbin Zhou, Takeshi Yoshimura

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

State-of-the-art customized accelerators for convolutional neural networks (CNN) have achieved high throughput, while the huge amount of data movement remains the dominant part of the total energy cost. In this paper, we propose an energy-efficient scheduling approach to find an efficient dataflow that minimizes data movement within limited hardware resource budgets. In detail, two-level...

chapter

Per-Server Dominant-Share Fairness (PS-DSF): A multi-resource fair allocation mechanism for heterogeneous servers

Jalal Khamse-Ashari, Ioannis Lambadaris, George Kesidis, Bhuvan Urgaonkar, et al.

2017 IEEE International Conference on Communications (ICC) > 1 - 7

Users of cloud computing platforms pose different types of demands for multiple resources on servers (physical or virtual machines). Besides differences in their resource capacities, servers may be additionally heterogeneous in their ability to service users — certain users' tasks may only be serviced by a subset of the servers. We identify important shortcomings in existing multi-resource fair allocation...

chapter

Improving CPU Performance Through Dynamic GPU Access Throttling in CPU-GPU Heterogeneous Processors

Siddharth Rai, Mainak Chaudhuri

2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 18 - 29

Heterogeneous chip-multiprocessors with integrated CPU and GPU cores on the same die allow sharing of critical memory system resources among the applications executing on the two types of cores. In this paper, we explore memory system management driven by the quality of service (QoS) requirement of the GPU applications executing simultaneously with CPU applications in such heterogeneous platforms. Our...

chapter

Extending Message Passing Interface Windows to Storage

Sergio Rivas-Gomez, Stefano Markidis, Ivy Bo Peng, Erwin Laure, et al.

2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) > 727 - 730

This paper presents an extension to MPI supporting the one-sided communication model and window allocations in storage. Our design transparently integrates with the current MPI implementations, enabling applications to target MPI windows in storage, memory or both simultaneously, without major modifications. Initial performance results demonstrate that the presented MPI window extension could potentially...

chapter

GNSS interference monitoring and characterisation station

J. Rossouw van der Merwe, Daniel Meister, Christian Otto, Manuel Stahl, et al.

2017 European Navigation Conference (ENC) > 170 - 178

An interference monitoring station is presented which detects, characterises and logs interference events. The system operates autonomously and continuously over multiple global navigation satellite system (GNSS) bands. With a bandwidth of up to 80 MHz and an input resolution of 8 bit, an overall data rate of approximately 1.3 Gbit/s can be supported. The interference detection is carried out in real-time,...

chapter

Benchmarking and Metrics for Emerging Memory

Kirk Prall

2017 IEEE International Memory Workshop (IMW) > 1 - 5

Comparison of the dominant emerging memory technologies at the fundamental cell level will be presented. Metrics will be discussed and the technologies will be compared with the objective of judging the suitability for high density memory applications.

chapter

Capability Models for Manycore Memory Systems: A Case-Study with Xeon Phi KNL

Sabela Ramos, Torsten Hoefler

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 297 - 306

Increasingly complex memory systems and on-chip interconnects are developed to mitigate the data movement bottlenecks in manycore processors. One example of such a complex system is the Xeon Phi KNL CPU with three different types of memory, fifteen memory configuration options, and a complex on-chip mesh network connecting up to 72 cores. Users require a detailed understanding of the performance characteristics...

Keywords:
BANDWIDTH
RANDOM ACCESS MEMORY

