Live migration of virtual machines has attracted significant attention in recent years. It facilitates online system maintenance, load balancing, fault tolerance, and power management. The existing pre-copy live migration approach has to iteratively copy redundant memory pages, which causes high network overhead and slow migration. An alternative post-copy live migration approach can provide quick migration with...
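The iterative copying that pre-copy relies on can be sketched in a few lines. This is a toy model, not any paper's implementation: the `pre_copy_migrate` name, the `workload_rounds` dirty-page sets, and the stopping `threshold` are all illustrative assumptions.

```python
# Minimal sketch of iterative pre-copy live migration.

def pre_copy_migrate(pages, workload_rounds, threshold=2, max_rounds=10):
    """Return (rounds, total_pages_sent) for an iterative pre-copy run.

    pages: set of all page ids.
    workload_rounds: sets of pages the guest dirties during each copy round.
    """
    sent = 0
    dirty = set(pages)            # first round: every page is "dirty"
    for r in range(max_rounds):
        sent += len(dirty)        # copy the current dirty set over the network
        # pages written by the guest while this round was being copied
        dirty = set(workload_rounds[r]) if r < len(workload_rounds) else set()
        if len(dirty) <= threshold:
            break
    sent += len(dirty)            # final stop-and-copy of the remainder
    return r + 1, sent
```

For instance, migrating 100 pages while the guest dirties pages {1..5} and then {1, 2} sends 107 page copies over two iterative rounds plus a final stop-and-copy; pages 1 and 2 cross the network three times, which is exactly the redundant copying the abstract points out.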
The increasing adoption of GPUs as mainstream computing devices, coupled with the imminent availability of large high-bandwidth caches based on die-stacked memory, makes it important to analyze and understand modern GPU compute applications from the perspective of their memory access and data reuse characteristics. This paper presents detailed workload characterization studies of four GPU compute applications...
Big data processing has become an increasingly important field that has attracted much attention from academia and industry. However, it worsens the memory wall problem for processor design, i.e., the large performance gap between processor computation and memory access. The 3D stacked memory structure has been put forward as a promising method of relieving this problem. As non-volatile memory (NVM)...
Recent advancements in the architecture of Graphics Processing Units (GPUs) enable the acceleration of many general-purpose applications. Even with high memory bandwidth, GPUs still face the challenge of accelerating highly memory-intensive applications. To overcome this challenge, this paper investigates the impact of scaling up the memory partitions and of scaling the frequency of the...
Memory systems are critical to system responsiveness and operating costs. New memory technologies such as PCM, STT-MRAM, and RRAM are poised to provide an intermediate memory layer between DRAM and flash to better serve the needs of capacity- and latency-hungry datacenter applications. To drive their efficient deployment, it is imperative to make complex architectural decisions and justify the need to rethink...
Applications in modern data centers have a wide variety of resource requirements along the four main dimensions of computing, memory, storage, and networking. Data centers must manage these resources separately for each dimension, resulting in highly inefficient allocation schemes that lead to low utilization or over-provisioning of precious resources. However,...
Graphic Processing Units (GPUs) based on the Single Instruction Multiple Thread (SIMT) architecture are emerging as platforms that exploit parallelism more efficiently than Multiple Instruction Multiple Data (MIMD) architectures. A GPU has numerous shader cores and thousands of simultaneous fine-grained active threads. These threads are grouped into Cooperative Thread Arrays (CTAs). All the threads within...
Irregular applications, by their very nature, suffer from poor data locality. This often results in high miss rates for caches, and many long waits to off-chip memory. Historically, long latencies have been dealt with in two ways: (1) latency mitigation using large cache hierarchies, or (2) latency masking where threads relinquish their control after issuing a memory request. Multithreaded CPUs are...
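The latency-masking strategy mentioned above, where a thread relinquishes control after issuing a memory request, can be illustrated with a toy generator-based scheduler. The `thread` and `run` names are my own illustration, not from the paper.

```python
# Toy model of latency masking: each "thread" is a generator that yields
# when it issues a memory request, relinquishing the core so another
# ready thread can run (as in fine-grained multithreading on GPUs).

import collections

def thread(tid, n_requests):
    for i in range(n_requests):
        # ...do some compute, then issue a long-latency memory access
        yield (tid, i)   # relinquish the core until the request returns

def run(threads):
    ready = collections.deque(threads)
    issued = []
    while ready:
        t = ready.popleft()
        try:
            issued.append(next(t))  # run until the next memory request
            ready.append(t)         # requeue; latency overlaps with others
        except StopIteration:
            pass                    # thread finished
    return issued

# Two threads interleave their memory requests instead of stalling:
trace = run([thread(0, 2), thread(1, 2)])
```

The resulting trace alternates between the two threads, so each one's memory latency is hidden behind the other's execution rather than behind a large cache hierarchy.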
A video-on-demand (VOD) system allows users to access media at any time without leaving their home. Hard disk drives (HDDs) have become popular for VOD storage, to store and manage a large amount of data. In practical cases, disk storage throughput is limited by slow HDDs, and disk performance degrades drastically when the server needs to serve many simultaneous video streams. On the other hand,...
Data centers offer many services hosted on dedicated physical servers, which are often under-utilized in terms of the resources used. The goal of virtual machine placement is to maximize the usage of available resources and to save power by shutting down unused physical machines. A study of different virtual machine placement techniques in the data center shows that resources are wasted due to the multi-dimensionality...
With the growth of cloud computing, security and privacy are becoming more and more important. Timing channel attacks are among the most notable security threats for memory controllers, due to competition for shared resources. However, existing protection strategies that ensure the determinism of memory accesses by dividing bandwidth introduce great latency and performance degradation. This...
This paper describes our experience with storage optimization that utilizes cost-effective PCIe solid-state drives (SSDs) to improve the overall performance of a Spark framework. A key problem we address is the limited memory system performance. In particular, we adopt high-performance SSDs to alleviate the saturated DRAM bandwidth and its limited capacity. We utilize SSDs to store shuffle data and...
Despite the ability of modern processors to execute a variety of algorithms efficiently through instructions based on registers of ever-increasing width, some applications perform poorly due to the limited interconnection bandwidth between main memory and processing units. Near-data processing has started to gain acceptance as an accelerator approach due to technology constraints and...
The Hybrid Memory Cube (HMC) is an innovative DRAM architecture that adopts 3D-stacking to improve bandwidth and save energy. An HMC module adopts separate receive and transmit lanes and thus may achieve the maximal memory bandwidth only if data can be driven at full speed in both directions. However, due to the natural read and write imbalance in modern applications, the effective memory bandwidth...
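The read/write imbalance effect is easy to quantify under a simplified model (my own illustration, not the paper's analysis): if each direction's lanes sustain `lane_bw` and a fraction `read_fraction` of the traffic is reads, total throughput is capped by the busier direction.

```python
def effective_bandwidth(lane_bw, read_fraction):
    """Total throughput with separate, equally sized receive/transmit
    lanes: the busier direction saturates first and caps the total."""
    return lane_bw / max(read_fraction, 1.0 - read_fraction)

# Perfectly balanced traffic drives both directions at full speed...
full_duplex = effective_bandwidth(160.0, 0.5)   # 320.0 GB/s
# ...but a read-heavy 80/20 mix leaves the transmit lanes mostly idle.
read_heavy = effective_bandwidth(160.0, 0.8)    # 200.0 GB/s
```

The 160 GB/s per-direction figure is hypothetical; the point is that an 80/20 read/write mix realizes well under two-thirds of the full-duplex peak, which is the imbalance the abstract describes.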
The Fast TracKer (FTK) to Level-2 Interface Card (FLIC) of the ATLAS FTK trigger upgrade is the final component in the FTK chain of custom electronics connecting the system to the High-Level Trigger (HLT). The FTK performs full event tracking using the ATLAS Silicon detectors for every Level-1 (L1) accepted event at 100 kHz. The FLIC is a custom Advanced Telecommunications Computing Architecture (ATCA) card...
High Performance Computing (HPC) aggregates computing power in order to solve large and complex problems in different knowledge areas. Nowadays, HPC users can utilize virtualized infrastructures as a low-cost alternative for deploying their applications. However, virtualization brings some challenges for HPC, especially in regard to the overhead caused by hypervisors. In this work, our main goal is to analyze...
The continuous demand for higher storage density in Solid State Drives (SSDs) is pushing NAND-Flash technology to its reliability and performance limits. Among the many memory technology candidates to replace it, the Resistive RAM (RRAM) concept seems to be emerging. However, before an entire SSD based on RRAM memory devices can be designed, a design space exploration of the disk features must be performed...
This paper presents an operating-system-managed die-stacked DRAM called i-MIRROR that mirrors high-locality pages from off-chip DRAM. Reducing cache tag area, reducing transfer bandwidth, and improving hit latency all at once while using die-stacked DRAM as a hardware cache is extremely challenging. In this paper, we show that performance and energy efficiency can be obtained...
Die-stacked DRAM caches are likely to become available in mainstream chips in the near future. A DRAM cache is typically used as a last-level shared cache behind the traditional hierarchy of on-chip SRAM caches. However, its internal organization differs from that of traditional caches, as it is based on DRAM technology that provides significantly diverse access latencies depending on the state of its internal...
Advances in die-stacking (3D) technology have enabled the tight integration of significant quantities of DRAM with high-performance computation logic. How to integrate this technology into the overall architecture of a computing system is an open question. While much recent effort has focused on hardware-based techniques for using die-stacked memory (e.g., caching), in this paper we explore what it...