Search results for: D. Hill

Items from 1 to 4 out of 4 results

article

gem5-gpu: A Heterogeneous CPU-GPU Simulator

Jason Power, Joel Hestness, Marc S. Orr, Mark D. Hill, et al.

IEEE Computer Architecture Letters, 2015, vol. 14, no. 1, pp. 34-36

gem5-gpu is a new simulator that models tightly integrated CPU-GPU systems. It builds on gem5, a modular full-system CPU simulator, and GPGPU-Sim, a detailed GPGPU simulator. gem5-gpu routes most memory accesses through Ruby, which is a highly configurable memory system in gem5. By doing this, it is able to simulate many system configurations, ranging from a system with coherent caches and a single...

chapter

Border control: Sandboxing accelerators

Lena E. Olson, Jason Power, Mark D. Hill, David A. Wood

2015 48th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pp. 470-481


As hardware accelerators proliferate, there is a desire to logically integrate them more tightly with CPUs through interfaces such as shared virtual memory. Although this integration has programmability and performance benefits, it may also have serious security and fault isolation implications, especially when accelerators are designed by third parties. Unchecked, accelerators could make incorrect...

chapter

QuickRelease: A throughput-oriented approach to release consistency on GPUs

Blake A. Hechtman, Shuai Che, Derek R. Hower, Yingying Tian, et al.

2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA), pp. 189-200


Graphics processing units (GPUs) have specialized throughput-oriented memory systems optimized for streaming writes with scratchpad memories to capture locality explicitly. Expanding the utility of GPUs beyond graphics encourages designs that simplify programming (e.g., using caches instead of scratchpads) and better support irregular applications with finer-grain synchronization. Our hypothesis...

chapter

Supporting x86-64 address translation for 100s of GPU lanes

Jason Power, Mark D. Hill, David A. Wood

2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA), pp. 568-578


Efficient memory sharing between CPU and GPU threads can greatly expand the effective set of GPGPU workloads. For increased programmability, this memory should be uniformly virtualized, necessitating compatible address translation support for GPU memory references. However, even a modest GPU might need 100s of translations per cycle (6 CUs * 64 lanes/CU) with memory access patterns designed for throughput...


INFONA - science communication portal
