Search results

Items from 1 to 4 out of 4 results

chapter

Overlapping Data Transfers with Computation on GPU with Tiles

Burak Bastem, Didem Unat, Weiqun Zhang, Ann Almgren, et al.

2017 46th International Conference on Parallel Processing (ICPP) > 171 - 180

GPUs are employed to accelerate scientific applications; however, they require much more programming effort from programmers, particularly because of the disjoint address spaces between the host and the device. OpenACC and OpenMP 4.0 provide directive-based programming solutions to alleviate the programming burden; however, synchronous data movement can create a performance bottleneck in fully taking...

chapter

Directive-Based Pipelining Extension for OpenMP

Xuewen Cui, Thomas R. W. Scogland, Bronis R. de Supinski, Wu-Chun Feng

2016 IEEE International Conference on Cluster Computing (CLUSTER) > 481 - 484

Programming models like CUDA, OpenMP, OpenACC and OpenCL are designed to offload compute-intensive workloads to accelerators efficiently. However, the naive offload model, which synchronously copies and executes in sequence, requires extensive hand-tuning of techniques, such as pipelining to overlap computation and communication. Therefore, we propose an easy-to-use, directive-based pipelining extension...

chapter

Automatic Parallelization of GPU Applications Using OpenCL

Lizandro D. Solano-Quinde, Brett M. Bode, Arun K. Somani

2015 Asia-Pacific Conference on Computer Aided System Engineering > 276 - 283

2015 Asia-Pacific Conference on Computer Aided System Engineering (APCASE)

Graphics Processing Units (GPUs) have been successfully used to accelerate scientific applications due to their computation power and the availability of programming languages that make writing scientific applications for GPUs more approachable. However, since the programming model of GPUs requires offloading all the data to the GPU memory, the memory footprint of the application is limited to the...

chapter

GPUdmm: A high-performance and memory-oblivious GPU architecture using dynamic memory management

Youngsok Kim, Jaewon Lee, Jae-Eon Jo, Jangwoo Kim

2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA) > 546 - 557

GPU programmers suffer from programmer-managed GPU memory because both performance and programmability heavily depend on GPU memory allocation and CPU-GPU data transfer mechanisms. To improve performance and programmability, programmers should be able to place only the data frequently accessed by GPU on GPU memory while overlapping CPU-GPU data transfers and GPU executions as much as possible. However,...
