Search results

Items from 1 to 5 out of 5 results

chapter

Employing Compression Solutions under OpenACC

Ebad Salehi, Ahmad Lashgar, Amirali Baniasadi

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 348 - 356

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

For GPUs to achieve their peak performance, effective and efficient usage of memory bandwidth is necessary. To this end, programmers invest extensive development effort to optimize a GPU program, specially its memory bandwidth usage. The OpenACC programming model has been introduced to tackle the accelerators programming complexity. However, this model's coarse-grained control on a program can make...

chapter

GMH: A Message Passing Toolkit for GPU Clusters

Jie Chen, William Watson, Weizhen Mao

2010 IEEE 16th International Conference on Parallel and Distributed Systems > 35 - 42

2010 IEEE 16th International Conference on Parallel and Distributed Systems (ICPADS 2010)

Driven by the market demand for high-definition 3D graphics, commodity graphics processing units (GPUs) have evolved into highly parallel, multi-threaded, many-core processors, which are ideal for data parallel computing. Many applications have been ported to run on a single GPU with tremendous speedups using general C-style programming languages such as CUDA. However, large applications require multiple...

chapter

Formal Description and Optimization Based High - Performance Computing on CUDA

Bo Li, Huacheng Zhao, Jingjing Liang

2009 First International Conference on Information Science and Engineering > 219 - 224

2009 1st International Conference on Information Science and Engineering (ICISE 2009)

In recent years, with the development of GPU, based on the general purpose computation on graphics processors has became a new field. Aiming at the processing of GPU, this paper provides the formal description for data parallel mode, a detailed description of the CUDA programming mode land the principle of optimization. It shows by the comparative experiment that CUDA owns strongly of the ability...

chapter

Message passing for GPGPU clusters: CudaMPI

O.S. Lawlor

2009 IEEE International Conference on Cluster Computing and Workshops > 1 - 8

2009 IEEE International Conference on Cluster Computing and Workshops (CLUSTER)

We present and analyze two new communication libraries, cudaMPI and glMPI, that provide an MPI-like message passing interface to communicate data stored on the graphics cards of a distributed-memory parallel computer. These libraries can help applications that perform general purpose computations on these networked GPU clusters. We explore how to efficiently support both point-to-point and collective...

chapter

Software Pipelined Execution of Stream Programs on GPUs

A. Udupa, R. Govindarajan, M.J. Thazhuthaveetil

2009 International Symposium on Code Generation and Optimization > 200 - 209

2009 7th Annual IEEE/ACM International Symposium on Code Generation and Optimization (CGO 2009)

The StreamIt programming model has been proposed to exploit parallelism in streaming applications on general purpose multi-core architectures. This model allows programmers to specify the structure of a program as a set of filters that act upon data, and a set of communication channels between them. The StreamIt graphs describe task, data and pipeline parallelism which can be exploited on modern graphics...

Filter options

Keywords:
KERNEL
PARALLEL PROCESSING
PROGRAMMING
BANDWIDTH

Publication date

Set your own date range

Keywords

COMPUTER GRAPHICS (2)
COPROCESSORS (2)
CUDA (2)
GPU (2)
GRAPHICS PROCESSING UNIT (2)
GRAPHICS PROCESSING UNITS (2)
MESSAGE PASSING (2)
3D GRAPHICS (1)
ACCELERATORS (1)
C-STYLE PROGRAMMING LANGUAGES (1)
CLUSTER (1)
COLLECTIVE COMMUNICATION (1)
COMMODITY GRAPHICS PROCESSING UNITS (1)
COMMUNICATION LIBRARIES (1)
COMPRESSION (1)
COMPUTATIONAL MODELING (1)
COMPUTE UNIFIED DEVICE ARCHITECTURE (1)
COMPUTER GRAPHIC EQUIPMENT (1)
CUDA PROGRAMMING MODE (1)
CUDAMPI (1)
DATA PARALLEL COMPUTING (1)
DATA TRANSFER (1)
DATA-PARALLEL THREAD GROUP (1)
DISTRIBUTED-MEMORY PARALLEL COMPUTER (1)
FORMAL DESCRIPTION (1)
FORMAL SPECIFICATION (1)
GENERAL PURPOSE COMPUTATION (1)
GLMPI (1)
GPGPU CLUSTERS (1)
GPU CLUSTERS (1)
GPU MESSAGE HANDLER (1)
GPU PROGRAMMING (1)
GRAPHICS (1)
GRAPHICS CARDS (1)
HARDWARE (1)
HIGH MEMORY BANDWIDTH (1)
INSTRUCTION SETS (1)
INTEGER LINEAR PROGRAM (1)
LINEAR PROGRAMMING (1)
MANY-CORE PROCESSOR (1)
MESSAGE PASSING INTERFACE (1)
MESSAGE PASSING TOOLKIT (1)
MESSAGE SYSTEMS (1)
MPI (1)
MPI RANK (1)
MPI STYLE POINT-TO-POINT COMMUNICATION (1)
MULTI-CORE ARCHITECTURES (1)
MULTI-THREADING (1)
MULTITHREADED PROCESSOR (1)
NVIDIA GPU (1)
OPENACC (1)
OPTIMIZATION BASED HIGH PERFORMANCE COMPUTING (1)
PARALLEL PROCESSOR (1)
PARALLEL PROGRAMMING (1)
PIPELINE PROCESSING (1)
PIXEL (1)
PROGRAM PROCESSORS (1)
RANDOM ACCESS MEMORY (1)
SCHEDULES (1)
SOFTWARE (1)
SOFTWARE DESIGN (1)
SOFTWARE PIPELINED EXECUTION (1)
SOFTWARE PIPELINING (1)
STANDARDS (1)
STREAM PROGRAMMING (1)
STREAM PROGRAMS (1)
STREAMIT GRAPHS (1)
STREAMIT PROGRAMMING MODEL (1)
VIRTUAL GPU (1)
YARN (1)
more

INFONA - science communication portal

Search results

Employing Compression Solutions under OpenACC

GMH: A Message Passing Toolkit for GPU Clusters

Formal Description and Optimization Based High - Performance Computing on CUDA

Message passing for GPGPU clusters: CudaMPI

Software Pipelined Execution of Stream Programs on GPUs

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options