Search results

Items from 1 to 5 out of 5 results

chapter

Runtime Coordinated Heterogeneous Tasks in Charm++

Michael P. Robson, Ronak Buch, Laxmikant V. Kale

2016 Second International Workshop on Extreme Scale Programming Models and Middlewar (ESPM2) > 40 - 43

2016 Second International Workshop on Extreme Scale Programming Models and Middleware (ESPM2)

Effective utilization of the increasingly heterogeneous hardware in modern supercomputers is a significant challenge. Many applications have seen performance gains by using GPUs, but many implementations leave CPUs sitting idle.In this paper, we describe a runtime managed system for coordinating heterogeneous execution. This system manages data transfers to and from GPU devices and schedules work...

chapter

OpenSHMEM Non-blocking Data Movement Operations with MVAPICH2-X: Early Experiences

Khaled Hamidouche, Jie Zhang, Dhabaleswar K. Panda, Karen Tomko

2016 PGAS Applications Workshop (PAW) > 9 - 16

2016 PGAS Applications Workshop (PAW)

PGAS models with a lightweight synchronization and shared memory abstraction, are seen as a good alternative to the Message Passing model for irregular communication patterns. OpenSHMEM is a library based PGAS model. OpenSHMEM 1.3 introduced Non-Blocking data movement operations to provide better asynchronous progress and overlap. In this paper, we present our experiences in designing Non-Blocking...

chapter

TIDeFlow: The Time Iterated Dependency Flow Execution Model

Daniel Orozco, Elkin Garcia, Robert Pavel, Rishi Khan, more

2011 First Workshop on Data-Flow Execution Models for Extreme Scale Computing > 1 - 9

2011 First Workshop on Data-Flow Execution Models for Extreme Scale Computing (DFM)

The many-core revolution brought forward by recent advances in computer architecture has created immense challenges in the writing of parallel programs for High Performance Computing (HPC). Development of parallel HPC programs remains an art, and a universaldoctrine for synchronization, scheduling and execution in general has not been found for many-core/multi-core architectures. These issues are...

chapter

Task Superscalar: An Out-of-Order Task Pipeline

Y Etsion, F Cabarcas, A Rico, A Ramirez, more

2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture > 89 - 100

2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2010)

We present \emph{Task Super scalar}, an abstraction of instruction-level out-of-order pipeline that operates at the task-level. Like ILP pipelines, which uncover parallelism in a sequential instruction stream, task super scalar uncovers task-level parallelism among tasks generated by a sequential thread. Utilizing intuitive programmer annotations of task inputs and outputs, the task super scalar pipeline...

chapter

Speculative execution on multi-GPU systems

Gregory Diamos, Sudhakar Yalamanchili

2010 IEEE International Symposium on Parallel&Distributed Processing (IPDPS) > 1 - 12

2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

The lag of parallel programming models and languages behind the advance of heterogeneous many-core processors has left a gap between the computational capability of modern systems and the ability of applications to exploit them. Emerging programming models, such as CUDA and OpenCL, force developers to explicitly partition applications into components (kernels) and assign them to accelerators in order...

Filter options

Keywords:
KERNEL
RUNTIME
PROGRAMMING
PARALLEL PROGRAMMING

Publication date

Set your own date range

Keywords

PROGRAM PROCESSORS (2)
ACCELERATOR (1)
ACCELERATOR ARCHITECTURES (1)
APPLICATION PARTITIONING (1)
BENCHMARK TESTING (1)
CMP/MANYCORE (1)
CODELETS (1)
COLOR (1)
COMPONENTS (1)
COMPUTATIONAL CAPABILITY (1)
COMPUTATIONAL MODELING (1)
COMPUTER ARCHITECTURE (1)
COMPUTER SCIENCE (1)
COMPUTERS AND INFORMATION PROCESSING (1)
COPROCESSORS (1)
CUDA (1)
DATA MODELS (1)
DATA STRUCTURES (1)
DATAFLOW (1)
DECODING (1)
DEPENDENCY GRAPH (1)
DISTRIBUTED TASK SUPERSCALAR PIPELINE (1)
DYNAMIC PARALLELIZATION TECHNIQUES (1)
ELECTRONICS PACKAGING (1)
EXECUTION MODEL (1)
GRAPH LANGUAGES (1)
GRAPHICS PROCESSING UNITS (1)
HARDWARE (1)
HARMONY EXECUTION MODEL (1)
HARMONY RUNTIME (1)
HETEROGENEOUS MANY-CORE PROCESSORS (1)
HETEROGENEOUS SYSTEM (1)
HIGH PERFORMANCE COMPUTING (1)
IMAGE COLOR ANALYSIS (1)
INSTRUCTION LEVEL ABSTRACTION (1)
INTERTASK DATA DEPENDENCY (1)
INTUITIVE PROGRAMMER ANNOTATIONS (1)
ISA (1)
ITERATED DATAFLOW (1)
KERNEL LEVEL SPECULATION (1)
MAGNETIC CORES (1)
MICRO-ARCHITECTURE (1)
MULTI-GPU SYSTEMS (1)
MULTIPROCESSING SYSTEMS (1)
NONSPECULATIVE TASK (1)
OPENCL (1)
OUT-OF-ORDER EXECUTION (1)
PARALLEL PROCESSING (1)
PARALLEL PROGRAMMING LANGUAGES (1)
PARALLEL PROGRAMMING MODELS (1)
PERFORMANCE EVALUATION (1)
PIPELINES (1)
PROGRAMMING MODEL (1)
REGISTERS (1)
RESOURCE MANAGEMENT (1)
RUNTIME SYSTEM (1)
SEQUENTIAL INSTRUCTION STREAM (1)
SEQUENTIAL PROGRAMMING MODEL (1)
SEQUENTIAL THREAD (1)
SPECULATIVE EXECUTION (1)
SUPERCOMPUTERS (1)
SYNCHRONIZATION (1)
TASK ANALYSIS (1)
TASK LEVEL PARALLELISM (1)
TASK PIPELINE (1)
TASK SUPERSCALAR (1)
TASKS OUT OF ORDER (1)
TIDEFLOW (1)
more

INFONA - science communication portal

Search results

Runtime Coordinated Heterogeneous Tasks in Charm++

OpenSHMEM Non-blocking Data Movement Operations with MVAPICH2-X: Early Experiences

TIDeFlow: The Time Iterated Dependency Flow Execution Model

Task Superscalar: An Out-of-Order Task Pipeline

Speculative execution on multi-GPU systems

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options