Search results for: Hiroaki Kobayashi

Items from 1 to 4 out of 4 results

chapter

The Importance of Dynamic Load Balancing among OpenMP Thread Teams for Irregular Workloads

Xiong Xiao, Shoichi Hirasawa, Hiroyuki Takizawa, Hiroaki Kobayashi

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 529 - 535

2016 Fourth International Symposium on Computing and Networking (CANDAR)

Recently, massively-parallel many-core processors such as Intel Xeon Phi coprocessors have attracted researchers' attentions because various applications are significantly accelerated with those processors. In the field of high-performance computing, OpenMP is a standard programming model commonly used to parallelize a kernel loop for many-core processors. For hierarchical parallel processing, OpenMP...

chapter

A User-Defined Code Transformation Approach to Overlapping MPI Communication with Computation

Yasuharu Hayashi, Hiroyuki Takizawa, Hiroaki Kobayashi

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 508 - 514

2016 Fourth International Symposium on Computing and Networking (CANDAR)

The Xevolver framework has been developed to enable application programmers to define their own code translation rules outside of their codes so that they can express platform-specific optimizations separately from algorithm-level application codes. Due to the diversity of HPC node architectures, the Xevolver framework has so far mainly been used to separate node-level code optimizations from application...

chapter

A Comparison of Performance Tunabilities between OpenCL and OpenACC

Makoto Sugawara, Shoichi Hirasawa, Kazuhiko Komatsu, Hiroyuki Takizawa, more

2013 IEEE 7th International Symposium on Embedded Multicore Socs > 147 - 152

2013 IEEE 7th International Symposium on Embedded Multicore Socs (MCSoC)

To design and develop any auto tuning mechanisms for OpenACC, it is important to clarify the differences between conventional GPU programming models and OpenACC in terms of available programming and tuning techniques, called performance tunabilities. This paper hence discusses the performance tunabilities of OpenACC and OpenCL. As OpenACC cannot synchronize threads running on GPUs, some important...

chapter

A History-Based Performance Prediction Model with Profile Data Classification for Automatic Task Allocation in Heterogeneous Computing Systems

Katsuto Sato, Kazuhiko Komatsu, Hiroyuki Takizawa, Hiroaki Kobayashi

2011 IEEE Ninth International Symposium on Parallel and Distributed Processing with Applications > 135 - 142

2011 IEEE 9th International Symposium on Parallel and Distributed Processing with Applications (ISPA)

In this paper, we propose a runtime performance prediction model for automatic selection of accelerators to execute kernels in OpenCL. The proposed method is a history-based approach that uses profile data for performance prediction. The profile data are classified into some groups, from each of which its own performance model is derived. As the execution time of a kernel depends on some runtime parameters...

Filter options

Keywords:
INSTRUCTION SETS

Publication date

Set your own date range

Keywords

DYNAMIC SCHEDULING (2)
KERNEL (2)
OPENCL (2)
OPTIMIZATION (2)
PROGRAMMING (2)
ACCURACY (1)
ANNOTATION (1)
AUTOTUNING (1)
BENCHMARK TESTING (1)
CODE TRANSFORMATION (1)
COMPUTATIONAL MODELING (1)
COPROCESSORS (1)
CORRELATION (1)
CUSTOMIZABLE (1)
DATA TRANSFER (1)
DISTRIBUTED-MEMORY PARALLEL (1)
ELECTRONIC MAIL (1)
GPGPU (1)
GRAPHICS PROCESSING UNITS (1)
HETEROGENEOUS (1)
HISTORY-BASED (1)
LOAD MANAGEMENT (1)
MESSAGE SYSTEMS (1)
MPI (1)
OPENACC (1)
OVERLAPPING OF COMMUNICATION AND COMPUTATION (1)
PERFORMANCE PORTABILITY (1)
PERFORMANCE PREDICTION (1)
PREDICTIVE MODELS (1)
RUNTIME (1)
SKIN (1)
SYNCHRONIZATION (1)
USER-DEFINED DIRECTIVE (1)
XEVOLVER (1)
XML (1)
more

INFONA - science communication portal

Search results for: Hiroaki Kobayashi

The Importance of Dynamic Load Balancing among OpenMP Thread Teams for Irregular Workloads

A User-Defined Code Transformation Approach to Overlapping MPI Communication with Computation

A Comparison of Performance Tunabilities between OpenCL and OpenACC

A History-Based Performance Prediction Model with Profile Data Classification for Automatic Task Allocation in Heterogeneous Computing Systems

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options