Search results

Items from 1 to 7 out of 7 results

chapter

Online scalability characterization of data-parallel programs on many cores

Younghyun Cho, Surim Oh, Bernhard Egger

2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) > 191 - 205

We present an accurate online scalability prediction model for data-parallel programs on NUMA many-core systems. Memory contention is considered to be the major limiting factor of program scalability as data parallelism limits the amount of synchronization or data dependencies between parallel work units. Reflecting the architecture of NUMA systems, contention is modeled at the last-level caches of...

chapter

Preemptive thread block scheduling with online structural runtime prediction for concurrent GPGPU kernels

Sreepathi Pai, R. Govindarajan, Matthew J. Thazhuthaveetil

2014 23rd International Conference on Parallel Architecture and Compilation (PACT) > 483 - 484

Recent NVIDIA Graphics Processing Units (GPUs) can execute multiple kernels concurrently. On these GPUs, the thread block scheduler (TBS) currently uses the FIFO policy to schedule thread blocks of concurrent kernels. We show that the FIFO policy leaves performance to chance, resulting in significant loss of performance and fairness. To improve performance and fairness, we propose use of the preemptive...

chapter

PEMOGEN: Automatic adaptive performance modeling during program runtime

Arnamoy Bhattacharyya, Torsten Hoefler

2014 23rd International Conference on Parallel Architecture and Compilation (PACT) > 393 - 404

Traditional means of gathering performance data are tracing, which is limited by the available storage, and profiling, which has limited accuracy. Performance modeling is often used to interpret the tracing data and generate performance predictions. We aim to complement the traditional data collection mechanisms with online performance modeling, a method that generates performance models while the...

chapter

A Simplified and Accurate Model of Power-Performance Efficiency on Emergent GPU Architectures

Shuaiwen Song, Chunyi Su, Barry Rountree, Kirk W. Cameron

2013 IEEE 27th International Symposium on Parallel and Distributed Processing > 673 - 686

2013 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Emergent heterogeneous systems must be optimized for both power and performance at exascale. Massive parallelism combined with complex memory hierarchies form a barrier to efficient application and architecture design. These challenges are exacerbated with GPUs as parallelism increases orders of magnitude and power consumption can easily double. Models have been proposed to isolate power and performance...

chapter

A History-Based Performance Prediction Model with Profile Data Classification for Automatic Task Allocation in Heterogeneous Computing Systems

Katsuto Sato, Kazuhiko Komatsu, Hiroyuki Takizawa, Hiroaki Kobayashi

2011 IEEE Ninth International Symposium on Parallel and Distributed Processing with Applications > 135 - 142

2011 IEEE 9th International Symposium on Parallel and Distributed Processing with Applications (ISPA)

In this paper, we propose a runtime performance prediction model for automatic selection of accelerators to execute kernels in OpenCL. The proposed method is a history-based approach that uses profile data for performance prediction. The profile data are classified into some groups, from each of which its own performance model is derived. As the execution time of a kernel depends on some runtime parameters...

chapter

Predicting Parameter Sweep Jobs: From Simulation to Grid Implementation

P. Hellinckx, S. Verboven, F. Arickx, J. Broeckhove

2009 International Conference on Complex, Intelligent and Software Intensive Systems > 402 - 408

2009 International Conference on Complex, Intelligent and Software Intensive Systems (CISIS 2009)

Efficiently using the computational power made available through desktop grids based distributed systems is a complicated and many-sided problem, caused by the intermittent resource availability. In this paper a novel solution is presented for predicting the runtimes of parameter sweep jobs. These jobs are characterized by their lack of inter-dependence and suitability for runtime prediction by modeling...

chapter

Predicting Multiple Metrics for Queries: Better Decisions Enabled by Machine Learning

A. Ganapathi, H. Kuno, U. Dayal, J.L. Wiener, et al.

2009 IEEE 25th International Conference on Data Engineering > 592 - 603

2009 IEEE 25th International Conference on Data Engineering. ICDE 2009

One of the most challenging aspects of managing a very large data warehouse is identifying how queries will behave before they start executing. Yet knowing their performance characteristics - their runtimes and resource usage - can solve two important problems. First, every database vendor struggles with managing unexpectedly long-running queries. When these long-running queries can be identified...

Filter options

Data set: ieee
Keywords: KERNEL, RUNTIME, PREDICTIVE MODELS
