2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC)

chapter

QuickTM: A Hardware Solution to a High Performance Unbounded Transactional Memory

S Sanyal, S Roy

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 62 - 71

Transactional Memory (TM) is an emerging technology that simplifies concurrency control in a parallel program. In this paper we propose QuickTM, a new hardware transactional memory (HTM) architecture. It incorporates three features to address known bottlenecks in existing HTM architectures. First, we propose hardware-only dynamic detection of true-shared variables. Our result shows that...

chapter

Effortless and Efficient Distributed Data-Partitioning in Linear Algebra

Carlos de Blas Cartón, Arturo Gonzalez-Escribano, Diego R Llanos

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 89 - 97

This paper introduces a new technique to exploit compositions of different data-layout techniques with Hitmap, a library for hierarchical tiling and automatic mapping of arrays. We show how Hitmap is used to implement block-cyclic layouts for a parallel LU decomposition algorithm. The paper compares the well-known ScaLAPACK implementation of LU, as well as other carefully optimized MPI versions,...

chapter

Enhancing Muesli's Data Parallel Skeletons for Multi-core Computer Architectures

P Ciechanowicz, H Kuchen

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 108 - 113

Algorithmic skeletons encapsulate typical parallel programming patterns so that users can apply them easily. Existing skeleton libraries usually work on distributed memory machines. We present an extension of our skeleton library Muesli which now allows the same application to be used without modification on a variety of parallel machines ranging from multi-processor distributed memory to many-core...

chapter

Sparse Matrix Formats Evaluation and Optimization on a GPU

Maxime R Hugues, Serge G Petiton

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 122 - 129

The data parallel programming model is making a comeback with massive multicore architectures. The GPU is one of these and offers important possibilities to accelerate linear algebra. However, the irregular structure of sparse matrix operations makes it difficult to obtain efficient performance with this programming model. Performance depends on the format used to store values and on the matrix structure. The sparse...

chapter

Evaluation of the Task Programming Model in the Parallelization of Wavefront Problems

A J Dios, R Asenjo, A Navarro, F Corbera, et al.

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 257 - 264

This paper analyzes the applicability of the task programming model in the parallelization of generic wavefront problems. Computations on this type of problem are characterized by a data dependency pattern across a data space, which can produce a variable number of independent tasks through the traversal of such space. Precisely, we think that it is better to formulate the parallelization of this...

chapter

Parallel Computational Modelling of Inelastic Neutron Scattering in Multi-node and Multi-core Architectures

M T Garba, Horacio González-Vélez, D L Roach

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 509 - 514

This paper examines the initial parallel implementation of SCATTER, a computationally intensive inelastic neutron scattering routine with polycrystalline averaging capability, for the General Utility Lattice Program (GULP). Of particular importance to structural investigation on the atomic scale, this work identifies the computational features of SCATTER relevant to a parallel implementation and presents...

chapter

OpenCL: Make Ubiquitous Supercomputing Possible

Slo-Li Chu, Chih-Chieh Hsiao

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 556 - 561

Due to the dramatic requirements of 3D games and applications, graphics processing units (GPUs) or general-purpose graphics processing units (GPGPUs) have become required components in modern computer systems. While these devices enable high parallelism with a huge number of processing elements, the utilization of their capabilities in general scientific applications is still low due to their difficult...
