Search results

Items from 1 to 13 out of 13 results

chapter

A Compiler-Based Tool for Array Analysis in HPC Applications

Ahmad Qawasmeh, Barbara Chapman, Amrita Banerjee

2012 41st International Conference on Parallel Processing Workshops > 454 - 463

2012 41st International Conference on Parallel Processing Workshops (ICPPW)

Array region analysis plays a significant role in various optimizations at compile time. Displaying array access information efficiently in HPC applications has been a vital challenge for scientists and developers for the past few years. Dragon array region analysis tool is a powerful and interactive tool that was built on top of the Open UH compiler, an open source C/C++/Fortran compiler, that supports...

chapter

A Polyhedral Modeling Based Source-to-Source Code Optimization Framework for GPGPU

Chenxi Wang, Kang Kang, Maohua Zhu, Yangdong Deng

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 1964 - 1970

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

In this paper, we propose a source-to-source code optimization framework for general purpose computing on graphics processing units (GPGPU). Our framework is based on a re-formulation of the polyhedral loop transformation theory under the context of GPGPU. We prove that the number of actual memory transactions can be used as a performance metric to guide the code optimization process. In addition,...

chapter

Implementation of XcalableMP Device Acceleration Extention with OpenCL

Takuma Nomizu, Daisuke Takahashi, Jinpil Lee, Taisuke Boku, more

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 2394 - 2403

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

Due to their outstanding computational performance, many acceleration devices, such as GPUs, the Cell Broadband Engine (Cell/B.E.), and multi-core computing are attracting a lot of attention in the field of high-performance computing. Although there are many programming models and languages de-signed for programming accelerators, such as CUDA, AMD Accelerated Parallel Processing (AMD APP), and OpenCL,...

chapter

Image Authentication Algorithm on GPU

P.L.V. Vihari, Manoj Mishra

2012 International Conference on Communication Systems and Network Technologies > 874 - 878

2012 International Conference on Communication Systems and Network Technologies (CSNT)

As the demand for research on Image/ Content authentication has significantly increased, many authentication schemes have been proposed so far. But most of them are time consuming. This research concentrates on decreasing the time needed by an Image authentication algorithm. In this paper, we have shown a CUDA-based implementation of content authentication algorithm with NVIDIA's GeForce 8400 GS GPU...

chapter

Implementation of graph algorithms over GPU: A comparative analysis

Swarish Dashora, Nilay Khare

2012 IEEE Students' Conference on Electrical, Electronics and Computer Science > 1 - 8

2012 IEEE Students' Conference on Electrical, Electronics and Computer Science (SCEECS)

GPU (Graphics Processing Unit) provides high computational speed at a very low cost as compared to high end systems. The field of parallel processing using GPU is advancing very fast with a new technology being introduced in the field every day. With such advancements, it is necessary to review the major works done in this field. Graph traversal is one of the major challenges in this field. So far...

chapter

Acceleration for CFD applications on large GPU clusters: An NPB case study

Fengshun Lu, Junqiang Song, Xiaoqun Cao, Xiaoqian Zhu

2011 6th International Conference on Computer Sciences and Convergence Information Technology (ICCIT) > 534 - 538

2011 6th International Conference on Computer Sciences and Convergence Information Technology (ICCIT)

Computational fluid dynamics (CFD) applications have an ever-growing demand for the power of high performance computing (HPC) infrastructure. Many CFD simulations have benefited from newly-acknowledged GPU clusters. However, few of them have exploited both the CPU and the GPU computational resources within the heterogeneous HPC platforms. In this paper, we endeavor to demonstrate the approach of making...

chapter

Particle Gradient Multi-objective Evolutionary Algorithm Based on GPU with CUDA

Xuezhi Yue, Zhijian Wu, Kangshun Li

2010 Third International Symposium on Information Science and Engineering > 540 - 544

2010 International Symposium on Information Science and Engineering (ISISE)

In the paper, particle gradient multi-objective evolutionary algorithm (PGMOEA) on GPU is presented. PGMOEA extends the classical particle dynamic multi-objective evolutionary algorithm by incorporating the gradient information of each particle from evolutionary programming. We perform experiments to compare PGMOEA on GPU with PGMOEA on CPU and demonstrate that PGMOEA on GPU is much more effective...

chapter

Parallel Particle-Based Reaction Diffusion: A GPU Implementation

L Dematté

2010 Ninth International Workshop on Parallel and Distributed Methods in Verification, and Second International Workshop on High Performance Computational Systems Biology > 67 - 77

2010 9th International Workshop on Parallel & Distributed Methods in Verification and 2nd International Workshop on High Performance Computational Systems Biology (PDMC-HiBi 2010)

Space is a very important aspect in the simulation of biochemical models, recently, the need for simulation algorithms able to cope with space is becoming more and more compelling. Complex and large models of biochemical systems need to deal with the movement of single molecules and particles, taking into consideration localised fluctuations, transportation phenomena and diffusion. A common drawback...

chapter

GPUMP: A Multiple-Precision Integer Library for GPUs

Kaiyong Zhao, Xiaowen Chu

2010 10th IEEE International Conference on Computer and Information Technology > 1164 - 1168

2010 IEEE 10th International Conference on Computer and Information Technology (CIT)

Multiple-precision integer operations are key components of many security applications; but unfortunately they are computationally expensive on contemporary CPUs. In this paper, we present our design and implementation of a multiple-precision integer library for GPUs which is implemented by CUDA. We report our experimental results which show that a significant speedup can be achieved by GPUs as compared...

chapter

FPGA Circuit Synthesis of Accelerator Data-Parallel Programs

Barry Bond, Kerry Hammil, Lubomir Litchev, Satnam Singh

2010 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines > 167 - 170

2010 IEEE 18th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2010)

This paper describes the techniques used to describe and synthesize FPGA circuits expressed in a data-parallel domain specific language (DSL) called Accelerator. We identify the subset of data-parallel descriptions that are supported by our system and explain how we track memory access patterns which allow us to generate efficient FPGA circuits.

chapter

AUTO-GC: Automatic translation of data mining applications to GPU clusters

Wenjing Ma, Gagan Agrawal

2010 IEEE International Symposium on Parallel&Distributed Processing, Workshops and Phd Forum (IPDPSW) > 1 - 8

2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW 2010)

Because of the very favorable price to performance ratio of the GPUs, a popular parallel programming configuration today is a cluster of GPUs. However, extracting performance on such a configuration would typically require programming in both MPI and CUDA, thus requiring a high degree of expertise and effort. It is clearly desirable to be able to support higher-level programming of this emerging high-performance...

chapter

Simulating anomalous diffusion on graphics processing units

Karl Heinz Hoffmann, Michael Hofmann, Jens Lang, Gudula Runger, more

2010 IEEE International Symposium on Parallel&Distributed Processing, Workshops and Phd Forum (IPDPSW) > 1 - 8

2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW 2010)

The computational power of modern graphics processing units (GPUs) has become an interesting alternative in high performance computing. The specialized hardware of GPUs delivers a high degree of parallelism and performance. Various applications in scientific computing have been implemented such that computationally intensive parts are executed on GPUs. In this article, we present a GPU implementation...

chapter

Linear optimization on modern GPUs

D.G. Spampinato, A.C. Elster

2009 IEEE International Symposium on Parallel&Distributed Processing > 1 - 8

2009 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Optimization algorithms are becoming increasingly more important in many areas, such as finance and engineering. Typically, real problems involve several hundreds of variables, and are subject to as many constraints. Several methods have been developed trying to reduce the theoretical time complexity. Nevertheless, when problems exceed reasonable sizes they end up being very computationally intensive...

Filter options

Data set:
ieee
Keywords:
ARRAYS
PROGRAMMING
GRAPHICS PROCESSING UNIT

Publication date

Set your own date range

Keywords

CUDA (6)
GPU (6)
KERNEL (6)
COPROCESSORS (5)
INSTRUCTION SETS (5)
COMPUTER GRAPHIC EQUIPMENT (4)
COMPUTATIONAL MODELING (3)
DATA MINING (3)
OPTIMIZATION (3)
CHEMISTRY COMPUTING (2)
DIFFUSION (2)
GRAPHICS (2)
INDEXES (2)
PARALLEL ARCHITECTURES (2)
PARALLEL PROCESSING (2)
PARALLEL PROGRAMMING (2)
ACCELERATION (1)
ACCELERATOR (1)
ACCELERATOR DATA-PARALLEL PROGRAM (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ALGORITHMS (1)
ANALYSIS TOOL (1)
ANOMALOUS DIFFUSION SIMULATION PROCESS (1)
API (1)
APPLICATION PROGRAM INTERFACES (1)
ARRAY REGION ANALYSIS (1)
ATLAS-BASED CPU VERSION (1)
AUTHENTICATION (1)
AUTO-GC (1)
BIOCHEMICAL SYSTEMS (1)
BIOLOGICAL SYSTEM MODELING (1)
BIOLOGY COMPUTING (1)
BROWNIAN DYNAMICS (1)
CHEMICAL REACTIONS (1)
CLUSTER (1)
CLUSTERING ALGORITHMS (1)
CODE GENERATION SYSTEM (1)
COMPILER-BASED TOOL (1)
COMPUTATIONAL FLUID DYNAMICS (1)
COMPUTE UNIFIED DEVICE ARCHITECTURE (1)
COMPUTEUNIFIEDDEVICE ARCHITECTURE (1)
DATA-PARALLEL DESCRIPTION (1)
DIFFUSED ALGORITHM (1)
DOMAIN SPECIFIC LANGUAGE (1)
ELECTRONIC ENGINEERING COMPUTING (1)
EVOLUTIONARY COMPUTATION (1)
FIELD PROGRAMMABLE GATE ARRAYS (1)
FPGA CIRCUIT SYNTHESIS (1)
FRACTAL STRUCTURES (1)
FRACTALS (1)
GPGPU (1)
GPGPU TECHNOLOGIES (1)
GPU CLUSTERS (1)
GPU PROGRAMMING (1)
GPUMP (1)
GPUS (1)
GRAPH ALGORITHMS (1)
GRAPHICS PROCESSING UNITS (1)
GRAPHICS PROCESSINGUNIT (1)
HARDWARE (1)
HIGH PERFORMANCE COMPUTING (1)
HIGH PERFORMANCE COMPUTING PLATFORM (1)
INTEGRATED CIRCUIT MODELING (1)
IRREGULAR ALGORITHMS (1)
IRREGULAR COMPUTATIONAL STRUCTURE (1)
K-MEAN CLUSTERING (1)
LIBRARIES (1)
LINEAR OPTIMIZATION (1)
LINEAR PROGRAMMING (1)
LINEAR PROGRAMMING PROBLEMS (1)
LINEAR-BASED TECHNIQUES (1)
LOCALISED FLUCTUATIONS (1)
LOGIC DESIGN (1)
MAGNETIC CORES (1)
MATHEMATICAL MODEL (1)
MATRIX MULTIPLICATION (1)
MATRIX-MATRIX MULTIPLICATION (1)
MEMORY ACCESS PATTERN (1)
MIDDLEWARE (1)
MODERN GPUS (1)
MPI (1)
MULTI-OBJECTIVE OPTIMIZATION PROBLEM (1)
MULTIPLE-PRECISION ALGORITHM (1)
MULTIPLE-PRECISION INTEGER LIBRARY (1)
NETWORK SYNTHESIS (1)
NVIDIA CUDA PROGRAMMING LIBRARY (1)
OPENCL (1)
PARALLEL (1)
PARALLEL ALGORITHMS (1)
PARALLEL DATA MINING ALGORITHM (1)
PARALLEL PARTICLE BASED REACTION DIFFUSION (1)
PARALLEL SIMULATION ALGORITHMS (1)
PARALLELISM (1)
PARTICLE GRADIENT MULTI-OBJECTIVE EVOLUTIONARY ALGORITHM (1)
PARTICLES (1)
PARTITIONING ALGORITHMS (1)
PATTERN CLUSTERING (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options