Search results

Items from 1 to 10 out of 10 results

chapter

Programming the Adapteva Epiphany 64-Core Network-on-Chip Coprocessor

Anish Varghese, Bob Edwards, Gaurav Mitra, Alistair P. Rendell

2014 IEEE International Parallel & Distributed Processing Symposium Workshops > 984 - 992

2014 IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPSW)

With energy efficiency and power consumption being the primary impediment in the path to exascale systems, low-power high performance embedded systems are of increasing interest. The Parallella System-on-module (SoM) created by Adapteva combines the Epiphany-IV 64-core coprocessor with a host ARM processor housed in a Zynq System-on-chip. The Epiphany integrates low-power RISC cores on a 2D mesh network...

chapter

Parallel Particle-Based Reaction Diffusion: A GPU Implementation

L Dematté

2010 Ninth International Workshop on Parallel and Distributed Methods in Verification, and Second International Workshop on High Performance Computational Systems Biology > 67 - 77

2010 9th International Workshop on Parallel & Distributed Methods in Verification and 2nd International Workshop on High Performance Computational Systems Biology (PDMC-HiBi 2010)

Space is a very important aspect in the simulation of biochemical models, recently, the need for simulation algorithms able to cope with space is becoming more and more compelling. Complex and large models of biochemical systems need to deal with the movement of single molecules and particles, taking into consideration localised fluctuations, transportation phenomena and diffusion. A common drawback...

chapter

GPUMP: A Multiple-Precision Integer Library for GPUs

Kaiyong Zhao, Xiaowen Chu

2010 10th IEEE International Conference on Computer and Information Technology > 1164 - 1168

2010 IEEE 10th International Conference on Computer and Information Technology (CIT)

Multiple-precision integer operations are key components of many security applications; but unfortunately they are computationally expensive on contemporary CPUs. In this paper, we present our design and implementation of a multiple-precision integer library for GPUs which is implemented by CUDA. We report our experimental results which show that a significant speedup can be achieved by GPUs as compared...

chapter

AUTO-GC: Automatic translation of data mining applications to GPU clusters

Wenjing Ma, Gagan Agrawal

2010 IEEE International Symposium on Parallel&Distributed Processing, Workshops and Phd Forum (IPDPSW) > 1 - 8

2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW 2010)

Because of the very favorable price to performance ratio of the GPUs, a popular parallel programming configuration today is a cluster of GPUs. However, extracting performance on such a configuration would typically require programming in both MPI and CUDA, thus requiring a high degree of expertise and effort. It is clearly desirable to be able to support higher-level programming of this emerging high-performance...

chapter

Simulating anomalous diffusion on graphics processing units

Karl Heinz Hoffmann, Michael Hofmann, Jens Lang, Gudula Runger, more

2010 IEEE International Symposium on Parallel&Distributed Processing, Workshops and Phd Forum (IPDPSW) > 1 - 8

2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW 2010)

The computational power of modern graphics processing units (GPUs) has become an interesting alternative in high performance computing. The specialized hardware of GPUs delivers a high degree of parallelism and performance. Various applications in scientific computing have been implemented such that computationally intensive parts are executed on GPUs. In this article, we present a GPU implementation...

chapter

Accelerating scientific applications using GPU's

M. Taher

2009 4th International Design and Test Workshop (IDT) > 1 - 6

2009 4th International Design and Test Workshop (IDT 2009)

Graphics processing units (GPUs) have emerged as a powerful platform for high-performance computation. They have been successfully used to accelerate many scientific workloads. Typically, the computationally intensive parts of the application are offloaded to the GPU, which serves as the CPU's parallel coprocessor. The key to effective utilization of GPUs for scientific computing is the design and...

chapter

Thalweg: A framework for programming 1,000 machines with 1,000 cores

A.L. Beberg, V.S. Pande

2009 IEEE International Symposium on Parallel&Distributed Processing > 1 - 7

2009 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

While modern large-scale computing tasks have grown to span many machines, each with many cores, traditional programming models have not kept up with these advancements, resulting in difficulty exploiting these computing resources with only modest programmer effort. Thalweg seeks to address this breakdown in several ways. It provides a model for designing algorithms that have the potential to scale...

chapter

Linear optimization on modern GPUs

D.G. Spampinato, A.C. Elster

2009 IEEE International Symposium on Parallel&Distributed Processing > 1 - 8

2009 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Optimization algorithms are becoming increasingly more important in many areas, such as finance and engineering. Typically, real problems involve several hundreds of variables, and are subject to as many constraints. Several methods have been developed trying to reduce the theoretical time complexity. Nevertheless, when problems exceed reasonable sizes they end up being very computationally intensive...

chapter

Practical Pre-stack Kirchhoff Time Migration of Seismic Processing on General Purpose GPU

Xiaohua Shi, Xu Wang, Changhai Zhao, Haiyan Yang

2009 WRI World Congress on Computer Science and Information Engineering > 2 > 461 - 465

2009 WRI World Congress on Computer Science and Information Engineering, CSIE

In this paper, we introduced three prototypes of GPGPU solutions on NVidia GeForce8800GT for a practical Pre-stack Kirchhoff Time Migration program. We presented how to re-design and re-implement the original CPU code to efficiency GPU code. The prototypes are more than at most 7.2 times faster than its CPU version on Intelpsilas P4 3.0G.

chapter

Accelerate Your Graphic Program with GPU/CPU Cache

Zhou Likun, Chen Dingfang

2008 International Conference on Cyberworlds > 667 - 671

2008 International Conference on Cyberworlds

This paper discusses how to optimize the digital graphic program with cache system used in GPU/CPU architecture to gain more FPS. Firstly, we introduce the basic principle of cache system summarily; secondly, we discuss the three main organization and mapping technologies of cache system in detail, and then compare these three cache mapping solutions by giving examples; thirdly, illustrate the cache-friendly...

Filter options

Data set:
ieee
Keywords:
ARRAYS
PROGRAMMING
COPROCESSORS

Publication date

Set your own date range

Keywords

COMPUTER GRAPHIC EQUIPMENT (5)
GRAPHICS PROCESSING UNIT (5)
GPU (4)
GRAPHICS (4)
CUDA (3)
DATA MINING (3)
HARDWARE (3)
KERNEL (3)
API (2)
APPLICATION PROGRAM INTERFACES (2)
CHEMISTRY COMPUTING (2)
COMPUTATIONAL MODELING (2)
COMPUTER GRAPHICS (2)
DIFFUSION (2)
GRAPHICS PROCESSING UNITS (2)
LIBRARIES (2)
PARALLEL ARCHITECTURES (2)
SCIENTIFIC COMPUTING (2)
YARN (2)
ACCELERATION (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ANOMALOUS DIFFUSION SIMULATION PROCESS (1)
ATLAS-BASED CPU VERSION (1)
AUTO-GC (1)
BIOCHEMICAL SYSTEMS (1)
BIOLOGICAL SYSTEM MODELING (1)
BIOLOGY COMPUTING (1)
BROWNIAN DYNAMICS (1)
BUILT-IN SELF-TEST (1)
CACHE MAPPING (1)
CACHE MAPPING SOLUTIONS (1)
CACHE SYSTEM (1)
CACHE-FRIENDLY PROGRAMMING (1)
CACHE-FRIENDLY PROGRAMMING STYLE (1)
CHEMICAL REACTIONS (1)
CLOCKS (1)
CLUSTERING ALGORITHMS (1)
CODE GENERATION SYSTEM (1)
COMPUTE UNIFIED DEVICE ARCHITECTURE (1)
COMPUTER ARCHITECTURE (1)
COMPUTER SCIENCE EDUCATION (1)
CONTEXT (1)
CPU CODE (1)
CPU PARALLEL COPROCESSOR (1)
CUDA PROGRAMMING MODEL (1)
DATA ENCRYPTION STANDARD (1)
DATA STRUCTURES (1)
DIFFUSED ALGORITHM (1)
DIGITAL GRAPHIC PROGRAM (1)
DISTRIBUTED COMPUTING (1)
EARTH (1)
EFFICIENT DATA PARALLEL ALGORITHMS (1)
EPIPHANY (1)
FPS (1)
FRACTAL STRUCTURES (1)
FRACTALS (1)
GAIN (1)
GENERAL PURPOSE GPU (1)
GEOPHYSICS COMPUTING (1)
GPGPU (1)
GPGPU TECHNOLOGIES (1)
GPU CLUSTERS (1)
GPU PROGRAMMING (1)
GPU-CPU CACHE (1)
GPUMP (1)
GPUS (1)
HIGH PERFORMANCE COMPUTING (1)
HIGH PERFORMANCE COMPUTING PLATFORM (1)
INDEXES (1)
INSTRUCTION SETS (1)
INTEL P4 3.0G (1)
IRREGULAR ALGORITHMS (1)
IRREGULAR COMPUTATIONAL STRUCTURE (1)
K-MEAN CLUSTERING (1)
LABORATORY FRAMEWORK (1)
LINEAR OPTIMIZATION (1)
LINEAR PROGRAMMING (1)
LINEAR PROGRAMMING PROBLEMS (1)
LOCALISED FLUCTUATIONS (1)
MACHINE PROGRAMMING (1)
MAGNETIC CORES (1)
MATHEMATICAL MODEL (1)
MATRICES (1)
MATRIX MULTIPLICATION (1)
MATRIX-MATRIX MULTIPLICATION (1)
MIDDLEWARE (1)
MODERN GPUS (1)
MODERN LARGE-SCALE COMPUTING TASKS (1)
MPI (1)
MULTICORE COMPUTING (1)
MULTIPLE-PRECISION ALGORITHM (1)
MULTIPLE-PRECISION INTEGER LIBRARY (1)
MULTIPROCESSING SYSTEMS (1)
NETWORK-ON-CHIP (1)
NVIDIA CUDA PROGRAMMING LIBRARY (1)
NVIDIA GEFORCE8800GT (1)
NVIDIA GPU (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options