Wyniki wyszukiwania

Pozycje od 1 do 10 spośród 10 wyników

rozdział

Programming the Adapteva Epiphany 64-Core Network-on-Chip Coprocessor

Anish Varghese, Bob Edwards, Gaurav Mitra, Alistair P. Rendell

2014 IEEE International Parallel & Distributed Processing Symposium Workshops > 984 - 992

2014 IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPSW)

With energy efficiency and power consumption being the primary impediment in the path to exascale systems, low-power high performance embedded systems are of increasing interest. The Parallella System-on-module (SoM) created by Adapteva combines the Epiphany-IV 64-core coprocessor with a host ARM processor housed in a Zynq System-on-chip. The Epiphany integrates low-power RISC cores on a 2D mesh network...

rozdział

Parallel Particle-Based Reaction Diffusion: A GPU Implementation

L Dematté

2010 Ninth International Workshop on Parallel and Distributed Methods in Verification, and Second International Workshop on High Performance Computational Systems Biology > 67 - 77

2010 9th International Workshop on Parallel & Distributed Methods in Verification and 2nd International Workshop on High Performance Computational Systems Biology (PDMC-HiBi 2010)

Space is a very important aspect in the simulation of biochemical models, recently, the need for simulation algorithms able to cope with space is becoming more and more compelling. Complex and large models of biochemical systems need to deal with the movement of single molecules and particles, taking into consideration localised fluctuations, transportation phenomena and diffusion. A common drawback...

rozdział

GPUMP: A Multiple-Precision Integer Library for GPUs

Kaiyong Zhao, Xiaowen Chu

2010 10th IEEE International Conference on Computer and Information Technology > 1164 - 1168

2010 IEEE 10th International Conference on Computer and Information Technology (CIT)

Multiple-precision integer operations are key components of many security applications; but unfortunately they are computationally expensive on contemporary CPUs. In this paper, we present our design and implementation of a multiple-precision integer library for GPUs which is implemented by CUDA. We report our experimental results which show that a significant speedup can be achieved by GPUs as compared...

rozdział

AUTO-GC: Automatic translation of data mining applications to GPU clusters

Wenjing Ma, Gagan Agrawal

2010 IEEE International Symposium on Parallel&Distributed Processing, Workshops and Phd Forum (IPDPSW) > 1 - 8

2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW 2010)

Because of the very favorable price to performance ratio of the GPUs, a popular parallel programming configuration today is a cluster of GPUs. However, extracting performance on such a configuration would typically require programming in both MPI and CUDA, thus requiring a high degree of expertise and effort. It is clearly desirable to be able to support higher-level programming of this emerging high-performance...

rozdział

Simulating anomalous diffusion on graphics processing units

Karl Heinz Hoffmann, Michael Hofmann, Jens Lang, Gudula Runger, więcej

2010 IEEE International Symposium on Parallel&Distributed Processing, Workshops and Phd Forum (IPDPSW) > 1 - 8

2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW 2010)

The computational power of modern graphics processing units (GPUs) has become an interesting alternative in high performance computing. The specialized hardware of GPUs delivers a high degree of parallelism and performance. Various applications in scientific computing have been implemented such that computationally intensive parts are executed on GPUs. In this article, we present a GPU implementation...

rozdział

Accelerating scientific applications using GPU's

M. Taher

2009 4th International Design and Test Workshop (IDT) > 1 - 6

2009 4th International Design and Test Workshop (IDT 2009)

Graphics processing units (GPUs) have emerged as a powerful platform for high-performance computation. They have been successfully used to accelerate many scientific workloads. Typically, the computationally intensive parts of the application are offloaded to the GPU, which serves as the CPU's parallel coprocessor. The key to effective utilization of GPUs for scientific computing is the design and...

rozdział

Thalweg: A framework for programming 1,000 machines with 1,000 cores

A.L. Beberg, V.S. Pande

2009 IEEE International Symposium on Parallel&Distributed Processing > 1 - 7

2009 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

While modern large-scale computing tasks have grown to span many machines, each with many cores, traditional programming models have not kept up with these advancements, resulting in difficulty exploiting these computing resources with only modest programmer effort. Thalweg seeks to address this breakdown in several ways. It provides a model for designing algorithms that have the potential to scale...

rozdział

Linear optimization on modern GPUs

D.G. Spampinato, A.C. Elster

2009 IEEE International Symposium on Parallel&Distributed Processing > 1 - 8

2009 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Optimization algorithms are becoming increasingly more important in many areas, such as finance and engineering. Typically, real problems involve several hundreds of variables, and are subject to as many constraints. Several methods have been developed trying to reduce the theoretical time complexity. Nevertheless, when problems exceed reasonable sizes they end up being very computationally intensive...

rozdział

Practical Pre-stack Kirchhoff Time Migration of Seismic Processing on General Purpose GPU

Xiaohua Shi, Xu Wang, Changhai Zhao, Haiyan Yang

2009 WRI World Congress on Computer Science and Information Engineering > 2 > 461 - 465

2009 WRI World Congress on Computer Science and Information Engineering, CSIE

In this paper, we introduced three prototypes of GPGPU solutions on NVidia GeForce8800GT for a practical Pre-stack Kirchhoff Time Migration program. We presented how to re-design and re-implement the original CPU code to efficiency GPU code. The prototypes are more than at most 7.2 times faster than its CPU version on Intelpsilas P4 3.0G.

rozdział

Accelerate Your Graphic Program with GPU/CPU Cache

Zhou Likun, Chen Dingfang

2008 International Conference on Cyberworlds > 667 - 671

2008 International Conference on Cyberworlds

This paper discusses how to optimize the digital graphic program with cache system used in GPU/CPU architecture to gain more FPS. Firstly, we introduce the basic principle of cache system summarily; secondly, we discuss the three main organization and mapping technologies of cache system in detail, and then compare these three cache mapping solutions by giving examples; thirdly, illustrate the cache-friendly...

Opcje filtrowania

Zbiór danych:
ieee
Słowa kluczowe:
ARRAYS
PROGRAMMING
COPROCESSORS

Data publikacji

Ustaw własny zakres dat

Słowa kluczowe

COMPUTER GRAPHIC EQUIPMENT (5)
GRAPHICS PROCESSING UNIT (5)
GPU (4)
GRAPHICS (4)
CUDA (3)
DATA MINING (3)
HARDWARE (3)
KERNEL (3)
API (2)
APPLICATION PROGRAM INTERFACES (2)
CHEMISTRY COMPUTING (2)
COMPUTATIONAL MODELING (2)
COMPUTER GRAPHICS (2)
DIFFUSION (2)
GRAPHICS PROCESSING UNITS (2)
LIBRARIES (2)
PARALLEL ARCHITECTURES (2)
SCIENTIFIC COMPUTING (2)
YARN (2)
ACCELERATION (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ANOMALOUS DIFFUSION SIMULATION PROCESS (1)
ATLAS-BASED CPU VERSION (1)
AUTO-GC (1)
BIOCHEMICAL SYSTEMS (1)
BIOLOGICAL SYSTEM MODELING (1)
BIOLOGY COMPUTING (1)
BROWNIAN DYNAMICS (1)
BUILT-IN SELF-TEST (1)
CACHE MAPPING (1)
CACHE MAPPING SOLUTIONS (1)
CACHE SYSTEM (1)
CACHE-FRIENDLY PROGRAMMING (1)
CACHE-FRIENDLY PROGRAMMING STYLE (1)
CHEMICAL REACTIONS (1)
CLOCKS (1)
CLUSTERING ALGORITHMS (1)
CODE GENERATION SYSTEM (1)
COMPUTE UNIFIED DEVICE ARCHITECTURE (1)
COMPUTER ARCHITECTURE (1)
COMPUTER SCIENCE EDUCATION (1)
CONTEXT (1)
CPU CODE (1)
CPU PARALLEL COPROCESSOR (1)
CUDA PROGRAMMING MODEL (1)
DATA ENCRYPTION STANDARD (1)
DATA STRUCTURES (1)
DIFFUSED ALGORITHM (1)
DIGITAL GRAPHIC PROGRAM (1)
DISTRIBUTED COMPUTING (1)
EARTH (1)
EFFICIENT DATA PARALLEL ALGORITHMS (1)
EPIPHANY (1)
FPS (1)
FRACTAL STRUCTURES (1)
FRACTALS (1)
GAIN (1)
GENERAL PURPOSE GPU (1)
GEOPHYSICS COMPUTING (1)
GPGPU (1)
GPGPU TECHNOLOGIES (1)
GPU CLUSTERS (1)
GPU PROGRAMMING (1)
GPU-CPU CACHE (1)
GPUMP (1)
GPUS (1)
HIGH PERFORMANCE COMPUTING (1)
HIGH PERFORMANCE COMPUTING PLATFORM (1)
INDEXES (1)
INSTRUCTION SETS (1)
INTEL P4 3.0G (1)
IRREGULAR ALGORITHMS (1)
IRREGULAR COMPUTATIONAL STRUCTURE (1)
K-MEAN CLUSTERING (1)
LABORATORY FRAMEWORK (1)
LINEAR OPTIMIZATION (1)
LINEAR PROGRAMMING (1)
LINEAR PROGRAMMING PROBLEMS (1)
LOCALISED FLUCTUATIONS (1)
MACHINE PROGRAMMING (1)
MAGNETIC CORES (1)
MATHEMATICAL MODEL (1)
MATRICES (1)
MATRIX MULTIPLICATION (1)
MATRIX-MATRIX MULTIPLICATION (1)
MIDDLEWARE (1)
MODERN GPUS (1)
MODERN LARGE-SCALE COMPUTING TASKS (1)
MPI (1)
MULTICORE COMPUTING (1)
MULTIPLE-PRECISION ALGORITHM (1)
MULTIPLE-PRECISION INTEGER LIBRARY (1)
MULTIPROCESSING SYSTEMS (1)
NETWORK-ON-CHIP (1)
NVIDIA CUDA PROGRAMMING LIBRARY (1)
NVIDIA GEFORCE8800GT (1)
NVIDIA GPU (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu