Search results

Items from 1 to 20 out of 48 results

chapter

Improved Meshless Method for Simulating Incompressible Fluids on GPU

Andre Luiz Buarque Vieira-e-Silva, Mozart William Santos Almeida, Caio Jose dos Santos Brito, Veronica Teichrieb

2017 19th Symposium on Virtual and Augmented Reality (SVR) > 297 - 308

2017 19th Symposium on Virtual and Augmented Reality (SVR)

Meshless methods to simulate fluid flows have been increasingly evolving through the years since they are a great alternative to deal with large deformations, which is where mesh-based methods fail to perform efficiently. A well known meshless method is the Moving Particle Semi-implicit (MPS) method, which was designed to simulate free-surface truly incompressible fluid flows. Many variations and...

chapter

Reducing the Memory Footprint of an Eikonal Solver

Daniel Ganellari, Gundolf Haase

2017 International Conference on High Performance Computing & Simulation (HPCS) > 325 - 332

2017 International Conference on High Performance Computing & Simulation (HPCS)

The numerical solution of the Eikonal equation follows the fast iterative method with its application for tetrahe-dral meshes. Therein the main operations in each discretization element τ contain various inner products in the M-metric as ($e^{\rarr}$k,s,$e^{\rarr}$s,ℓMτ $e^{\rarr}$Tk,s · Mτ · $e^{\rarr}$s,ℓ with $e^{\rarr}$s,ℓ as connecting edge between vertices s and ℓ in element τ. Instead of passing...

chapter

Benchmarking MD systems simulations on the graphics processing unit and multi-core systems

Iuliana Marin, Nicolae Goga, Maria Goga

2016 IEEE International Symposium on Systems Engineering (ISSE) > 1 - 5

2016 IEEE International Symposium on Systems Engineering (ISSE)

Molecular dynamics facilitates the simulation of a complex system to be analyzed at molecular and atomic levels. Simulations can last a long period of time, even months. Due to this cause the graphics processing units (GPUs) and multi-core systems are used as solutions to overcome this impediment. The current paper describes a comparison done between these two kinds of systems. The first system used...

chapter

Evaluating Multi-core and Many-Core Architectures through Parallelizing a High-Order WENO Solver

Liang Deng, Hanli Bai, Dan Zhao, Fang Wang

2016 IEEE Trustcom/BigDataSE/ISPA > 2167 - 2174

2016 IEEE Trustcom/BigDataSE/ISPA

This paper studies the implementation and optimization of a high-order weighted essentially non-oscillatory (WENO) solver to the solution of the Euler equations on the multi-core and many-core architectures (Intel Ivy Bridge CPU, Intel Xeon Phi 7110P coprocessor and NVIDIA Kepler K20c GPU). The implementation of up to ninth-order accurate WENO schemes is used in the solver. For the GPU platform, both...

chapter

GPU computing using CUDA in the deployment of smart grids

Daniel J. Sooknanan, Ajay Joshi

2016 SAI Computing Conference (SAI) > 1260 - 1266

2016 SAI Computing Conference (SAI)

This paper underscores the use of CUDA-based GPUs as high performance parallel computers for the purpose of real time analysis in a smart grid setting. In a smart grid, with the influx of new, renewable, distributed generation technologies, the network is more complex and requires more computationally intensive means of simulation and analysis. To show its usefulness, a power flow analysis case study...

chapter

Accelerating frequency-domain simulations using small shared-memory CPU/GPU cluster

Tomasz Topa, Artur Noga, Andrzej Karwowski

2016 21st International Conference on Microwave, Radar and Wireless Communications (MIKON) > 1 - 4

2016 21st International Conference on Microwave, Radar and Wireless Communications (MIKON)

Numerical approach to frequency response problems usually requires that the system governing equation is solved repeatedly at many frequencies. The computational efficiency of the overall process can be increased by departing from traditional sequential computing model in favor of utilizing the parallel processing capability commonly offered by modern hardware. In this paper, we consider a hybrid...

chapter

Evaluating Multi-core and Many-Core Architectures through Accelerating an Alternating Direction Implicit CFD Solver

Liang Deng, Jianbin Fang, Fang Wang, Hanli Bai

2016 15th International Symposium on Parallel and Distributed Computing (ISPDC) > 1 - 10

2016 15th International Symposium on Parallel and Distributed Computing (ISPDC)

In this paper, we accelerate a double-precision alternating direction implicit (ADI) solver for three-dimensional compressible Navier-Stokes equations from our in-house computational fluid dynamics (CFD) software on the latest multi-core and many-core architectures (Intel Ivy Bridge CPU, Intel Xeon Phi 7110P coprocessor and NVIDIA Kepler K20c GPU). For the GPU platform, both the OpenACC-based and...

chapter

CUDA accelerated visual relative motion estimation

Safa Ouerghi, Fethi Tlili

2016 International Symposium on Signal, Image, Video and Communications (ISIVC) > 302 - 307

2016 International Symposium on Signal, Image, Video and Communications (ISIVC)

Egomotion estimation is a fundamental issue in structure from motion and particularly for ADAS systems. Several camera motion estimation methods from a set of variable number of image correspondances were proposed. Seven-point method represent the minimal number of required correspondences to estimate the fundamental matrix, raised special interest for their application in a hypothesize-and-test framework...

chapter

Efficient implementation of BCH decoders on GPU for flash memory devices using iBMA

Arul K. Subbiah, Tokunbo Ogunfunmi

2016 IEEE International Conference on Consumer Electronics (ICCE) > 275 - 278

2016 IEEE International Conference on Consumer Electronics (ICCE)

Recent development and popularity of Flash Memory requires efficient error correction technique on its eco system like gaming and mobile platforms. In this paper, we have addressed an efficient method to decode and correct errors using the parallel computing technique offered by Graphical Processing Unit (GPU). This decoder employs the inversion-less Berleykamp-Massey algorithm (iBMA), and Chein search...

chapter

New Tridiagonal Systems Solvers on GPU Architectures

Adrian Perez Dieguez, Margarita Amor, Ramon Doallo

2015 IEEE 22nd International Conference on High Performance Computing (HiPC) > 85 - 94

2015 IEEE 22nd International Conference on High Performance Computing (HiPC)

Modern GPUs (Graphics Processing Units) offer very high computing power at relatively low cost. Nevertheless, designing efficient algorithms for the GPUs usually requires additional time and effort, even for experienced programmers. On the other hand, tridiagonal systems solvers are an important building block for a wide range of applications. In this paper, we present a new tuning parallel proposal...

chapter

High performance GPU Bayesian image synthesis

Miguel Carcamo, Fernando R. Rannou, Pablo E. Roman, Victor Moral, more

2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) > 264 - 268

2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)

ALMA is a revolutionary instrument in its scientific concept, its engineering design and its organisation as a global effort. ALMA and new incoming radio-telescopes delivery big amounts of data that are useful to the sky image reconstruction. In this context, MEM is one of the most recognized reconstruction algorithms in radio-interferometry and is based on a Bayesian approach. Our results show that...

chapter

A fast parallel matrix inversion algorithm based on heterogeneous multicore architectures

Denggao Yu, Shiwen He, Yongming Huang, Guangshi Yu, more

2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP) > 903 - 907

2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

Large matrix inversion is usually a basic step in a wide range of signal processing or numerical problems, such as digital filtering, equalization detection, and etc. It is essential to figure out an algorithm to invert large matrix quickly and accurately. On the other hand, the Graphics Processor Unit (GPU) is able to provide a low-cost and flexible multicore architecture for high performance computing,...

chapter

GPU accelerated geometric multigrid method: Performance comparison on recent NVIDIA architectures

Iulian Stroia, Lucian Itu, Cosmin Nita, Laszlo Lazar, more

2015 19th International Conference on System Theory, Control and Computing (ICSTCC) > 175 - 179

2015 19th International Conference on System Theory, Control and Computing (ICSTCC)

During the past decade Graphics Processing Units (GPU) have been increasingly employed for speeding up compute intensive scientific applications. In this field, the geometric multigrid method (GMG) is one of the most efficient algorithms for solving large sparse linear systems of equations. Herein we analyze the performance of an optimized GPU based implementation of the GMG method on different state-of-the-art...

chapter

Implementation of Particle Filters for Single Target Tracking Using CUDA

Bhavya Goyal, Tarun Budhraja, Roheet Bhatnagar, Chandan Shivakumar

2015 Fifth International Conference on Advances in Computing and Communications (ICACC) > 28 - 32

2015 Fifth International Conference on Advances in Computing & Communications (ICACC)

In order to implement Sequential Bayesian estimator using Monte carlo simulation and to get rid of limitations of Kalman filter, Particle filtering techniques plays a very crucial role for target tracking applications in state space where Importance sampling approximately distributed by posterior distribution with multimodel feature and robustness to noise. However as the particles becomes very large,...

chapter

GPU Based Sound Simulation and Visualization

Torbjorn Loken, Sergiu M. Dascalu, Frederick C. Harris

2015 12th International Conference on Information Technology - New Generations > 692 - 697

2015 12th International Conference on Information Technology - New Generations (ITNG)

As the era of Moore's Law and increasing CPU clock rates nears its stopping point the focus of chip and hardware design has shifted to increasing the number of computation cores present on the chip. This increase can be most clearly seen in the rise of Graphic Processing Units (GPU) where hundreds or thousands of slower cores work in parallel to accomplish tasks. Programming for these chips represents...

chapter

Iterative Krylov Methods for Acoustic Problems on Graphics Processing Unit

Abal-Kassim Cheik Ahamed, Frederic Magoules

2014 13th International Symposium on Distributed Computing and Applications to Business, Engineering and Science > 19 - 23

2014 13th International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES)

This paper deals with linear algebra operations on Graphics Processing Unit (GPU) with complex number arithmetic using double precision. An analysis of their uses within iterative Krylov methods is presented to solve acoustic problems. Numerical experiments performed on a set of acoustic matrices arising from the modelisation of acoustic phenomena inside a car compartment are collected, and outline...

chapter

Peak pulse power enhancement via substrate integrated waveguides filled with non-linear dielectrics

Joseph Bryant, Michael Baginski, Hulya Kirkici, William Little

2014 IEEE International Power Modulator and High Voltage Conference (IPMHVC) > 267 - 270

2014 IEEE International Power Modulator and High Voltage Conference (IPMHVC)

Peak pulse power enhancement occurring in substrate integrated waveguides loaded with a nonlinear dielectric material was investigated via a series of finite difference, time domain simulations. The code was developed to run efficiently on GPU platforms thereby radically reducing solution runtimes and allowing a significant number of dielectric-waveguide models to be characterized. The low-loss ferroelectric...

chapter

Acceleration of a Python-Based Tsunami Modelling Application via CUDA and OpenHMPP

Zhe Weng, Peter E. Strazdins

2014 IEEE International Parallel & Distributed Processing Symposium Workshops > 1275 - 1284

2014 IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPSW)

Modern graphics processing units (GPUs) have became powerful and cost-effective computing platforms. Parallel programming standards (e.g. CUDA) and directive-based programming standards (like OpenHMPP and OpenACC) are available to harness this tremendous computing power to tackle largescale modelling and simulation in scientific areas. ANUGA is a tsunami modelling application which is based on unstructured...

chapter

On the Programmability and Performance of Heterogeneous Platforms

Konstantinos Krommydas, Thomas R.W. Scogland, Wu-Chun Feng

2013 International Conference on Parallel and Distributed Systems > 224 - 231

2013 International Conference on Parallel and Distributed Systems (ICPADS)

General-purpose computing on an ever-broadening array of parallel devices has led to an increasingly complex and multi-dimensional landscape with respect to programmability and performance optimization. The growing diversity of parallel architectures presents many challenges to the domain scientist, including device selection, programming model, and level of investment in optimization. All of these...

chapter

CUDA Implementation of a Euler Solver for Cartesian Grid

Yang Liu, Yufei Pang, Bo Chen, Hanshan Xiao, more

2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing > 1308 - 1314

2013 IEEE International Conference on High Performance Computing and Communications (HPCC) & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (EUC)

Based on the features of GPU architecture, this paper introduces CUDA into an existing Euler solver software for a 3-D Cartesian grid. Theories and Techniques used to solve the equations with finite volume methods using an explicit scheme are described. Two versions of GPU-based Cart Solver are implemented and optimized. For a real and complex model, the implementation on a NVIDIA GTX460se GPU by...

Keywords:
MATHEMATICAL MODEL
GPU

Publication date

Set your own date range

Publication type

book (43)
article (5)

Keywords

GRAPHICS PROCESSING UNITS (27)
EQUATIONS (22)
COMPUTATIONAL MODELING (21)
GRAPHICS PROCESSING UNIT (18)
COPROCESSORS (13)
INSTRUCTION SETS (12)
KERNEL (12)
COMPUTER ARCHITECTURE (9)
COMPUTER GRAPHIC EQUIPMENT (8)
PROGRAMMING (7)
MATRIX DECOMPOSITION (5)
PARALLEL COMPUTING (5)
ALGORITHM DESIGN AND ANALYSIS (4)
CENTRAL PROCESSING UNIT (4)
COMPUTER GRAPHICS (4)
GRAPHICS (4)
HIGH PERFORMANCE COMPUTING (4)
ITERATIVE METHODS (4)
PARALLEL PROCESSING (4)
VECTORS (4)
YARN (4)
ACCELERATION (3)
ARRAYS (3)
CFD (3)
FINITE DIFFERENCE METHODS (3)
HARDWARE (3)
JACOBIAN MATRICES (3)
MEMORY MANAGEMENT (3)
METHOD OF MOMENTS (3)
NUMERICAL MODELS (3)
OPENACC (3)
OPTIMIZATION (3)
PARALLEL (3)
PERFORMANCE (3)
PROGRAMMABILITY (3)
ACOUSTICS (2)
BOUNDARY CONDITIONS (2)
BRIDGES (2)
COMPUTATIONAL ELECTROMAGNETICS (2)
COMPUTE UNIFIED DEVICE ARCHITECTURE (2)
DATA MINING (2)
DATA MODELS (2)
ESTIMATION (2)
FINITE DIFFERENCE TIME-DOMAIN ANALYSIS (2)
FLUID SIMULATION (2)
ITERATIVE KRYLOV METHODS (2)
IVY BRIDGE (2)
MATLAB (2)
MOM (2)
MPI (2)
NAVIER-STOKES EQUATIONS (2)
NUMERICAL INTEGRATION (2)
OPTIMIZATION TECHNIQUES (2)
PARALLEL ARCHITECTURES (2)
PARTIAL DIFFERENTIAL EQUATION (2)
PARTIAL DIFFERENTIAL EQUATIONS (2)
POLLUTANT BEHAVIOR (2)
RENDERING (COMPUTER GRAPHICS) (2)
RUNGE-KUTTA METHODS (2)
SCIENTIFIC COMPUTING (2)
SIGNAL PROCESSING ALGORITHMS (2)
SOLID MODELING (2)
SPARSE MATRICES (2)
XEON PHI (2)
3D VISUALIZATION (1)
4096 IZHIKEVICH NEURONS (1)
4TH ORDER RUNGE-KUTTA SCHEME (1)
A-DE (1)
ACCELERATED COMPUTATION (1)
ACCURACY (1)
ACOUSTIC (1)
ADAPTIVE ORDER (1)
ADI METHOD (1)
ADMITTANCE (1)
ADVECTION-DIFFUSION EQUATION (1)
ADVECTION-DIFFUSION EQUATION CALCULATION (1)
AIR POLLUTION (1)
ALMA (1)
ALTERNATING DIRECTION IMPLICIT (1)
ALTERNATING DIRECTION IMPLICIT METHOD (1)
ANOMALOUS DIFFUSION SIMULATION PROCESS (1)
ANUGA (1)
APERTURES (1)
APPROXIMATION METHODS (1)
ARTIFACT DISCRETIZATION (1)
ATMOSPHERIC EQUATION (1)
ATMOSPHERIC MODELING (1)
ATMOSPHERIC TECHNIQUES (1)
AUTONOMOUS VEHICLE (1)
AVX (1)
AZIMUTH (1)
BASKET OPTION (1)
BASKET OPTION PRICING (1)
BAYES THEOREM (1)
BCH (1)
BIOLOGY COMPUTING (1)
BIOMEDICAL MRI (1)
more

INFONA - science communication portal

Search results

Improved Meshless Method for Simulating Incompressible Fluids on GPU

Reducing the Memory Footprint of an Eikonal Solver

Benchmarking MD systems simulations on the graphics processing unit and multi-core systems

Evaluating Multi-core and Many-Core Architectures through Parallelizing a High-Order WENO Solver

GPU computing using CUDA in the deployment of smart grids

Accelerating frequency-domain simulations using small shared-memory CPU/GPU cluster

Evaluating Multi-core and Many-Core Architectures through Accelerating an Alternating Direction Implicit CFD Solver

CUDA accelerated visual relative motion estimation

Efficient implementation of BCH decoders on GPU for flash memory devices using iBMA

New Tridiagonal Systems Solvers on GPU Architectures

High performance GPU Bayesian image synthesis

A fast parallel matrix inversion algorithm based on heterogeneous multicore architectures

GPU accelerated geometric multigrid method: Performance comparison on recent NVIDIA architectures

Implementation of Particle Filters for Single Target Tracking Using CUDA

GPU Based Sound Simulation and Visualization

Iterative Krylov Methods for Acoustic Problems on Graphics Processing Unit

Peak pulse power enhancement via substrate integrated waveguides filled with non-linear dielectrics

Acceleration of a Python-Based Tsunami Modelling Application via CUDA and OpenHMPP

On the Programmability and Performance of Heterogeneous Platforms

CUDA Implementation of a Euler Solver for Cartesian Grid

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options