The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Meshless methods to simulate fluid flows have been increasingly evolving through the years since they are a great alternative to deal with large deformations, which is where mesh-based methods fail to perform efficiently. A well known meshless method is the Moving Particle Semi-implicit (MPS) method, which was designed to simulate free-surface truly incompressible fluid flows. Many variations and...
The numerical solution of the Eikonal equation follows the fast iterative method with its application for tetrahe-dral meshes. Therein the main operations in each discretization element τ contain various inner products in the M-metric as ($e^{\rarr}$k,s,$e^{\rarr}$s,ℓMτ $e^{\rarr}$Tk,s · Mτ · $e^{\rarr}$s,ℓ with $e^{\rarr}$s,ℓ as connecting edge between vertices s and ℓ in element τ. Instead of passing...
Molecular dynamics facilitates the simulation of a complex system to be analyzed at molecular and atomic levels. Simulations can last a long period of time, even months. Due to this cause the graphics processing units (GPUs) and multi-core systems are used as solutions to overcome this impediment. The current paper describes a comparison done between these two kinds of systems. The first system used...
This paper studies the implementation and optimization of a high-order weighted essentially non-oscillatory (WENO) solver to the solution of the Euler equations on the multi-core and many-core architectures (Intel Ivy Bridge CPU, Intel Xeon Phi 7110P coprocessor and NVIDIA Kepler K20c GPU). The implementation of up to ninth-order accurate WENO schemes is used in the solver. For the GPU platform, both...
This paper underscores the use of CUDA-based GPUs as high performance parallel computers for the purpose of real time analysis in a smart grid setting. In a smart grid, with the influx of new, renewable, distributed generation technologies, the network is more complex and requires more computationally intensive means of simulation and analysis. To show its usefulness, a power flow analysis case study...
Numerical approach to frequency response problems usually requires that the system governing equation is solved repeatedly at many frequencies. The computational efficiency of the overall process can be increased by departing from traditional sequential computing model in favor of utilizing the parallel processing capability commonly offered by modern hardware. In this paper, we consider a hybrid...
In this paper, we accelerate a double-precision alternating direction implicit (ADI) solver for three-dimensional compressible Navier-Stokes equations from our in-house computational fluid dynamics (CFD) software on the latest multi-core and many-core architectures (Intel Ivy Bridge CPU, Intel Xeon Phi 7110P coprocessor and NVIDIA Kepler K20c GPU). For the GPU platform, both the OpenACC-based and...
Egomotion estimation is a fundamental issue in structure from motion and particularly for ADAS systems. Several camera motion estimation methods from a set of variable number of image correspondances were proposed. Seven-point method represent the minimal number of required correspondences to estimate the fundamental matrix, raised special interest for their application in a hypothesize-and-test framework...
Recent development and popularity of Flash Memory requires efficient error correction technique on its eco system like gaming and mobile platforms. In this paper, we have addressed an efficient method to decode and correct errors using the parallel computing technique offered by Graphical Processing Unit (GPU). This decoder employs the inversion-less Berleykamp-Massey algorithm (iBMA), and Chein search...
Modern GPUs (Graphics Processing Units) offer very high computing power at relatively low cost. Nevertheless, designing efficient algorithms for the GPUs usually requires additional time and effort, even for experienced programmers. On the other hand, tridiagonal systems solvers are an important building block for a wide range of applications. In this paper, we present a new tuning parallel proposal...
ALMA is a revolutionary instrument in its scientific concept, its engineering design and its organisation as a global effort. ALMA and new incoming radio-telescopes delivery big amounts of data that are useful to the sky image reconstruction. In this context, MEM is one of the most recognized reconstruction algorithms in radio-interferometry and is based on a Bayesian approach. Our results show that...
Large matrix inversion is usually a basic step in a wide range of signal processing or numerical problems, such as digital filtering, equalization detection, and etc. It is essential to figure out an algorithm to invert large matrix quickly and accurately. On the other hand, the Graphics Processor Unit (GPU) is able to provide a low-cost and flexible multicore architecture for high performance computing,...
During the past decade Graphics Processing Units (GPU) have been increasingly employed for speeding up compute intensive scientific applications. In this field, the geometric multigrid method (GMG) is one of the most efficient algorithms for solving large sparse linear systems of equations. Herein we analyze the performance of an optimized GPU based implementation of the GMG method on different state-of-the-art...
In order to implement Sequential Bayesian estimator using Monte carlo simulation and to get rid of limitations of Kalman filter, Particle filtering techniques plays a very crucial role for target tracking applications in state space where Importance sampling approximately distributed by posterior distribution with multimodel feature and robustness to noise. However as the particles becomes very large,...
As the era of Moore's Law and increasing CPU clock rates nears its stopping point the focus of chip and hardware design has shifted to increasing the number of computation cores present on the chip. This increase can be most clearly seen in the rise of Graphic Processing Units (GPU) where hundreds or thousands of slower cores work in parallel to accomplish tasks. Programming for these chips represents...
This paper deals with linear algebra operations on Graphics Processing Unit (GPU) with complex number arithmetic using double precision. An analysis of their uses within iterative Krylov methods is presented to solve acoustic problems. Numerical experiments performed on a set of acoustic matrices arising from the modelisation of acoustic phenomena inside a car compartment are collected, and outline...
Peak pulse power enhancement occurring in substrate integrated waveguides loaded with a nonlinear dielectric material was investigated via a series of finite difference, time domain simulations. The code was developed to run efficiently on GPU platforms thereby radically reducing solution runtimes and allowing a significant number of dielectric-waveguide models to be characterized. The low-loss ferroelectric...
Modern graphics processing units (GPUs) have became powerful and cost-effective computing platforms. Parallel programming standards (e.g. CUDA) and directive-based programming standards (like OpenHMPP and OpenACC) are available to harness this tremendous computing power to tackle largescale modelling and simulation in scientific areas. ANUGA is a tsunami modelling application which is based on unstructured...
General-purpose computing on an ever-broadening array of parallel devices has led to an increasingly complex and multi-dimensional landscape with respect to programmability and performance optimization. The growing diversity of parallel architectures presents many challenges to the domain scientist, including device selection, programming model, and level of investment in optimization. All of these...
Based on the features of GPU architecture, this paper introduces CUDA into an existing Euler solver software for a 3-D Cartesian grid. Theories and Techniques used to solve the equations with finite volume methods using an explicit scheme are described. Two versions of GPU-based Cart Solver are implemented and optimized. For a real and complex model, the implementation on a NVIDIA GTX460se GPU by...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.