Search results

Items from 81 to 100 out of 547 results

chapter

Vehicle detection and tracking using Mean Shift segmentation on semi-dense disparity maps

Sebastien Lefebvre, Sebastien Ambellouis

2012 IEEE Intelligent Vehicles Symposium > 855 - 860

2012 IEEE Intelligent Vehicles Symposium (IV)

This paper describes an original joint obstacle detection and tracking method based on a Mean Shift algorithm and semi-dense disparity maps. The semi-dense disparity maps are computed with a local 1D fuzzy scanline stereo matching approach. Each map is associated to a confidence map that is used to remove bad matches. The Mean Shift algorithm is applied to simultaneously extract each vehicle and track...

chapter

GPU-based Cloud computing for comparing the structure of protein binding sites

Matthias Leinweber, Lars Baumgartner, Marco Mernberger, Thomas Fober, more

2012 6th IEEE International Conference on Digital Ecosystems and Technologies (DEST) > 1 - 6

2012 6th IEEE International Conference on Digital Ecosystems and Technologies (DEST 2012) - Complex Environment Engineering

In this paper, we present a novel approach for using a GPU-based Cloud computing infrastructure to efficiently perform a structural comparison of protein binding sites. The original CPU-based Java version of a recent graph-based algorithm called SEGA has been rewritten in OpenCL to run on NVIDIA GPUs in parallel on a set of Amazon EC2 Cluster GPU Instances. This new implementation of SEGA has been...

chapter

Particle Swarm Optimization on a GPU

Mikhail Rabinovich, Phillip Kainga, David Johnson, Brandon Shafer, more

2012 IEEE International Conference on Electro/Information Technology > 1 - 6

2012 IEEE International Conference on Electro/Information Technology (EIT 2012)

Optimization problems that contain discontinuities, non-linearity, or high dimensionality are difficult to solve and time consuming using conventional computational methods. This paper introduces a tool that solves these kinds of optimization problems using a patent pending Gaming Particle Swarm Optimization (GPSO) algorithm implemented on Graphics Processing Unit (GPU) hardware. Our study applied...

chapter

Multi-biomarker panel selection on a GPU

David Johnson, Brandon Shafer, Jaehwan John Lee, Jake Y. Chen

2012 IEEE International Conference on Electro/Information Technology > 1 - 6

2012 IEEE International Conference on Electro/Information Technology (EIT 2012)

Liquid chromatography-based tandem mass spectrometry (LC-MS) technique allows for identification and quantification of thousands of proteins in parallel. This technique coupled with a feed-forward artificial neural network provides a technique to analyze and select protein panels for use in multi-biomarker panel discovery applications. In this study, we enhance this technique by utilizing massively...

chapter

An algorithm to solve the Dominating Set Problem on GPUs

Christian Trefftz

2012 IEEE International Conference on Electro/Information Technology > 1 - 4

2012 IEEE International Conference on Electro/Information Technology (EIT 2012)

A brute-force algorithm to solve small instances of the Dominating Set Problem on GPUs is presented. Two implementations of the algorithm are discussed, one that uses atomic operations and one that uses reductions. Experimental results are reported.

chapter

Parallel algorithm of amplitude correction for time-lapse seismic data based on GPU

Zheng Wenjing, Liu Qicheng, Song Yibin, Tong Xiangrong, more

2012 International Conference on Systems and Informatics (ICSAI2012) > 924 - 926

2012 International Conference on Systems and Informatics (ICSAI)

Cross equalization is the core step of time-lapse seismic data processing, it can effectively eliminate the influence which is due to the inconsistent of acquisition, data processing and tube processing parameter. As the amount of data and processing of time-lapse seismic data increasing, it becomes the inevitable trend for seismic data to array on massively parallel processes. It deal with the time-lapse...

chapter

GPU accelerated simulation of the human arterial circulation

Lucian Itu, Sharma Puneet, Ali Kamen, Constantin Suciu, more

2012 13th International Conference on Optimization of Electrical and Electronic Equipment (OPTIM) > 1478 - 1485

2012 13th International Conference on Optimization of Electrical and Electronic Equipment

A GPU accelerated implementation of a reduced-order model of the human arterial circulation is introduced. The computationally intensive tasks of the algorithm (namely, the computation of the flow rate and area values at the interior grid points of the domain) have been migrated to the GPU. The CPU not only coordinates the actions performed by the GPU, but it also computes the inflow, bifurcation...

chapter

Gigapixel spotlight synthetic aperture radar backprojection using clusters of GPUs and CUDA

Thomas M. Benson, Daniel P. Campbell, Daniel A. Cook

2012 IEEE Radar Conference > 853 - 858

2012 IEEE Radar Conference (RadarCon)

Synthetic aperture radar (SAR) image formation via backprojection offers a robust mechanism by which to form images on general, non-planar surfaces, without often restrictive assumptions regarding the planarity of the wavefront at the locations being imaged. However, backprojection presents a substantially increased computational load relative to other image formation algorithms that typically depend...

chapter

A GPU implementation of color digital halftoning using the Direct Binary Search algorithm

Kartheek Chandu, Mikel Stanich, Barry Trager, Chai Wah Wu

2012 IEEE International Symposium on Circuits and Systems > 185 - 188

2012 IEEE International Symposium on Circuits and Systems - ISCAS 2012

We illustrate how employing Graphics Processing Units (GPU) can speed-up intensive image processing operations. In particular, we demonstrate the use of the NVIDIA CUDA architecture to implement a color digital binary halftoning algorithm based on Direct Binary Search (DBS). Halftoning a color image is more computationally expensive than the single color case as there is a need to minimize dot interaction...

chapter

Generalizing the Utility of GPUs in Large-Scale Heterogeneous Computing Systems

Shucai Xiao, Wu-chun Feng

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 2554 - 2557

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

Graphics Processing Units (GPUs) have been widely used as accelerators in large-scale heterogeneous computing systems. However, current programming models can only support the utilization of local GPUs. When using non-local GPUs, programmers need to explicitly call API functions for data communication across computing nodes. As such, programming GPUs in large-scale computing systems is more challenging...

chapter

GPU Implementation of the Branch and Bound Method for Knapsack Problems

Mohamed Esseghir Lalami, Didier El-Baz

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 1769 - 1777

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

In this paper, we propose an efficient implementation of the branch and bound method for knapsack problems on a CPU-GPU system via CUDA. Branch and bound computations can be carried out either on the CPU or on a GPU according to the size of the branch and bound list. A better management of GPUs memories, less GPUCPU communications and better synchronization between GPU threads are proposed in this...

chapter

Energy Efficiency Analysis of GPUs

Juan M. Cebri'n, Gines D. Guerrero, Jose M. Garcia

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 1014 - 1022

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

In the last few years, Graphics Processing Units (GPUs) have become a great tool for massively parallel computing. GPUs are specifically designed for throughput and face several design challenges, specially what is known as the Power and Memory Walls. In these devices, available resources should be used to enhance performance and throughput, as the performance per watt is really high. For massively...

chapter

Evaluation of GPU-based Seed Generation for Computational Genomics Using Burrows-Wheeler Transform

Yongchao Liu, Bertil Schmidt

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 684 - 690

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

Unprecedented production of short reads from the new high-throughput sequencers has posed challenges to align short reads to reference genomes with high sensitivity and high speed. Many CPU-based short read aligners have been developed to address this challenge. Among them, one popular approach is the seed-and-extend heuristic. For this heuristic, the first and foremost step is to generate seeds between...

chapter

Parameterized Verification of GPU Kernel Programs

Guodong Li, Ganesh Gopalakrishnan

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 2450 - 2459

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

We present an automated symbolic verifier for checking the functional correctness of GPGPU kernels parametrically, for an arbitrary number of threads. Our tool checks the functional equivalence of a kernel and its optimized versions, helping debug errors introduced during memory coalescing and bank conflict elimination related optimizations. Key features of our work include: (1) a symbolic method...

chapter

An implementation of Coincidence Algorithm on Graphic Processing Units

Thitipan Tongsiri, Prabhas Chongstitvatana

2012 Ninth International Conference on Computer Science and Software Engineering (JCSSE) > 126 - 130

2012 International Joint Conference on Computer Science and Software Engineering (JCSSE)

Genetic Algorithms (GAs) are powerful search techniques. However when they are applied to complex problems, they consume large computation power. One of the choices to make them faster is to use a parallel implementation. This paper presents a parallel implementation of Combinatorial Optimisation with Coincidence Algorithm (COIN) on Graphic Processing Units. COIN is a modern GA. It has a wide range...

chapter

Towards the Design of Systolic Genetic Search

Martin Pedemonte, Enrique Alba, Francisco Luna

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 1778 - 1786

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

This paper elaborates on a new, fresh parallel optimization algorithm specially engineered to run on Graphic Processing Units (GPUs). The underlying operation relates to Systolic Computation. The algorithm, called Systolic Genetic Search (SGS) is based on the synchronous circulation of solutions through a grid of processing units and tries to profit from the parallel architecture of GPUs. The proposed...

chapter

Design of Direct Communication Facility for Many-Core Based Accelerators

Min Si, Yutaka Ishikawa

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 924 - 929

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

A direct communication facility, called DCFA, for a many-core based cluster, whose compute node consists of many-core units connected to the host via PCI Express with Infiniband, is designed and evaluated. Because a many-core unit is a device of the PCI Express bus, it is not capable of configuring and initializing the Infiniband HCA, according to the PCI Express specification. This means that the...

chapter

dOpenCL: Towards a Uniform Programming Approach for Distributed Heterogeneous Multi-/Many-Core Systems

Philipp Kegel, Michel Steuwer, Sergei Gorlatch

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 174 - 186

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

Modern computer systems are becoming increasingly heterogeneous by comprising multi-core CPUs, GPUs, and other accelerators. Current programming approaches for such systems usually require the application developer to use a combination of several programming models (e.g., MPI with OpenCL or CUDA) in order to exploit the full compute capability of a system. In this paper, we presentd OpenCL (Distributed...

chapter

Implementing High-performance Intensity Model with Blur Effect on GPUs for Large-scale Star Image Simulation

Chao Li, Yunquan Zhang, Changwen Zheng, Xiaohui Hu

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 1879 - 1888

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

Intensity model with blur effects are widely employed to accurately simulate the imaging process of a star simulator used for attitude determination and guiding feedback. The model is computationally intensive and the time requirements are proportional to the number of stars in the simulation, imposing great demands of computing power for realistic uses. This paper presents two star simulators using...

chapter

Experiences in Teaching a Specialty Multicore Computing Course

Peter E. Strazdins

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 1283 - 1288

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

We detail the design and experiences in delivering a specialty multicore computing course whose materials are openly available. The course ambitiously covers three multicore programming paradigms: shared memory (OpenMP), device (CUDA) and message passing (RCCE), and involves significant practical work on their respective platforms: an UltraSPARC T2, Fermi GPU and the Intel Single-Chip Cloud Computer...

Keywords:
KERNEL
GRAPHICS PROCESSING UNIT

Publication date

Set your own date range

Content availability

Available (546)
None (1)

Keywords

INSTRUCTION SETS (291)
GPU (180)
COPROCESSORS (158)
CUDA (138)
COMPUTER GRAPHIC EQUIPMENT (133)
COMPUTATIONAL MODELING (101)
PARALLEL PROCESSING (98)
COMPUTER ARCHITECTURE (97)
GPGPU (71)
OPTIMIZATION (68)
HARDWARE (60)
ARRAYS (58)
PROGRAMMING (55)
PERFORMANCE EVALUATION (46)
MEMORY MANAGEMENT (45)
ACCELERATION (41)
MATHEMATICAL MODEL (40)
ALGORITHM DESIGN AND ANALYSIS (37)
GRAPHICS PROCESSING UNITS (37)
OPENCL (35)
COMPUTE UNIFIED DEVICE ARCHITECTURE (34)
LIBRARIES (33)
SYNCHRONIZATION (33)
PARALLEL ARCHITECTURES (32)
VECTORS (32)
REGISTERS (31)
CENTRAL PROCESSING UNIT (29)
COMPUTER GRAPHICS (29)
SPARSE MATRICES (29)
INDEXES (28)
PIXEL (28)
EQUATIONS (25)
MULTIPROCESSING SYSTEMS (25)
BANDWIDTH (24)
BENCHMARK TESTING (24)
PARALLEL ALGORITHMS (24)
PARALLEL PROGRAMMING (24)
PARALLEL COMPUTING (22)
HIGH PERFORMANCE COMPUTING (19)
MULTICORE PROCESSING (19)
OPTIMISATION (19)
RUNTIME (19)
CONVOLUTION (18)
YARN (18)
GRAPHICS (17)
THROUGHPUT (17)
IMAGE PROCESSING (16)
REAL TIME SYSTEMS (16)
FIELD PROGRAMMABLE GATE ARRAYS (15)
OPENMP (15)
THREE DIMENSIONAL DISPLAYS (15)
CPU (14)
GENETIC ALGORITHMS (14)
ENCODING (13)
FEATURE EXTRACTION (13)
GPU COMPUTING (13)
GRAPHIC PROCESSING UNIT (13)
RANDOM ACCESS MEMORY (13)
ACCURACY (12)
DATABASES (12)
IMAGE COLOR ANALYSIS (12)
MEDICAL IMAGE PROCESSING (12)
MPI (12)
TILES (12)
CONTEXT (11)
EDUCATIONAL INSTITUTIONS (11)
IMAGE RECONSTRUCTION (11)
ITERATIVE METHODS (11)
JACOBIAN MATRICES (11)
LAYOUT (11)
MATRIX MULTIPLICATION (11)
SERVERS (11)
BIOINFORMATICS (10)
CLUSTERING ALGORITHMS (10)
DATA STRUCTURES (10)
INTERPOLATION (10)
LATTICES (10)
LINEAR ALGEBRA (10)
MESSAGE SYSTEMS (10)
NVIDIA (10)
PERFORMANCE (10)
TRAINING (10)
ULTRASONIC IMAGING (10)
APPLICATION PROGRAM INTERFACES (9)
CLOCKS (9)
ENERGY CONSUMPTION (9)
EVOLUTIONARY COMPUTATION (9)
PIPELINES (9)
POLYNOMIALS (9)
PROTEINS (9)
BIOLOGY COMPUTING (8)
COMPUTERS (8)
DECODING (8)
ENERGY EFFICIENCY (8)
FAST FOURIER TRANSFORMS (8)
GENERATORS (8)
GPUS (8)
HETEROGENEOUS COMPUTING (8)
more

INFONA - science communication portal

Search results

Vehicle detection and tracking using Mean Shift segmentation on semi-dense disparity maps

GPU-based Cloud computing for comparing the structure of protein binding sites

Particle Swarm Optimization on a GPU

Multi-biomarker panel selection on a GPU

An algorithm to solve the Dominating Set Problem on GPUs

Parallel algorithm of amplitude correction for time-lapse seismic data based on GPU

GPU accelerated simulation of the human arterial circulation

Gigapixel spotlight synthetic aperture radar backprojection using clusters of GPUs and CUDA

A GPU implementation of color digital halftoning using the Direct Binary Search algorithm

Generalizing the Utility of GPUs in Large-Scale Heterogeneous Computing Systems

GPU Implementation of the Branch and Bound Method for Knapsack Problems

Energy Efficiency Analysis of GPUs

Evaluation of GPU-based Seed Generation for Computational Genomics Using Burrows-Wheeler Transform

Parameterized Verification of GPU Kernel Programs

An implementation of Coincidence Algorithm on Graphic Processing Units

Towards the Design of Systolic Genetic Search

Design of Direct Communication Facility for Many-Core Based Accelerators

dOpenCL: Towards a Uniform Programming Approach for Distributed Heterogeneous Multi-/Many-Core Systems

Implementing High-performance Intensity Model with Blur Effect on GPUs for Large-scale Star Image Simulation

Experiences in Teaching a Specialty Multicore Computing Course

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options