Exploiting task and data parallelism in ILUPACK’s preconditioned CG solver on NUMA architectures and many-core accelerators

José I. Aliaga; Rosa M. Badia; Maria Barreda; Matthias Bollhöfer; Ernesto Dufrechou; Pablo Ezzatti; Enrique S. Quintana-Ortí

doi:10.1016/j.parco.2015.12.004

Exploiting task and data parallelism in ILUPACK’s preconditioned CG solver on NUMA architectures and many-core accelerators

José I. Aliaga, Rosa M. Badia, Maria Barreda, Matthias Bollhöfer, Ernesto Dufrechou, Pablo Ezzatti, Enrique S. Quintana-Ortí

Source

Parallel Computing > 2016 > 54 > C > 97-107

Abstract

We present specialized implementations of the preconditioned iterative linear system solver in ILUPACK for Non-Uniform Memory Access (NUMA) platforms and many-core hardware co-processors based on the Intel Xeon Phi and graphics accelerators. For the conventional x86 architectures, our approach exploits task parallelism via the OmpSs runtime as well as a message-passing implementation based on MPI, respectively yielding a dynamic and static schedule of the work to the cores, with different numeric semantics to those of the sequential ILUPACK. For the graphics processor we exploit data parallelism by off-loading the computationally expensive kernels to the accelerator while keeping the numeric semantics of the sequential case.

Identifiers

journal ISSN :	0167-8191
DOI	10.1016/j.parco.2015.12.004

Authors

José I. Aliaga

Departamento de Ingeniería y Ciencia de los Computadores, Universitat Jaume I, Castellón, Spain

Rosa M. Badia

Barcelona Supercomputing Center (BSC-CNS) and Artificial Intelligence Research Institute (IIIA), Spanish National Research Council (CSIC), Barcelona, Spain

Maria Barreda

Departamento de Ingeniería y Ciencia de los Computadores, Universitat Jaume I, Castellón, Spain

Matthias Bollhöfer

Institute of Computational Mathematics, TU Braunschweig, Braunschweig, Germany

see all

Keywords

Sparse linear systems Reconditioned Conjugate Gradient solver Task and data parallelism Multi-core processors Intel Xeon Phi Graphics processing units (GPUs)

Additional information

Publication languages: English

Data set: Elsevier

Publisher

Elsevier Science

Fields of science

No field of science has been suggested yet.

article

Read online
Download
Add to read later
Add to collection
Add to followed
Share

Export to bibliography


Assign to other user
	×
Wrong email address

INFONA - science communication portal

Exploiting task and data parallelism in ILUPACK’s preconditioned CG solver on NUMA architectures and many-core accelerators $("#expandableTitles").expandable();

Source

Abstract

Identifiers

Authors

User assignment

Assignment remove confirmation

You're going to remove this assignment. Are you sure?

José I. Aliaga

Rosa M. Badia

Maria Barreda

Matthias Bollhöfer

Keywords

Additional information

Publisher

Fields of science

Fields of science

Share

Export to bibliography

Reporting an error / abuse

Sending the report failed

Accessibility options

Exploiting task and data parallelism in ILUPACK’s preconditioned CG solver on NUMA architectures and many-core accelerators