Performance analysis of multicore and multinodal implementation of SpMV operation

Beata Bylina; Jaroslaw Bylina; Przemyslaw Stpiczynski; Dominik Szalkowski

doi:10.15439/2014F313

Performance analysis of multicore and multinodal implementation of SpMV operation

Bylina, Beata, Bylina, Jaroslaw, Stpiczynski, Przemyslaw, Szalkowski, Dominik

Source

2014 Federated Conference on Computer Science and Information Systems > 569 - 576

Abstract

In this paper we present two algorithms for performing sparse matrix-dense vector multiplication (known as SpMV operation). We show parallel (multicore) version of algorithm, which can be efficiently implemented on the contemporary multicore architectures. Next, we show distributed (so-called multinodal) version targeted at high performance clusters. Both versions are thoroughly tested using different architectures, compiler tools and sparse matrices of different sizes. Considered matrices comes from The University of Florida Sparse Matrix Collection. The performance of the algorithms is compared to the performance of SpMV routine from widely used Intel Math Kernel Library.