LIBXSMM: Accelerating Small Matrix Multiplications by Runtime Code Generation

Alexander Heinecke; Greg Henry; Maxwell Hutchinson; Hans Pabst

doi:10.1109/SC.2016.83

Source

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 981-991

Abstract

Many modern, highly scalable scientific simulation packages rely on small matrix multiplications as their main computational engine. Math libraries or compilers are unlikely to provide the best possible kernel performance for such sizes. To address this issue, we present a library that provides high-performance small matrix multiplications targeting all recent x86 vector instruction set extensions up to Intel AVX-512. Our evaluation shows that speed-ups of more than 10× are possible, depending on the CPU and application. These speed-ups are achieved by a combination of several novel technologies. We use a code generator with a built-in architectural model to create code that runs well without requiring an auto-tuning phase. Since such code is very specialized, we leverage just-in-time compilation to build only the required kernel variant at runtime. To keep ease of use, overhead, and kernel management under control, we accompany our library with a BLAS-compliant frontend that features a multi-level code-cache hierarchy.

Identifiers

Book e-ISSN: 2167-4337
Book e-ISBN: 978-1-4673-8815-3
DOI: 10.1109/SC.2016.83

Keywords

Libraries; Kernel; Runtime; Indexes; Algorithms; Sparse matrices; SEM; Small GEMM; JIT compilation; code generation; Block CSR; FEM

Publisher

IEEE
