A Proposed API for Batched Basic Linear Algebra Subprograms

Dongarra, Jack and Duff, Iain and Gates, Mark and Haidar, Azzam and Hammarling, Sven and Higham, Nicholas J. and Hogg, Jonathon and Valero-Lara, Pedro and Relton, Samuel D. and Tomov, Stanimire and Zounon, Mawussi (2016) A Proposed API for Batched Basic Linear Algebra Subprograms. [MIMS Preprint]

This paper proposes an API for Batched Basic Linear Algebra Subprograms (Batched BLAS). We focus on many independent BLAS operations on small matrices that are grouped together as a single routine, called a Batched BLAS routine. The aim is to provide more efficient, yet portable, implementations of algorithms on high-performance manycore architectures (such as multicore and manycore CPUs, GPUs, and coprocessors).
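
To make the idea of a batched routine concrete, the following is a minimal reference sketch in C of a fixed-size batched matrix multiply, computing C[i] = alpha * A[i] * B[i] + beta * C[i] for every matrix triple in the batch. The function name `batched_dgemm_sketch`, its argument order, and the column-major array-of-pointers layout are assumptions made for illustration only; they are not the interface proposed in the paper. A tuned implementation would presumably distribute the outer loop across cores or GPU thread blocks rather than run it sequentially.

```c
#include <stddef.h>

/*
 * Hypothetical illustration only: this name and signature are NOT the API
 * proposed in the paper.
 *
 * For each i in [0, batch_count), computes
 *     C[i] = alpha * A[i] * B[i] + beta * C[i]
 * where A[i] is m x k, B[i] is k x n, and C[i] is m x n. All matrices are
 * stored column-major with leading dimensions lda, ldb, ldc, and every
 * problem in the batch has the same dimensions.
 */
void batched_dgemm_sketch(size_t m, size_t n, size_t k,
                          double alpha,
                          const double *const *A, size_t lda,
                          const double *const *B, size_t ldb,
                          double beta,
                          double *const *C, size_t ldc,
                          size_t batch_count)
{
    /* The problems are independent, so this loop carries no dependencies. */
    for (size_t i = 0; i < batch_count; ++i) {
        const double *a = A[i];
        const double *b = B[i];
        double *c = C[i];

        for (size_t col = 0; col < n; ++col) {
            for (size_t row = 0; row < m; ++row) {
                /* Dot product of row `row` of A[i] with column `col` of B[i]. */
                double sum = 0.0;
                for (size_t p = 0; p < k; ++p) {
                    sum += a[row + p * lda] * b[p + col * ldb];
                }
                c[row + col * ldc] = alpha * sum + beta * c[row + col * ldc];
            }
        }
    }
}
```

The point of the single entry point is that the caller hands over the whole batch at once, which leaves the implementation free to schedule the small, independent problems however best suits the target hardware.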

