2016.25: A Proposed API for Batched Basic Linear Algebra Subprograms
2016.25: Jack Dongarra, Iain Duff, Mark Gates, Azzam Haidar, Sven Hammarling, Nicholas J. Higham, Jonathon Hogg, Pedro Valero-Lara, Samuel D. Relton, Stanimire Tomov and Mawussi Zounon (2016) A Proposed API for Batched Basic Linear Algebra Subprograms.
Full text available as:
|PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader|
This paper proposes an API for Batched Basic Linear Algebra Subprograms (Batched BLAS). We focus on many independent BLAS operations on small matrices that are grouped together as a single routine, called Batched BLAS routine, with the aim of providing more efficient, but portable, implementations of algorithms on high-performance manycore architectures (like multi/manycore CPU processors, GPUs, and coprocessors).
|Item Type:||MIMS Preprint|
|Uncontrolled Keywords:||BLAS, linear algebra, numerical linear algebra, batched computation, accelerators|
|Subjects:||MSC 2000 > 68 Computer science|
|Deposited By:||Dr Samuel Relton|
|Deposited On:||20 April 2016|