Matrix size LAPACK can support with level-3 BLAS

Question

I am new to the LAPACK library. I understand that LAPACK's internal routines recursively break a large problem into smaller ones (I am considering level-3 BLAS). Taking the matrix multiplication C = AB + C as an example, what are the maximum and minimum sizes down to which the larger matrices can be divided? Would 128 x 128 be the smallest size?

score 1 · Answer 1 · answered Oct 16 '14 at 22:51

Bill stated this correctly in his comment. The reference BLAS implementation uses triply nested loops, but any fast implementation will operate on small panel matrices. The minimum size is architecture- and implementation-dependent. For the gory details, refer to the Goto paper (Goto and van de Geijn, "Anatomy of High-Performance Matrix Multiplication").
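
To make the idea of working on small sub-matrices concrete, here is a minimal pure-Python sketch of a blocked C = AB + C update. It is not the Goto algorithm and not how any particular BLAS library is written: the function name `gemm_blocked` and the block size of 2 are hypothetical choices for illustration only. Tuned libraries choose block and panel sizes from the cache hierarchy of the target machine and run a hand-optimized kernel on each block.

```python
def gemm_blocked(A, B, C, block=2):
    """Update C in place with C = A*B + C, one small block at a time.

    A is m x k, B is k x n, C is m x n, all stored as lists of rows.
    `block` is an illustrative tile size, not a tuned value.
    """
    m, k, n = len(A), len(B), len(B[0])
    for i0 in range(0, m, block):
        i1 = min(i0 + block, m)          # clip the last block at the edge
        for l0 in range(0, k, block):
            l1 = min(l0 + block, k)
            for j0 in range(0, n, block):
                j1 = min(j0 + block, n)
                # Small sub-problem:
                # C[i0:i1, j0:j1] += A[i0:i1, l0:l1] * B[l0:l1, j0:j1]
                for i in range(i0, i1):
                    for l in range(l0, l1):
                        a_il = A[i][l]
                        for j in range(j0, j1):
                            C[i][j] += a_il * B[l][j]
    return C


# 3 x 3 inputs with block=2, so the edge blocks are smaller than the tile.
A = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
B = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]   # identity, so A*B equals A
C = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
gemm_blocked(A, B, C)
print(C)  # [[2, 3, 4], [5, 6, 7], [8, 9, 10]]
```

The sketch subdivides the problem with plain loops and no recursion, and it accepts any positive block size, because the `min` calls clip blocks at the matrix edges. In that sense there is no fixed smallest size such as 128 x 128; the best value depends on the architecture and the implementation.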

score 0 · Answer 2 · answered Oct 16 '14 at 18:31

Matrix multiplication (DGEMM, which is a level-3 BLAS routine rather than part of LAPACK proper) isn't recursive, at least not in the Netlib reference implementation. That implementation performs matrix multiplication as a triply nested loop.
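
As a rough illustration of what a triply nested loop means here, the following pure-Python sketch computes C = AB + C directly. It is not a transcription of the Fortran reference code: the function name `gemm_naive` is hypothetical, and the scaling factors and transpose options of the real DGEMM interface are left out. The j, l, i loop order is only meant to loosely mirror the column-oriented style of the reference routine.

```python
def gemm_naive(A, B, C):
    """Update C in place with C = A*B + C using three nested loops.

    A is m x k, B is k x n, C is m x n, all stored as lists of rows.
    """
    m, k, n = len(A), len(B), len(B[0])
    for j in range(n):              # column of C
        for l in range(k):          # index of the inner dimension
            b_lj = B[l][j]
            for i in range(m):      # row of C
                C[i][j] += A[i][l] * b_lj
    return C


A = [[1.0, 2.0], [3.0, 4.0]]
B = [[5.0, 6.0], [7.0, 8.0]]
C = [[1.0, 0.0], [0.0, 1.0]]
gemm_naive(A, B, C)
print(C)  # [[20.0, 22.0], [43.0, 51.0]]
```

The loops run over the full matrices, so nothing is divided into smaller pieces and no minimum or maximum block size arises in this version.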

Geoff Oxberry

Hierarchical methods are uncommon, but almost all fast *GEMMs use matrix subdivisions (panels, etc). – Bill Barth Oct 16 '14 at 18:52