Monday, February 16, 2015

Interesting matrix multiplication in CUDA 7.0 SDK

In the CUDA 7.0 SDK, for the matrix multiplication benchmark, the input A is 320 x 320 and input B is 640 x 320. It calculates output C using A x B!

(320 x 320)    x    (640 x 320)
        A                          B

It doesn't make sense!

No comments:

Post a Comment