Introduction to CUDA C
Multiblock Dot Product: Algorithm
• Each block computes a sum of its pairwise products like before:
[Figure: Block 0 multiplies a[0], a[1], a[2], a[3], … by b[0], b[1], b[2], b[3], … and adds the products into its own sum. Block 1 does the same with a[512], a[513], a[514], a[515], … and b[512], b[513], b[514], b[515], …, producing a second sum.]
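The per-block step described above might be sketched as follows. This is an illustration under stated assumptions, not the slides' own code. The block size of 512 is inferred from Block 1 starting at index 512. The names `block_dot`, `dot_on_gpu`, `temp`, and `partial` are hypothetical. The sketch also assumes the vector length is a multiple of the block size, and it combines the per-block sums on the host, which is only one possible way to finish the computation.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

#define THREADS_PER_BLOCK 512  // assumption: inferred from Block 1 starting at index 512

// Each block stores the sum of its own pairwise products in partial[blockIdx.x].
__global__ void block_dot(const int *a, const int *b, int *partial) {
    __shared__ int temp[THREADS_PER_BLOCK];  // one product per thread of this block

    int index = threadIdx.x + blockIdx.x * blockDim.x;
    temp[threadIdx.x] = a[index] * b[index];

    __syncthreads();  // all products must be written before thread 0 reads them

    if (threadIdx.x == 0) {
        int sum = 0;
        for (int i = 0; i < THREADS_PER_BLOCK; i++)
            sum += temp[i];
        partial[blockIdx.x] = sum;
    }
}

// Hypothetical host helper: n must be a multiple of THREADS_PER_BLOCK.
// Error checking is omitted to keep the sketch short.
int dot_on_gpu(const int *a, const int *b, int n) {
    int blocks = n / THREADS_PER_BLOCK;
    size_t bytes = (size_t)n * sizeof(int);
    size_t partial_bytes = (size_t)blocks * sizeof(int);

    int *dev_a, *dev_b, *dev_partial;
    cudaMalloc((void **)&dev_a, bytes);
    cudaMalloc((void **)&dev_b, bytes);
    cudaMalloc((void **)&dev_partial, partial_bytes);

    cudaMemcpy(dev_a, a, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(dev_b, b, bytes, cudaMemcpyHostToDevice);

    block_dot<<<blocks, THREADS_PER_BLOCK>>>(dev_a, dev_b, dev_partial);

    // Copy the per-block sums back and add them on the host (an assumption
    // of this sketch; the step shown above stops at the per-block sums).
    int *partial = (int *)malloc(partial_bytes);
    cudaMemcpy(partial, dev_partial, partial_bytes, cudaMemcpyDeviceToHost);

    int total = 0;
    for (int i = 0; i < blocks; i++)
        total += partial[i];

    free(partial);
    cudaFree(dev_a);
    cudaFree(dev_b);
    cudaFree(dev_partial);
    return total;
}

int main(void) {
    const int n = 2 * THREADS_PER_BLOCK;  // two blocks, as in the diagram
    int *a = (int *)malloc(n * sizeof(int));
    int *b = (int *)malloc(n * sizeof(int));

    int expected = 0;  // host reference result for comparison
    for (int i = 0; i < n; i++) {
        a[i] = i % 7;
        b[i] = i % 5;
        expected += a[i] * b[i];
    }

    printf("gpu = %d, host = %d\n", dot_on_gpu(a, b, n), expected);

    free(a);
    free(b);
    return 0;
}
```

Within a block, every thread writes one product into shared memory, and a single thread adds them up after the barrier. Blocks do not share `temp`, so each block needs its own output slot (here `partial[blockIdx.x]`) or some other way to combine its result with the others.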