21.11.2014 Views

Instruction Throughput - GPU Technology Conference


Simplified View of Latency and Syncs

Kernel where most math cannot be executed until all data is loaded by the threadblock

Memory-only time

Math-only time

Full-kernel time, one large threadblock per SM
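The relationship between the three times on this slide can be sketched with a toy timing model. This is an illustrative assumption, not NVIDIA's model: the function `full_kernel_time` and its parameters are hypothetical, loads and math are treated as two serial resources, and each threadblock's math starts only after its own loads finish (the sync). The `blocks_per_sm` parameter, and the multi-block case it enables, is not on this slide and is included only for contrast.

```python
def full_kernel_time(memory_time, math_time, blocks_per_sm=1):
    """Toy estimate of kernel time on one SM (hypothetical model).

    The total memory and math work is split evenly across the blocks.
    A block's math cannot start until its own loads are complete, and
    loads/math of different blocks are each processed one after another.
    """
    load = memory_time / blocks_per_sm   # load phase of one block
    math = math_time / blocks_per_sm     # math phase of one block
    load_done = 0.0                      # time the latest load phase finished
    math_done = 0.0                      # time the latest math phase finished
    for _ in range(blocks_per_sm):
        load_done += load
        # Math waits for this block's data and for the previous block's math.
        math_done = max(math_done, load_done) + math
    return math_done


# One large threadblock: nothing overlaps, so the times simply add up.
one_block = full_kernel_time(8.0, 4.0, blocks_per_sm=1)
# Several smaller threadblocks (assumed case): math overlaps later loads.
four_blocks = full_kernel_time(8.0, 4.0, blocks_per_sm=4)
```

In this model, one large threadblock per SM gives a full-kernel time equal to memory-only time plus math-only time, because the sync leaves nothing to overlap. Under the same assumptions, splitting the work across more blocks moves the estimate toward the larger of the two times.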

© NVIDIA Corporation 2011

Diagram axis: time
