10.02.2013 Views

Instruction Throughput - Nvidia



Presentation Outline

© NVIDIA Corporation 2011

• Identifying performance limiters
• Analyzing and optimizing:
    • Memory-bound kernels (Oct 11th, Tim Schroeder)
    • Instruction (math) bound kernels
    • Kernels with poor latency hiding
    • Register spilling (Oct 4th, Paulius Micikevicius)
• For each:
    • Brief background
    • How to analyze
    • How to judge whether a particular issue is problematic
    • How to optimize
• Some case studies based on “real-life” application kernels
• Most information is for Fermi GPUs
