11.07.2015 Views

CUPTI User's Guide

CUPTI User's Guide

CUPTI User's Guide

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

CapabilityEvent Name Description Type 2.0 2.1gst_inst_8bit Total number of 8-bit global store instructionsSM Y Ythat are executed by all thethreads across all thread blocksgst_inst_16bit Total number of 16-bit global store instructionsSM Y Ythat are executed by all thethreads across all thread blocksgst_inst_32bit Total number of 32-bit global store instructionsSM Y Ythat are executed by all thethreads across all thread blocksgst_inst_64bit Total number of 64-bit global store instructionsSM Y Ythat are executed by all thethreads across all thread blocksgst_inst_128bit Total number of 128-bit global storeinstructions that are executed by allthe threads across all thread blocksSM Y YTable 5: Capability 2.x Events For domain_cCapabilityEvent Name Description Type 2.0 2.1branchNumber of branches taken by threads SM Y Yexecuting a kernel. This counter willbe incremented by one if at least onethread in a warp takes the branchdivergent_branch Number of divergent branches within SM Y Ya warp. This counter will be incrementedby one if at least one threadin a warp diverges (that is, follows adifferent execution path) via a data dependentconditional branchwarps_launched Number of warps launched SM Y Ythreads_launched Number of threads launched SM Y Yactive_warpsAccumulated number of active warps SM Y Yper cycle. For every cycle it incrementsby the number of active warpsin the cycle which can be in the range0 to 48active_cyclesNumber of cycles a multiprocessor hasat least one active warpSM Y YCUDA Tools SDK <strong>CUPTI</strong> User’s <strong>Guide</strong> DA-05679-001_v01 | 19

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!