11.07.2015 Views

CUPTI User's Guide

CUPTI User's Guide

CUPTI User's Guide

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

CapabilityEvent Name Description Type 1.0 1.1 1.2 1.3branchNumber of branches taken by threads SM Y Y Y Yexecuting a kernel. This event isincremented by one if at least onethread in a warp takes the branch.Note that barrier instructions (__-syncThreads()) also get counted asbranchesdivergent_branch Number of divergent branches within SM Y Y Y Ya warp. This event is incremented byone if at least one thread in a warp diverges(that is, follows a different executionpath) via a data dependent conditionalbranch. The event is incrementedby one at each point of divergencein a warpinstructions Number of instructions executed SM Y Y Y Ywarp_serialize If two addresses of a memory request SM Y Y Y Yfall in the same memory bank, thereis a bank conflict and the access hasto be serialized. This event gives thenumber of thread warps that serializeon address conflicts to either shared orconstant memorygld_incoherent Number of non-coalesced global memoryTPC Y Y N Nloadsgld_coherent Number of coalesced global memory TPC Y Y N Nloadsgld_32bNumber of 32 byte global memory TPC N N Y Yload transactions; incremented by 1for each 32 byte transactiongld_64bNumber of 64 byte global memory TPC N N Y Yload transactions; incremented by 1for each 64 byte transactiongld_128b Number of 128 byte global memory TPC N N Y Yload transactions; incremented by 1for each 128 byte transactiongst_incoherent Number of non-coalesced global memoryTPC Y Y N Nstoresgst_coherent Number of coalesced global memorystoresTPC Y Y N NCUDA Tools SDK <strong>CUPTI</strong> User’s <strong>Guide</strong> DA-05679-001_v01 | 13

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!