2H 2015

intel-xeon-phi-sw-ecosystem-guide-2h-2015-public3 intel-xeon-phi-sw-ecosystem-guide-2h-2015-public3

07.12.2015 Views

Memory Access Analysis New! Intel® VTune Amplifier 2016 New! Tune data structures for better performance • Attribute cache misses to data structures Bandwidth Analysis for Non-Uniform Memory • See Read & Write contributions to Total Bandwidth • Easier tuning of multi-socket bandwidth Seeing total bandwidth can suggest data blocking opportunities to change a bandwidth bound app into a compute bound app. 102

Scalable Profiling for MPI and Hybrid Clusters with MPI Performance Snapshot Lightweight – Low overhead profiling up to 32K Ranks Scalability- Performance variation at scale can be detected sooner Identifying Key Metrics – Shows PAPI counters and MPI/OpenMP* imbalances 103

Memory Access Analysis<br />

New! Intel® VTune Amplifier 2016<br />

New!<br />

Tune data structures for better performance<br />

• Attribute cache misses to data structures<br />

Bandwidth Analysis for Non-Uniform Memory<br />

• See Read & Write contributions to<br />

Total Bandwidth<br />

• Easier tuning of multi-socket bandwidth<br />

Seeing total bandwidth can suggest data blocking opportunities<br />

to change a bandwidth bound app into a compute bound app.<br />

102

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!