10.07.2015 Views

Large scale and hybrid computing with CP2K - Prace Training Portal



Parallel implementation: OMP/MPI

E = v^T (Mv)

- Distribute the matrix M (dim: 10^9 x 10^9), replicate the vector v
- Simple communication pattern (v is distributed in GGA mode)
- Allows for exploiting the full symmetries (8x speedup)
- Advanced load balancing model used
- v and (Mv) can be rather large vectors (10^9 elements)
- Exploit current architectures (e.g. 16 cores with 16 GB per node) → MPI/OpenMP
- Shared v and (Mv); v is read-only, (Mv) is atomically updated
- Exploit that only O(N) entries of v are non-zero → sparse storage for v
- Remaining memory used for storing M
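The contraction E = v^T (Mv) with a distributed M and a replicated, sparsely stored v can be illustrated with a small toy sketch. This is not CP2K code: the names `partition`, `local_contribution` and `energy` are hypothetical, the "workers" are plain loop iterations standing in for MPI ranks or OpenMP threads, and a final sum of partial results replaces the atomic updates of a shared (Mv) mentioned on the slide. Only a simple two-fold symmetry (M[i][j] = M[j][i]) is exploited here, not the full eight-fold symmetry.

```python
from typing import Dict, List, Tuple

SparseVector = Dict[int, float]               # index -> non-zero value of v
UpperTriangle = Dict[Tuple[int, int], float]  # (i, j) with i <= j -> M[i][j]


def partition(m_upper: UpperTriangle, n_workers: int) -> List[UpperTriangle]:
    """Round-robin split of the stored matrix entries across workers.

    Stand-in for distributing M; a real code would use a load-balancing model.
    """
    items = sorted(m_upper.items())
    return [dict(items[w::n_workers]) for w in range(n_workers)]


def local_contribution(m_local: UpperTriangle, v: SparseVector) -> float:
    """Energy contribution of one worker's share of M.

    v is replicated and read-only; entries absent from v are zero and skipped.
    """
    e = 0.0
    for (i, j), m_ij in m_local.items():
        v_i = v.get(i)
        v_j = v.get(j)
        if v_i is None or v_j is None:
            continue  # the product vanishes, nothing to accumulate
        # A stored off-diagonal element represents both M[i][j] and M[j][i].
        weight = 1.0 if i == j else 2.0
        e += weight * v_i * m_ij * v_j
    return e


def energy(m_upper: UpperTriangle, v: SparseVector, n_workers: int = 4) -> float:
    """E = v^T (M v) as a reduction over per-worker partial energies."""
    parts = partition(m_upper, n_workers)
    return sum(local_contribution(part, v) for part in parts)


def dense_energy(m: List[List[float]], v: List[float]) -> float:
    """Reference implementation with full M and dense v, for checking only."""
    mv = [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]
    return sum(v_i * mv_i for v_i, mv_i in zip(v, mv))


# Toy data: a symmetric 3 x 3 matrix and a vector with one zero entry.
M_DENSE = [[2.0, 1.0, 3.0],
           [1.0, 3.0, 4.0],
           [3.0, 4.0, 5.0]]
M_UPPER = {(i, j): M_DENSE[i][j] for i in range(3) for j in range(i, 3)}
V = {0: 1.0, 2: 2.0}  # dense equivalent: [1.0, 0.0, 2.0]

if __name__ == "__main__":
    print(energy(M_UPPER, V, n_workers=3))
    print(dense_energy(M_DENSE, [1.0, 0.0, 2.0]))
```

Both calls should agree regardless of the number of workers, since each stored element of M is owned by exactly one worker and contributes independently to the sum.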
