13.07.2015 Views

automatically exploiting cross-invocation parallelism using runtime ...

automatically exploiting cross-invocation parallelism using runtime ...

automatically exploiting cross-invocation parallelism using runtime ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

12x11x10xSpecCrossPthread Barrier12x11x10xSpecCrossPthread Barrier9x9xLoop Speedup8x7x6x5x4xLoop Speedup8x7x6x5x4x3x3x2x2x1x1x0x2 4 6 8 10 12 14 16 18 20 22 24Number of Threads(a) CG0x2 4 6 8 10 12 14 16 18 20 22 24Number of Threads(b) EQUAKE12x11x10xSpecCrossPthread Barrier12x11x10xPthread BarrierSpecCross9x9xLoop Speedup8x7x6x5x4xProgram Speedup8x7x6x5x4x3x3x2x2x1x1x0x2 4 6 8 10 12 14 16 18 20 22 24Number of Threads(c) FDTD0x2 4 6 8 10 12 14 16 18 20 22 24Number of Threads(d) FLUIDANIMATE-212x11x10xSpecCrossPthread Barrier12x11x10xSpecCrossPthread Barrier9x9xLoop Speedup8x7x6x5x4xLoop Speedup8x7x6x5x4x3x3x2x2x1x1x0x2 4 6 8 10 12 14 16 18 20 22 24Number of Threads(e) JACOBI0x2 4 6 8 10 12 14 16 18 20 22 24Number of Threads(f) LLUBENCH12x11x10xSpecCrossPthread Barrier12x11x10xSpecCrossPthread Barrier9x9xLoop Speedup8x7x6x5x4xLoop Speedup8x7x6x5x4x3x3x2x2x1x1x0x2 4 6 8 10 12 14 16 18 20 22 24Number of Threads(g) LOOPDEP0x2 4 6 8 10 12 14 16 18 20 22 24Number of Threads(h) SYMMFigure 5.2: Performance comparison between code parallelized with pthread barrier andSPECCROSS.85

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!