automatically exploiting cross-invocation parallelism using runtime ...

automatically exploiting cross-invocation parallelism using runtime ... automatically exploiting cross-invocation parallelism using runtime ...

dataspace.princeton.edu
from dataspace.princeton.edu More from this publisher
13.07.2015 Views

library provides efficient misspeculation detection. In order to check for misspeculation,all threads periodically send memory access signatures to a dedicated checker thread. Amemory access signature summarizes which bytes in memory have been accessed since thelast check. The checker thread uses signatures to verify that speculative execution respectsall memory dependences. If the checker thread detects misspeculation, it kills all the speculativethreads and resumes execution from the last checkpoint. If two threads calculateand send signatures while they are executing in code regions guaranteed to be independent,these signatures will be safely skipped to avoid unnecessary checking overhead.To avoid high penalty from misspeculation, SPECCROSS uses a profiler to determinethe speculative range, which controls how aggressively to speculate. The speculative rangeis the distance between a thread with a high epoch number and thread with a low epochnumber. In SPECCROSS, an epoch number counts the number of barriers each threadhas passed. Non-speculative barriers ensure that all threads have the same epoch number,whereas speculative barriers allow each thread’s epoch number to increase independently.When the difference of two threads’ epoch numbers is zero, misspeculation is impossible,since nothing is speculated. When the difference is high, misspeculation is more likely,since more intervening memory accesses are speculated. The SPECCROSS runtime systemuses profiling data to compute a limit for the speculative range. When the program reachesthe limit, SPECCROSS stalls the thread with the higher epoch number until the thread withthe lower epoch number has caught up. Choosing the correct limit keeps misspeculationrates low.Figure 4.5 gives an overview of the SPECCROSS system. The SPECCROSS compilertakes a sequential program as input and detects a SPECCROSS code region as the transformationtarget. A SPECCROSS code region is often an outermost loop composed of consecutiveparallelizable inner loops. SPECCROSS parallelizes each independent inner loop andthen inserts SPECCROSS library function calls to enable cross-invocation parallelization.At runtime, the original process spawns multiple worker threads and a checker thread for56

InputSequential ProgramCompile TimeDetect Target Loop NestsParallelize IndependentLoop InvocationsInsert Calls to SpecCross Libraryconfigurespeculative rangeProfileDependenceDistanceOutputParallel ProgramRuntimeWorkerWorkerWorker ThreadcheckingrequestChecker Threadoriginal ProcessforkmisspeculationkillCheckpointProcessFigure 4.5: Overview of SPECCROSS: At compile time, the SPECCROSS compiler detectscode regions composed of consecutive parallel loop invocations, parallelizes the code regionand inserts SPECCROSS library functions to enable barrier speculation. At runtime,the whole program is first executed speculatively without barriers. Once misspeculationoccurs, the checkpoint process is woken up. It kills the original child process and spawnsnew worker threads. The worker threads will re-execute the misspeculated epochs withnon-speculative barriers.misspeculation detection. Each worker thread executes its code section and sends requestsfor misspeculation detection to the checker thread. The checker thread detects dependenceviolations using access signatures computed by each worker thread. When the checkerthread detects a violation, it signals a separate checkpoint process for misspeculation recovery.The checkpoint process is periodically forked from the original process. Oncerecovery is completed by squashing all speculative workers and restoring state to the lastcheckpoint, misspeculated epochs are re-executed with non-speculative barriers.57

InputSequential ProgramCompile TimeDetect Target Loop NestsParallelize IndependentLoop InvocationsInsert Calls to SpecCross Libraryconfigurespeculative rangeProfileDependenceDistanceOutputParallel ProgramRuntimeWorkerWorkerWorker ThreadcheckingrequestChecker Threadoriginal ProcessforkmisspeculationkillCheckpointProcessFigure 4.5: Overview of SPECCROSS: At compile time, the SPECCROSS compiler detectscode regions composed of consecutive parallel loop <strong>invocation</strong>s, parallelizes the code regionand inserts SPECCROSS library functions to enable barrier speculation. At <strong>runtime</strong>,the whole program is first executed speculatively without barriers. Once misspeculationoccurs, the checkpoint process is woken up. It kills the original child process and spawnsnew worker threads. The worker threads will re-execute the misspeculated epochs withnon-speculative barriers.misspeculation detection. Each worker thread executes its code section and sends requestsfor misspeculation detection to the checker thread. The checker thread detects dependenceviolations <strong>using</strong> access signatures computed by each worker thread. When the checkerthread detects a violation, it signals a separate checkpoint process for misspeculation recovery.The checkpoint process is periodically forked from the original process. Oncerecovery is completed by squashing all speculative workers and restoring state to the lastcheckpoint, misspeculated epochs are re-executed with non-speculative barriers.57

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!