The use of futures provides a flexible way to express parallelism and can generate arbitrary dependences among parallel subcomputations. The additional flexibility that futures provide comes with a cost, however. When scheduled using classic work stealing, a program with futures, compared to a program that uses only fork-join parallelism, can incur a much higher number of ``deviations,'' a metric for evaluating the performance of parallel executions. All prior works assume a parsimonious work-stealing scheduler, however, where a worker thread (surrogate of a processor) steals work only when its local deque becomes empty.
In this work, we investigate an alternative scheduling approach, called ProWS, where the workers perform proactive work stealing when handling future operations. We show that ProWS, for programs that use futures, can provide provably efficient execution time and equal or better bounds on the number of deviations compared to classic parsimonious work stealing. Given a computation with $T_1$ work and $T_\infty$ span, ProWS executes the computation on $P$ processors in expected time $O(T_1 / P + T_\infty \lg P)$, with an additional $\lg P$ overhead on the span term compared to the parsimonious variant. For structured use of futures, where each future is single touch with no race on the future handle, the algorithm incurs $O(P T_\infty^2)$ deviations, matching that of the parsimonious variant. For general use of futures, the algorithm incurs $O(m_k T_\infty + P T_\infty \lg P)$ deviations, where $m_k$ is the maximum number of future touches that are logically parallel. Compared to the bound for the parsimonious variant, $O(k T_\infty + P T_\infty)$, with $k$ being the total number of touches in the entire computation, this bound is better assuming $m_k = \Omega(P \lg P)$ and is smaller than $k$, which holds true for all the benchmarks we examined.
Tue 19 Feb Times are displayed in time zone: Guadalajara, Mexico City, Monterrey change
|14:00 - 14:25|
|Semantics-Aware Scheduling Policies for Synchronization Determinism|
Qi ZhaoNorth Carolina State University, Zhengyi QiuNorth Carolina State University, Guoliang JinNorth Carolina State UniversityDOI
|14:25 - 14:50|
|Proactive Work Stealing for Futures|
Kyle SingerWashington University in St. Louis, Yifan XuWashington University in St. Louis, I-Ting Angelina LeeWashington University in St. LouisDOI
|14:50 - 15:15|
|A Round-Efficient Distributed Betweenness Centrality Algorithm|
Loc HoangUniversity of Texas at Austin, USA, Matteo PontecorviNokia Bell Labs, Roshan DathathriUniversity of Texas at Austin, USA, Gurbinder GillUniversity of Texas at Austin, USA, Bozhi YouXi'an Jiaotong University, Keshav PingaliUniversity of Texas at Austin, USA, Vijaya RamachandranUniversity of Texas at AustinDOI