PPoPP 2019
Sat 16 - Wed 20 February 2019 Washington, DC, United States

PPoPP is the premier forum for leading work on all aspects of parallel programming, including theoretical foundations, techniques, languages, compilers, runtime systems, tools, and practical experience. In the context of the symposium, “parallel programming” encompasses work on concurrent and parallel systems (multicore, multi-threaded, heterogeneous, clustered, and distributed systems; grids; datacenters; clouds; and large scale machines). Given the rise of parallel architectures in the consumer market (desktops, laptops, and mobile devices) and data centers, PPoPP is particularly interested in work that addresses new parallel workloads and issues that arise out of extreme-scale applications or cloud platforms, as well as techniques and tools that improve the productivity of parallel programming or work towards improved synergy with such emerging architectures.

Dates
Tracks
You're viewing the program in a time zone which is different from your device's time zone change time zone

Sat 16 Feb

Displayed time zone: Guadalajara, Mexico City, Monterrey change

08:00 - 17:00

Sun 17 Feb

Displayed time zone: Guadalajara, Mexico City, Monterrey change

08:00 - 17:00
18:00 - 20:00
Welcome Reception and Poster SessionMain Conference at Mezzanine Foyer

Mon 18 Feb

Displayed time zone: Guadalajara, Mexico City, Monterrey change

08:00 - 17:00
08:15 - 09:30
Plenary Session 1Main Conference at Salon 6
08:15
15m
Talk
Chairs' Welcome
Main Conference

08:30
60m
Talk
HPCA Keynote: Srini Devadas (MIT)
Main Conference
09:35 - 10:25
Session 1: Big DataMain Conference at Salon 12/13
Chair(s): Roberto Palmieri Lehigh University
09:35
25m
Talk
Beyond Human-Level Accuracy: Computational Challenges in Deep Learning
Main Conference
Joel Hestness Baidu Research, Newsha Ardalani Baidu Research, Gregory Diamos Baidu Research
DOI
10:00
25m
Talk
S-EnKF: Co-designing for Scalable Ensemble Kalman Filter
Main Conference
Junmin Xiao , Shijie Wang Institute of Computing Technology, Chinese Academy of Sciences, Weiqiang Wan Institute of Computing Technology, Chinese Academy of Sciences, Xuehai Hong Institute of Computing Technology, Chinese Academy of Sciences, Guangming Tan Chinese Academy of Sciences(CAS)
DOI
10:55 - 12:35
Session 2: Heterogeneous Platforms and GPUMain Conference at Salon 12/13
Chair(s): Xu Liu College of William and Mary
10:55
25m
Talk
Throughput-Oriented GPU Memory Allocation
Main Conference
Isaac Gelado NVIDIA, Michael Garland NVIDIA Research
DOI
11:20
25m
Talk
SEP-Graph: Finding Shortest Execution Paths for Graph Processing under a Hybrid Framework on GPU
Main Conference
Hao Wang The Ohio State University, USA, Liang Geng The Ohio State University, USA, Rubao Lee United Parallel Computing Corporation, USA, Kaixi Hou Virginia Tech, USA, Yanfeng Zhang , Xiaodong Zhang The Ohio State University, USA
DOI
11:45
25m
Talk
Incremental Flattening for Nested Data Parallelism
Main Conference
Troels Henriksen University of Copenhagen, Denmark, Frederik Thorøe DIKU, University of Copenhagen, Martin Elsman University of Copenhagen, Denmark, Cosmin Oancea University of Copenhagen, Denmark
DOI
12:10
25m
Talk
Adaptive Sparse Matrix-Matrix Multiplication on the GPU
Main Conference
Martin Winter Graz University of Technology, Austria, Daniel Mlakar Graz University of Technology, Austria, Rhaleb Zayer Max Planck Institute for Informatics, Hans-Peter Seidel Max Planck Institute for Informatics, Markus Steinberger Graz University of Technology, Austria
DOI
14:00 - 15:40
Session 3: Transactional MemoryMain Conference at Salon 12/13
Chair(s): Milind Chabbi Uber Technologies
14:00
25m
Talk
Modular Transactions: Bounding Mixed Races in Space and Time
Main Conference
Brijesh Dongol University of Surrey, Radha Jagadeesan DePaul University, James Riely DePaul University
DOI
14:25
25m
Talk
Leveraging Hardware TM in Haskell
Main Conference
Ryan Yates , Michael Scott University of Rochester
DOI Authorizer link File Attached
14:50
25m
Talk
Stretching the capacity of Hardware Transactional Memory in IBM POWER architectures
Main Conference
Ricardo Jorge Duarte Filipe , Shady Issa INESC-ID, João Barreto INESC-ID, Paolo Romano University of Lisbon, Portugal
DOI
15:15
25m
Talk
Processing Transactions in a Predefined Order
Main Conference
Mohamed M. Saad Virginia Tech, Masoomeh Javidi Kishi Lehigh University, Shihao Jing Lehigh University, Sandeep Hans IBM India Research Lab, Roberto Palmieri Lehigh University
DOI
16:10 - 17:00
Session 4: GPU B-TreesMain Conference at Salon 12/13
Chair(s): Ang Li Pacific Northwest National Laboratory
16:10
25m
Talk
Harmonia: A High Throughput B+tree for GPUs
Main Conference
Zhaofeng Yan Fudan University, Yuzhe Lin Fudan University, Lu Peng , Weihua Zhang Fudan University
DOI
16:35
25m
Talk
Engineering a High-Performance GPU B-Tree
Main Conference
Muhammad Awad , Saman Ashkiani University of California, Davis, Rob Johnson VMWare Research, Martin Farach-Colton Rutgers University, John D. Owens University of California, Davis
DOI
17:00 - 18:00
Business MeetingMain Conference at Salon 12/13

Tue 19 Feb

Displayed time zone: Guadalajara, Mexico City, Monterrey change

08:00 - 17:00
08:15 - 09:30
Plenary Session 2Main Conference at Salon 6
Chair(s): Idit Keidar Technion - Israel institute of technology
08:15
75m
Talk
PPoPP Keynote: Karin Strauss (Microsoft Research)
Main Conference
S: Karin Strauss Microsoft Research
09:35 - 10:25
Session 5, PerformanceMain Conference at Salon 12/13
Chair(s): Cosmin Oancea University of Copenhagen, Denmark
09:35
25m
Talk
QTLS: High-Performance TLS Asynchronous Offload Framework with Intel® QuickAssist Technology
Main Conference
Xiaokang Hu Shanghai Jiao Tong University, Changzheng Wei Intel Asia-Pacific Research and Development Ltd., Li Jian , Brian Will Intel Corporation, Ping Yu Intel Asia-Pacific Research and Development Ltd., Lu Gong Intel Asia-Pacific Research and Development Ltd., Haibing Guan Shanghai Jiao Tong University
DOI
10:00
25m
Talk
Data-Flow/Dependence Profiling for Structured Transformations
Main Conference
Fabian Gruber Université Grenoble Alpes / INRIA Grenoble Rhônes-Alpes, Manuel Selva Université Grenoble Alpes, Diogo Sampaio Inria, Christophe Guillon STMicroelectronics, Antoine Moynault STMicroelectronics, Louis-Noël Pouchet Colorado State University, Fabrice Rastello INRIA
DOI
10:55 - 12:35
Session 6, Best Paper CandidatesMain Conference at Salon 12/13
Chair(s): Rudolf Eigenmann University of Delaware
10:55
25m
Talk
Lightweight Hardware Transactional Memory Profiling
Main Conference
Qingsen Wang College of William and Mary, Pengfei Su College of William and Mary, Milind Chabbi Uber Technologies, Xu Liu College of William and Mary
DOI
11:20
25m
Talk
A Pattern Based Algorithmic Autotuner for Graph Processing on GPUs
Main Conference
Ke Meng , Jiajia Li Georgia Institute of Technology, Pacific Northwest National Laboratory, Guangming Tan Chinese Academy of Sciences(CAS), Ninghui Sun State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, University of Chinese Academy of Sciences
DOI
11:45
25m
Talk
Provably and Practically Efficient Granularity Control
Main Conference
Umut A. Acar Carnegie Mellon University, Vitaly Aksenov Inria & ITMO University, Arthur Charguéraud Inria, Mike Rainey Indiana University, USA
DOI
12:10
25m
Talk
A Coordinated Tiling and Batching Framework for Efficient GEMM on GPUs
Main Conference
Xiuhong Li Peking University, Eric Liang Peking University, Shengen Yan SenseTime, Jia Liancheng Peking University, Yinghan Li SenseTime
DOI
14:00 - 15:15
Session 7: SchedulingMain Conference at Salon 12/13
Chair(s): Jidong Zhai Tsinghua University
14:00
25m
Talk
Semantics-Aware Scheduling Policies for Synchronization Determinism
Main Conference
Qi Zhao North Carolina State University, Zhengyi Qiu North Carolina State University, Guoliang Jin North Carolina State University
DOI
14:25
25m
Talk
Proactive Work Stealing for Futures
Main Conference
Kyle Singer Washington University in St. Louis, Yifan Xu Washington University in St. Louis, I-Ting Angelina Lee Washington University in St. Louis
DOI
14:50
25m
Talk
A Round-Efficient Distributed Betweenness Centrality Algorithm
Main Conference
Loc Hoang University of Texas at Austin, USA, Matteo Pontecorvi Nokia Bell Labs, Roshan Dathathri University of Texas at Austin, USA, Gurbinder Gill University of Texas at Austin, USA, Bozhi You Xi'an Jiaotong University, Keshav Pingali University of Texas at Austin, USA, Vijaya Ramachandran University of Texas at Austin
DOI
15:45 - 16:35
Session 8: HPCMain Conference at Salon 12/13
Chair(s): I-Ting Angelina Lee Washington University in St. Louis
15:45
25m
Talk
Corrected Trees for Reliable Group Communication
Main Conference
Martin Küttler TU Dresden, Maksym Planeta TU Dresden, Germany, Jan Bierbaum TU Dresden, Carsten Weinhold TU Dresden, Hermann Härtig TU Dresden, Amnon Barak The Hebrew University of Jerusalem, Torsten Hoefler ETH Zurich
DOI
16:10
25m
Talk
Adaptive Sparse Tiling for Sparse Matrix Multiplication
Main Conference
Changwan Hong , Aravind Sukumaran-Rajam Ohio State University, USA, Israt Nisa , Kunal Singh The Ohio State University, P. Sadayappan Ohio State University
DOI
17:10 - 18:30
19:00 - 21:00
Excursion and Banquet DinnerMain Conference at Air and Space Museum

Wed 20 Feb

Displayed time zone: Guadalajara, Mexico City, Monterrey change

08:00 - 13:00
08:15 - 09:30
Plenary Session 3Main Conference at Salon 6
08:15
75m
Talk
CGO Keynote: Michael O’Boyle (University of Edinburgh)
Main Conference
S: Michael F. P. O'Boyle University of Edinburgh
09:35 - 10:50
Session 9: SynchronizationMain Conference at Salon 12/13
Chair(s): Erez Petrank Technion
09:35
25m
Talk
Encapsulated Open Nesting for STM: Fine-Grained Higher-Level Conflict Detection
Main Conference
Martin Bättig Department of Computer Science, ETH Zurich, Thomas Gross ETH Zurich
DOI
10:00
25m
Talk
A Specialized B-Tree for Concurrent Datalog Evaluation
Main Conference
Herbert Jordan University of Innsbruck, Pavle Subotic University College London, David Zhao The University of Sydney, Bernhard Scholz University of Sydney, Australia
DOI
10:25
25m
Talk
Efficient Race Detection with Futures
Main Conference
Robert Utterback Monmouth College, Kunal Agrawal Washington University in St. Louis, Jeremy Fineman , I-Ting Angelina Lee Washington University in St. Louis
DOI
11:20 - 12:35
Session 10: VerificationMain Conference at Salon 12/13
Chair(s): Michael Lam James Madison University
11:20
25m
Talk
Verifying C11 Programs Operationally
Main Conference
Simon Doherty University of Sheffield, Brijesh Dongol University of Surrey, Heike Wehrheim Paderborn University, John Derrick University of Sheffield
DOI
11:45
25m
Talk
Checking Linearizability Using Hitting Families
Main Conference
Burcu Kulahcioglu Ozkan MPI-SWS, Germany, Rupak Majumdar MPI-SWS, Germany, Filip Niksic University of Pennsylvania
DOI
12:10
25m
Talk
Transitive Joins: A Sound and Efficient Online Deadlock-Avoidance Policy
Main Conference
Caleb Voss Georgia Institute of Technology, Tiago Cogumbreiro University of Massachusetts Boston, Vivek Sarkar Rice University, USA
DOI
12:35 - 13:00
Best Paper Award and ClosingMain Conference at Salon 12/13

Unscheduled Events

Not scheduled
Poster
A GPU Memory Efficient Speed-up Scheme for Training Ultra-deep Neural Networks
Posters
Jinrong Guo Institute of Information Engineering, Chinese Academy of Sciences & School of Cyber Security, University of Chinese Academy of Sciences, Wantao Liu Institute of Information Engineering, Chinese Academy of Sciences, Wang Wang Institute of Information Engineering, Chinese Academy of Sciences & School of Cyber Security, University of Chinese Academy of Sciences, Qu Lu Institute of Information Engineering, Chinese Academy of Sciences & School of Cyber Security, University of Chinese Academy of Sciences, Songlin Hu Institute of Information Engineering, Chinese Academy of Sciences, Jizhong Han Institute of Information Engineering, Chinese Academy of Sciences, Ruixuan Li Institute of Information Engineering, Chinese Academy of Sciences
Not scheduled
Poster
VEBO: A Vertex- and Edge-Balanced Ordering Heuristic to Load Balance Parallel Graph Processing
Posters
Jiawen Sun , Hans Vandierendonck Queen's University Belfast, Dimitrios Nikolopoulos EEECS, at Queen's University of Belfast, Northern Ireland, United Kingdom
Not scheduled
Poster
Managing Application Parallelism via Parallel Efficiency Regulation
Posters
Sharanyan Srikanthan University of Rochester, Princeton Ferro , Sandhya Dwarkadas University of Rochester, Sayak Chakraborti University of Rochester
Not scheduled
Poster
Exploiting the Input Sparsity to Accelerate Deep Neural Networks
Posters
Xiao Dong Institute of Computing Technology,Chinese Academy of Sciences, Lei Liu Institute of Computing Technology,Chinese Academy of Sciences, Guangli Li Institute of Computing Technology,Chinese Academy of Sciences;University of Chinese Academy of Sciences, Jiansong Li Institute of Computing Technology,Chinese Academy of Sciences;University of Chinese Academy of Sciences, Peng Zhao Institute of Computing Technology,Chinese Academy of Sciences;University of Chinese Academy of Sciences, Xueying Wang Institute of Computing Technology,Chinese Academy of Sciences;University of Chinese Academy of Sciences, Xiaobing Feng ICT CAS
Not scheduled
Poster
High-Throughput Image Alignment for Connectomics using Frugal Snap Judgments
Posters
Not scheduled
Poster
GOPipe: A Granularity-Oblivious Programming Framework for Pipelined Stencil Executions on GPU
Posters
Chanyoung Oh University of Seoul, Zhen ZHENG Tsinghua University, Xipeng Shen North Carolina State University, Jidong Zhai Tsinghua University, Youngmin Yi University of Seoul
Not scheduled
Poster
CuLDA_CGS: Solving Large-scale LDA Problems on GPUs
Posters
Xiaolong Xie Peking University, Eric Liang Peking University, Xiuhong Li Peking University, Wei Tan Citadel LLC
Not scheduled
Poster
Blockchain Abstract Data Type
Posters
Emmanuelle Anceaume CNRS, IRISA, Antonella Del Pozzo CEA LIST, Romaric Ludinard IMT Atlantique, IRISA, Maria Potop-Butucaru Sorbonne Université, CNRS, Laboratoire d'Informatique de Paris 6, LIP6, Sara Tucci-Piergiovanni CEA LIST
Not scheduled
Poster
A Distributed Hypervisor for Resource Aggregation
Posters
Yubin Chen Shanghai Jiao Tong University, Zhuocheng Ding , Jin Zhang , Yun Wang Shanghai Jiao Tong University, Zhengwei Qi Shanghai Jiao Tong University, Haibing Guan Shanghai Jiao Tong University
Not scheduled
Poster
LOFT: Lock-Free Transactional Data Structures
Posters
Avner Elizarov , Guy Gueta VMWare Research, Erez Petrank Technion
Not scheduled
Poster
Scheduling HPC Workloads on Heterogeneous-ISA Architectures
Posters
Mohamed L. Karaoui Virginia Tech, Anthony Carno Virginia Tech, Rob Lyerly Virginia Tech, Sang-Hoon Kim Korea Advanced Institute of Science and Technology, Pierre Olivier Virginia Tech, USA, Changwoo Min Virginia Tech, Binoy Ravindran Virginia Tech
Not scheduled
Poster
Optimizing Computation-Communication Overlap in Asynchronous Task-Based Programs
Posters
Emilio Castillo Barcelona Supercomputing Center, Nikhil Jain Lawrence Livermore National Laboratory, Marc Casas Barcelona Supercomputing Center, Miquel Moreto Technical Univeristy of Catalonia, Martin Schulz Lawrence Livermore National Laboratory, Ramón Beivide , Mateo Valero , Abhinav Bhatele Lawrence Livermore National Laboratory
Not scheduled
Poster
Automated Multi-dimensional Elasticity for Streaming Runtimes
Posters
Xiang Ni IBM T.J. Watson Research Center, Scott Schneider IBM Research, Raju Pavuluri IBM T.J. Watson Research Center, Jonathan Kaus IBM, Kun-Lung Wu IBM Research
Not scheduled
Poster
Accelerating Distributed Stochastic Gradient Descent with Adaptive Periodic Parameter Averaging
Posters
Peng Jiang , Gagan Agrawal The Ohio State University
Not scheduled
Poster
Optimizing GPU Programs by Register Demotion
Posters
Putt Sakdhnagool NECTEC, Thailand, Amit Sabne School of Electrical and Computer Engineering, Purdue University, Rudolf Eigenmann Purdue University
Not scheduled
Poster
BASMAT: Bottleneck-Aware Sparse Matrix-Vector Multiplication Auto-Tuning on GPGPUs
Posters
Athena Elafrou National Technical University of Athens, Georgios Goumas National Technical University of Athens, Nectarios Koziris National Technical University of Athens
Not scheduled
Poster
Optimizing Graph Processing on GPUs using Approximate Computing
Posters
Somesh Singh Indian Institute of Technology, Madras, Rupesh Nasre IIT Madras, India
Not scheduled
Poster
Toward Efficient Architecture-Independent Algorithms for Dynamic Programs
Posters
Mohammad Mahdi Javanmard Stony Brook University, NY, USA, Pramod Ganapathi Stony Brook University, Rathish Das Stony Brook University, NY, USA, Zafar Ahmad Stony Brook University, NY, USA, Stephen Tschudi Google, CA, USA, Rezaul Chowdhury Stony Brook University
Not scheduled
Poster
Compiler-assisted Adaptive Program Scheduling in big.LITTLE Systems
Posters
Not scheduled
Poster
Profiling based out-of-core Hybrid method for large neural networks
Posters
Yuki Ito Tokyo Institute of Technology, Haruki Imai IBM Research - Tokyo, Tung D. Le IBM Research - Tokyo, Yasushi Negishi IBM Research - Tokyo, Kiyokuni Kawachiya IBM Research - Tokyo, Ryo Matsumiya Tokyo Institute of Technology, Toshio Endo Tokyo Institute of Technology
Not scheduled
Poster
Making Concurrent Algorithms Detectable
Posters
Naama Ben-David Carnegie Mellon University, Guy E. Blelloch Carnegie Mellon University, Michal Friedman , Yuanhao Wei Carnegie Mellon University
Not scheduled
Poster
Lock-free Channels for Programming via Communicating Sequential Processes
Posters
Nikita Koval IST Austria & JetBrains, Dan Alistarh IST Austria, Roman Elizarov JetBrains
Not scheduled
Poster
GPU-based 3D Cryo-EM Reconstruction with Key-Value Streams
Posters
Kunpeng Wang Tsinghua University, Shizhen Xu Tsinghua University, Hongkun Yu Tsinghua University, Haohuan Fu Tsinghua University, Guangwen Yang Tsinghua University
Not scheduled
Poster
GPOP: A cache and memory-efficient framework for Graph Processing Over Partitions
Posters
Kartik Lakhotia University of Southern California, Rajgopal Kannan Army Research Lab-West, Sourav Pati University of Southern California, Viktor Prasanna University of Southern California
Not scheduled
Poster
T-thinker: A Task-Centric Distributed Framework For Compute-Intensive Divide-and-Conquer Algorithms
Posters
Da Yan The University of Alabama at Birmingham, Guimu Guo The University of Alabama at Birmingham, Md Mashiur Rahman Chowdhury The University of Alabama at Birmingham, M. Tamer Özsu University of Waterloo, John C.S. Lui The Chinese University of Hong Kong, Weida Tan The University of Alabama at Birmingham

Accepted Posters

Title
Accelerating Distributed Stochastic Gradient Descent with Adaptive Periodic Parameter Averaging
Posters
A Distributed Hypervisor for Resource Aggregation
Posters
A GPU Memory Efficient Speed-up Scheme for Training Ultra-deep Neural Networks
Posters
Automated Multi-dimensional Elasticity for Streaming Runtimes
Posters
BASMAT: Bottleneck-Aware Sparse Matrix-Vector Multiplication Auto-Tuning on GPGPUs
Posters
Blockchain Abstract Data Type
Posters
Compiler-assisted Adaptive Program Scheduling in big.LITTLE Systems
Posters
CuLDA_CGS: Solving Large-scale LDA Problems on GPUs
Posters
Exploiting the Input Sparsity to Accelerate Deep Neural Networks
Posters
GOPipe: A Granularity-Oblivious Programming Framework for Pipelined Stencil Executions on GPU
Posters
GPOP: A cache and memory-efficient framework for Graph Processing Over Partitions
Posters
GPU-based 3D Cryo-EM Reconstruction with Key-Value Streams
Posters
High-Throughput Image Alignment for Connectomics using Frugal Snap Judgments
Posters
Lock-free Channels for Programming via Communicating Sequential Processes
Posters
LOFT: Lock-Free Transactional Data Structures
Posters
Making Concurrent Algorithms Detectable
Posters
Managing Application Parallelism via Parallel Efficiency Regulation
Posters
Optimizing Computation-Communication Overlap in Asynchronous Task-Based Programs
Posters
Optimizing GPU Programs by Register Demotion
Posters
Optimizing Graph Processing on GPUs using Approximate Computing
Posters
Profiling based out-of-core Hybrid method for large neural networks
Posters
Scheduling HPC Workloads on Heterogeneous-ISA Architectures
Posters
Toward Efficient Architecture-Independent Algorithms for Dynamic Programs
Posters
T-thinker: A Task-Centric Distributed Framework For Compute-Intensive Divide-and-Conquer Algorithms
Posters
VEBO: A Vertex- and Edge-Balanced Ordering Heuristic to Load Balance Parallel Graph Processing
Posters