Engineering a High-Performance GPU B-Tree (PPoPP 2019 - Main Conference)

Who

Muhammad Awad, Saman Ashkiani, Rob Johnson, Martin Farach-Colton, John D. Owens

Track

PPoPP 2019 Main Conference

Time Zone

The program is currently displayed in (GMT-05:00) Guadalajara, Mexico City, Monterrey.

Use conference time zone: (GMT-05:00) Guadalajara, Mexico City, MonterreySelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 18 Feb 2019 16:35 - 17:00 at Salon 12/13 - Session 4: GPU B-Trees Chair(s): Ang Li

Abstract

We engineer a GPU implementation of a B-Tree that supports concurrent queries (point, range, and successor) and updates (insertions and deletions). Our B-tree outperforms the state of the art, a GPU log-structured merge tree (LSM) and a GPU sorted array. In particular, point and range queries are significantly faster than in a GPU LSM (the GPU LSM does not implement successor queries). Furthermore, B-Tree insertions are also faster than LSM and sorted array insertions unless insertions come in batches of more than roughly 100k. Because we cache the upper levels of the tree, we achieve lookup throughput that exceeds the DRAM bandwidth of the GPU. We demonstrate that the key limiter of performance on a GPU is contention and describe the design choices that allow us to achieve this high performance.

DOI

https://doi.org/10.1145/3293883.3295706

Muhammad Awad

Saman Ashkiani

University of California, Davis

Rob Johnson

VMWare Research

Martin Farach-Colton

Rutgers University

John D. Owens