A GPU Memory Efficient Speed-up Scheme for Training Ultra-deep Neural Networks (PPoPP 2019 - Posters)

Sat 16 - Wed 20 February 2019 Washington, DC, United States

Who

Jinrong Guo, Wantao Liu, Wang Wang, Qu Lu, Songlin Hu, Jizhong Han, Ruixuan Li

Track

PPoPP 2019 Posters

Abstract

Ultra-deep neural network(UDNN) tends to yield higher-quality model but its training process is often difficult to handle. Scarce GPU DRAM capacity is the primary bottleneck that limits the depth of neural network and the range of trainable minibatch size. In this paper, we present a scheme that dedicates to make the utmost use of finite GPU memory resource to speed up the training process for UDNN. Firstly, a performance-model guided dynamic swap out/in strategy between GPU and host memory is carefully orchestrated to tackle the out-of-memory problem without introducing performance penalty. Then, a hyperparameter (minibatch size, learning rate) tuning policy is designed to explore the optimal configuration after applying the swap strategy from the perspectives of training time and final accuracy simultaneously. Finally, we verify the effectiveness of our scheme in both single and distributed GPU mode.

Jinrong Guo

Institute of Information Engineering, Chinese Academy of Sciences & School of Cyber Security, University of Chinese Academy of Sciences

Wantao Liu

Institute of Information Engineering, Chinese Academy of Sciences

Wang Wang

Institute of Information Engineering, Chinese Academy of Sciences & School of Cyber Security, University of Chinese Academy of Sciences

Qu Lu

Institute of Information Engineering, Chinese Academy of Sciences & School of Cyber Security, University of Chinese Academy of Sciences

Songlin Hu

Institute of Information Engineering, Chinese Academy of Sciences

Jizhong Han

Institute of Information Engineering, Chinese Academy of Sciences

Ruixuan Li

Institute of Information Engineering, Chinese Academy of Sciences

Time Zone

The program is currently displayed in (GMT-05:00) Cancun.

Use conference time zone: (GMT-05:00) CancunSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

Session Program

Sun 17 Feb
Displayed time zone: Cancun change

	18:00 - 20:00	Welcome Reception and Poster SessionMain Conference at Mezzanine Foyer