Memory-efficient DNN training on mobile devices

In Gim, Jeong Gil Ko

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

On-device deep neural network (DNN) training holds the potential to enable a rich set of privacy-aware and infrastructure-independent personalized mobile applications. However, despite advancements in mobile hardware, locally training a complex DNN is still a nontrivial task given its resource demands. In this work, we show that the limited memory resources on mobile devices are the main constraint and propose Sage as a framework for efficiently optimizing memory resources for on-device DNN training. Specifically, Sage configures a flexible computation graph for DNN gradient evaluation and reduces the memory footprint of the graph using operator- and graph-level optimizations. In run-time, Sage employs a hybrid of gradient checkpointing and micro-batching techniques to dynamically adjust its memory use to the available system memory budget. Using implementation on off-the-shelf smartphones, we show that Sage enables local training of complex DNN models by reducing memory use by more than 20-fold compared to a baseline approach. We also show that Sage successfully adapts to run-time memory budget variations, and evaluate its energy consumption to show Sage's practical applicability.

Original languageEnglish
Title of host publicationMobiSys 2022 - Proceedings of the 2022 20th Annual International Conference on Mobile Systems, Applications and Services
PublisherAssociation for Computing Machinery, Inc
Pages464-476
Number of pages13
ISBN (Electronic)9781450391856
DOIs
Publication statusPublished - 2022 Jun 27
Event20th ACM International Conference on Mobile Systems, Applications and Services, MobiSys 2022 - Portland, United States
Duration: 2022 Jun 272022 Jul 1

Publication series

NameMobiSys 2022 - Proceedings of the 2022 20th Annual International Conference on Mobile Systems, Applications and Services

Conference

Conference20th ACM International Conference on Mobile Systems, Applications and Services, MobiSys 2022
Country/TerritoryUnited States
CityPortland
Period22/6/2722/7/1

Bibliographical note

Funding Information:
The authors would like to thank our shepherd, Professor Mi Zhang, and the anonymous reviewers for their valuable feedback on the work. This work was supported by the Ministry of Science and ICT’s NRF Basic Science Research Program (2021R1A2C4002380), IITP (IITP-2022-2022-0-00240), ITRC Program supervised by IITP (IITP-2021-2020-0-01461), Ministry of Culture, Sports and Tourism and Korea Creative Content Agency (R2021040018), and by the Ministry of Trade, Industry and Energy and KIAT through the International Cooperative R&D program (P0016150). In Gim submitted this work as Hyunjun Kim. JeongGil Ko is the corresponding author for this work (jeonggil.ko@yonsei.ac.kr).

Publisher Copyright:
© 2022 ACM.

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Memory-efficient DNN training on mobile devices'. Together they form a unique fingerprint.

Cite this