Hardware/Software Co-Programmable Framework for Computational SSDs to Accelerate Deep Learning Service on Large-Scale Graphs

Miryeong Kwon, Donghyun Gouk, Sangwon Lee, Myoungsoo Jung

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Graph neural networks (GNNs) process large-scale graphs consisting of a hundred billion edges. In contrast to traditional deep learning, unique behaviors of the emerging GNNs are engaged with a large set of graphs and embedding data on storage, which exhibits complex and irregular preprocessing. We propose a novel deep learning framework on large graphs, HolisticGNN, that provides an easy-to-use, near-storage inference infrastructure for fast, energy-efficient GNN processing. To achieve the best end-to-end latency and high energy efficiency, HolisticGNN allows users to implement various GNN algorithms and directly executes them where the actual data exist in a holistic manner. It also enables RPC over PCIe such that the users can simply program GNNs through a graph semantic library without any knowledge of the underlying hardware or storage configurations. We fabricate HolisticGNN's hardware RTL and implement its software on an FPGA-based computational SSD (CSSD). Our empirical evaluations show that the inference time of HolisticGNN outperforms GNN inference services using high-performance modern GPUs by 7.1× while reducing energy consumption by 33.2×, on average.

Original languageEnglish
Title of host publicationProceedings of the 20th USENIX Conference on File and Storage Technologies, FAST 2022
PublisherUSENIX Association
Pages147-163
Number of pages17
ISBN (Electronic)9781939133267
Publication statusPublished - 2022
Event20th USENIX Conference on File and Storage Technologies, FAST 2022 - Santa Clara, United States
Duration: 2022 Feb 222022 Feb 24

Publication series

NameProceedings of the 20th USENIX Conference on File and Storage Technologies, FAST 2022

Conference

Conference20th USENIX Conference on File and Storage Technologies, FAST 2022
Country/TerritoryUnited States
CitySanta Clara
Period22/2/2222/2/24

Bibliographical note

Funding Information:
This research is supported by Samsung Research Funding & Incubation Center of Samsung Electronics (SRFC-IT2101-04). This work is protected by one or more patents, and Myoungsoo Jung is the corresponding author. The authors would like to thank the anonymous reviewers for their comments and suggestions. The authors also thank Raju Rangaswami for shepherding this paper.

Publisher Copyright:
© AST 2022.All rights reserved.

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Hardware and Architecture
  • Software

Fingerprint

Dive into the research topics of 'Hardware/Software Co-Programmable Framework for Computational SSDs to Accelerate Deep Learning Service on Large-Scale Graphs'. Together they form a unique fingerprint.

Cite this