In this paper, we observe that the main performance bottleneck of emerging graph neural networks (GNNs) is not the inference algorithms themselves, but their graph data preprocessing. To take such preprocessing off the critical path in GNNs, we propose PreGNN, a novel hardware automation architecture that accelerates all the tasks of GNN preprocessing from the beginning to the end. Specifically, PreGNN accelerates graph generation in parallel, samples neighbor nodes of a given graph, and prepares graph datasets through all hardware. To reduce the long latency of GNN preprocessing over hardware, we also propose simple, efficient combinational logic that can perform radix sort and arrange the data in a self-governing manner. The evaluation results show that PreGNN can shorten the end-to-end latency of GNN inferences by 10.7× while consuming less energy by 3.3×, compared to a GPU-only system.
|Number of pages||4|
|Journal||IEEE Computer Architecture Letters|
|Publication status||Published - 2022|
Bibliographical noteFunding Information:
This work was supported by Samsung Science and Technology Foundation under Grant SRFC-IT2101-04.
© 2002-2011 IEEE.
All Science Journal Classification (ASJC) codes
- Hardware and Architecture