FlashGPU

Placing new flash next to GPU cores

Jie Zhang, Miryeong Kwon, Hyojong Kim, Hyesoon Kim, Myoungsoo Jung

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

We propose FlashGPU, a new GPU architecture that tightly blends new flash (Z-NAND) with massive GPU cores. Specifically, we replace global memory with Z-NAND, which exhibits ultra-low latency. We also architect a flash core that manages request dispatch and address translation underneath the L2 cache banks of the GPU cores. While Z-NAND is a hundred times faster than conventional 3D-stacked flash, its latency is still longer than that of DRAM. To address this shortcoming, we propose a dynamic page-placement and buffer manager for the Z-NAND subsystem that is aware of the bulk and parallel memory access characteristics of GPU applications, thereby offering high throughput and low energy consumption.
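
As a rough illustration of the ideas named in the abstract (address translation in a flash core and a buffer that hides flash latency), the following toy model may help. It is a minimal sketch and not the authors' design. The class name `ToyFlashCore`, the striping policy, the LRU eviction policy, the page size, the channel count, and both latency constants are all assumptions made for illustration, and none of them comes from the paper.

```python
from collections import OrderedDict

# All constants below are illustrative placeholders, not measured values.
PAGE_SIZE = 4096          # bytes per flash page (assumed)
NUM_CHANNELS = 4          # hypothetical number of flash channels
BUFFER_LATENCY_NS = 100   # assumed latency of a buffer hit
FLASH_LATENCY_NS = 3000   # assumed latency of a flash page read


class ToyFlashCore:
    """Toy model: page-level address translation plus an LRU page buffer."""

    def __init__(self, buffer_pages):
        self.buffer_pages = buffer_pages
        self.mapping = {}                     # logical page -> (channel, physical page)
        self.next_free = [0] * NUM_CHANNELS   # next unused physical page per channel
        self.buffer = OrderedDict()           # buffered logical pages in LRU order
        self.hits = 0
        self.misses = 0

    def translate(self, logical_page):
        """Map a logical page to (channel, physical page), allocating on first use."""
        if logical_page not in self.mapping:
            # Stripe consecutive logical pages across channels so that
            # bulk accesses could be served by several channels in parallel.
            channel = logical_page % NUM_CHANNELS
            self.mapping[logical_page] = (channel, self.next_free[channel])
            self.next_free[channel] += 1
        return self.mapping[logical_page]

    def access(self, address):
        """Serve one byte address and return its modeled latency in ns."""
        logical_page = address // PAGE_SIZE
        if logical_page in self.buffer:
            self.buffer.move_to_end(logical_page)   # refresh LRU position
            self.hits += 1
            return BUFFER_LATENCY_NS
        self.misses += 1
        self.translate(logical_page)
        self.buffer[logical_page] = True
        if len(self.buffer) > self.buffer_pages:
            self.buffer.popitem(last=False)         # evict least recently used page
        return FLASH_LATENCY_NS


def average_latency(core, addresses):
    """Mean modeled latency over a sequence of byte addresses."""
    return sum(core.access(a) for a in addresses) / len(addresses)


if __name__ == "__main__":
    core = ToyFlashCore(buffer_pages=2)
    trace = [0, 64, 128, 4096, 4160, 0]   # touches page 0 three times, page 1 twice, page 0 again
    print(average_latency(core, trace), core.hits, core.misses)
```

In this toy trace, nearby addresses fall into the same page, so most accesses are served from the buffer and only the first touch of each page pays the assumed flash latency. A real design would also have to handle writes, garbage collection, and concurrent requests, which this sketch leaves out.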

Original language: English
Title of host publication: Proceedings of the 56th Annual Design Automation Conference 2019, DAC 2019
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781450367257
DOI: https://doi.org/10.1145/3316781.3317827
Publication status: Published - 2019 Jun 2
Event: 56th Annual Design Automation Conference, DAC 2019 - Las Vegas, United States
Duration: 2019 Jun 2 to 2019 Jun 6

Publication series

Name: Proceedings - Design Automation Conference
ISSN (Print): 0738-100X

Conference

Conference: 56th Annual Design Automation Conference, DAC 2019
Country: United States
City: Las Vegas
Period: 19/6/2 to 19/6/6

Fingerprint

NAND
Flash
Latency
Data storage equipment
Dynamic random access storage
Cache
High Throughput
Placement
Energy Consumption
Buffer
Subsystem
Managers
Energy utilization
Throughput
Graphics processing unit

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Control and Systems Engineering
  • Electrical and Electronic Engineering
  • Modelling and Simulation

Cite this

Zhang, J., Kwon, M., Kim, H., Kim, H., & Jung, M. (2019). FlashGPU: Placing new flash next to GPU cores. In Proceedings of the 56th Annual Design Automation Conference 2019, DAC 2019 [a156] (Proceedings - Design Automation Conference). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1145/3316781.3317827
@inproceedings{575d121fe96b4f178cb2e0206d861ce5,
title = "FlashGPU: Placing new flash next to GPU cores",
abstract = "We propose FlashGPU, a new GPU architecture that tightly blends new flash (Z-NAND) with massive GPU cores. Specifically, we replace global memory with Z-NAND that exhibits ultra-low latency. We also architect a flash core to manage request dispatches and address translations underneath L2 cache banks of GPU cores. While Z-NAND is a hundred times faster than conventional 3D-stacked flash, its latency is still longer than DRAM. To address this shortcoming, we propose a dynamic page-placement and buffer manager in Z-NAND subsystems by being aware of bulk and parallel memory access characteristics of GPU applications, thereby offering high-throughput and low-energy consumption behaviors.",
author = "Jie Zhang and Miryeong Kwon and Hyojong Kim and Hyesoon Kim and Myoungsoo Jung",
year = "2019",
month = "6",
day = "2",
doi = "10.1145/3316781.3317827",
language = "English",
series = "Proceedings - Design Automation Conference",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
booktitle = "Proceedings of the 56th Annual Design Automation Conference 2019, DAC 2019",
address = "United States",
}
