SSDStreamer: Specializing I/O Stack for Large-Scale Machine Learning

Jonghyun Bae, Hakbeom Jang, Jeonghun Gong, Wenjing Jin, Shine Kim, Jaeyoung Jang, Tae Jun Ham, Jinkyu Jeong, Jae W. Lee

    Research output: Contribution to journalArticlepeer-review

    Abstract

    This article presents SSDStreamer, an SSD-based caching system for large-scale machine learning. By using DRAM as stream buffer, instead of an upper-level cache, SSDStreamer significantly outperforms state-of-the-art multilevel caching systems on Apache Spark, while requiring much less DRAM capacity.

    Original languageEnglish
    Article number8770099
    Pages (from-to)73-81
    Number of pages9
    JournalIEEE Micro
    Volume39
    Issue number5
    DOIs
    Publication statusPublished - 2019 Sept 1

    Bibliographical note

    Funding Information:
    This work was supported in part by the Research Resettlement Fund for the new faculty of Seoul National University; in part by a research grant from Samsung Electronics, National Research Foundation of Korea grant funded by the Ministry of Science, ICT & Future Planning (PE Class Heterogeneous High Performance Computer Development, NRF- 2016M3C4A7952587); and in part by the Competency Development Program for Industry Specialists of the Korean Ministry of Trade, Industry and Energy (MOTIE), operated by the Korea Institute for Advancement of Technology (KIAT) (No. N0001883, HRD Program for Intelligent Semiconductor Industry).

    Funding Information:
    This work was supported in part by the Research Resettlement Fund for the new faculty of Seoul National University; in part by a research grant from Samsung Electronics, National Research Foundation of Korea grant funded by the Ministry of Science, ICT & Future Planning (PE Class Heterogeneous High Performance Computer Development, NRF-2016M3C4A7952587); and in part by the Competency Development Program for Industry Specialists of the Korean Ministry of Trade, Industry and Energy (MOTIE), operated by the Korea Institute for Advancement of Technology (KIAT) (No. N0001883, HRD Program for Intelligent Semiconductor Industry).

    Publisher Copyright:
    © 1981-2012 IEEE.

    All Science Journal Classification (ASJC) codes

    • Software
    • Hardware and Architecture
    • Electrical and Electronic Engineering

    Fingerprint

    Dive into the research topics of 'SSDStreamer: Specializing I/O Stack for Large-Scale Machine Learning'. Together they form a unique fingerprint.

    Cite this