Abstract
Deep Neural Networks (DNNs) have been found to outperform conventional programming approaches in several applications such as computer vision and natural language processing. Efficient hardware architectures for deploying DNNs on edge devices have been actively studied. Emerging memory technologies with their better scalability, non-volatility, and good read performance are ideal candidates for DNNs which are trained once and deployed over many devices. Emerging memories have also been used in DNNs accelerators for efficient computations of dot-product. However, due to immature manufacturing and limited cell endurance, emerging resistive memories often result in reliability issues like stuck-at faults, which reduce the chip yield and pose a challenge to the accuracy of DNNs. Depending on the state, stuck-at faults may or may not cause error. Fault-tolerance of DNNs can be enhanced by reducing the impact of errors resulting from the stuck-at faults. In this work, we introduce simple and light-weight Intra-block Address remapping and weight encoding techniques to improve the fault-tolerance for DNNs. The proposed schemes effectively work at the network deployment time while preserving the network organization and the original values of the parameters. Experimental results on state-of-the-art DNN models indicate that, with a small storage overhead of just 0.98%, the proposed techniques achieve up to 300× stuck-at faults tolerance capability on Cifar10 dataset and 125× on Imagenet datatset, compared to the baseline DNNs without any fault-tolerance method. By integrating with the existing schemes, the proposed schemes can further enhance the fault resilience of DNNs.
Original language | English |
---|---|
Title of host publication | 2021 58th ACM/IEEE Design Automation Conference, DAC 2021 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1075-1080 |
Number of pages | 6 |
ISBN (Electronic) | 9781665432740 |
DOIs | |
Publication status | Published - 2021 Dec 5 |
Event | 58th ACM/IEEE Design Automation Conference, DAC 2021 - San Francisco, United States Duration: 2021 Dec 5 → 2021 Dec 9 |
Publication series
Name | Proceedings - Design Automation Conference |
---|---|
Volume | 2021-December |
ISSN (Print) | 0738-100X |
Conference
Conference | 58th ACM/IEEE Design Automation Conference, DAC 2021 |
---|---|
Country/Territory | United States |
City | San Francisco |
Period | 21/12/5 → 21/12/9 |
Bibliographical note
Funding Information:This work was supported in part by the Basic Science Research Program through the National Research Foundation of Korea funded by the Ministry of Education under Grant NRF-2018R1D1A1B07049842 and Grant NRF-2020M3F3A2A01082326, in part by the Ministry of Trade, Industry and Energy through the Korea Semiconductor Research Consortium support program for the development of the future semiconductor device under Grant 10080594, and in part by the next generation intelligent semiconductor development by the Ministry of Trade, Industry and Energy (MOTIE) under Grant 20011074. Correspondence to: Joon-Sung Yang (js.yang@yonsei.ac.kr)
Publisher Copyright:
© 2021 IEEE.
All Science Journal Classification (ASJC) codes
- Computer Science Applications
- Control and Systems Engineering
- Electrical and Electronic Engineering
- Modelling and Simulation