TY - JOUR
T1 - An Error Compensation Technique for Low-Voltage DNN Accelerators
AU - Ji, Daehan
AU - Shin, Dongyeob
AU - Park, Jongsun
N1 - Funding Information:
Manuscript received July 6, 2020; revised October 17, 2020; accepted November 9, 2020. Date of publication December 14, 2020; date of current version January 28, 2021. This work was supported in part by the National Research Foundation of Korea Grant funded by the Korea Government under Grant NRF-2020R1A2C3014820, in part by the Ministry of Science and ICT (MSIT), Korea, under the Information Technology Research Center (ITRC) Support Program supervised by the Institute for Information and Communications Technology Promotion (IITP) under Grant IITP-2020-2018-0-01433, and in part by the Industrial Strategic Technology Development Program (Development of SoC technology based on Spiking Neural Cell for Smart Mobile and IoT Devices) funded by the Ministry of Trade, Industry and Energy (MOTIE, Korea) under Grant 10077445. (Corresponding author: Jongsun Park.) Daehan Ji is with the Volume Product Design Group, SK Hynix Inc., Icheon 17336, South Korea (e-mail: daehan.ji@sk.com).
Publisher Copyright:
© 1993-2012 IEEE.
PY - 2021/2
Y1 - 2021/2
N2 - Reducing supply voltages of deep neural network (DNN) accelerators has been of particular interest since it can achieve high energy efficiency for mobile/edge applications. To ensure reliable DNN operations at low voltage, improving the timing error resilience of DNN accelerator is highly required. In this article, we present an error resilient technique to support low-voltage DNN operations by detecting and compensating erroneous computations using the proposed compensation multiply-accumulate (CMAC) unit. First, the timing errors are detected using Razor flip-flops at critical data-path, and erroneous computations are identified and dumped. Using additional multiplier data-path with flip-flops, the dropped computations are compensated in the next CMAC unit without additional clock-cycle penalty. Various bit-precisions of error compensations are analyzed to efficiently tradeoff DNN accuracy and hardware overhead. To improve the DNN accuracy even at low bit-precision of compensation, two types of rounding techniques are presented to effectively reflect the actual distribution of DNN computation results. The low-voltage DNN accelerator based on the proposed error compensation scheme has been implemented using 65-nm CMOS. Post-layout simulations show that the proposed DNN accelerator for ResNet-18 achieves about 47% and 24% energy savings compared with baseline and state-of-the-art error resilient DNN accelerators, respectively.
AB - Reducing supply voltages of deep neural network (DNN) accelerators has been of particular interest since it can achieve high energy efficiency for mobile/edge applications. To ensure reliable DNN operations at low voltage, improving the timing error resilience of DNN accelerator is highly required. In this article, we present an error resilient technique to support low-voltage DNN operations by detecting and compensating erroneous computations using the proposed compensation multiply-accumulate (CMAC) unit. First, the timing errors are detected using Razor flip-flops at critical data-path, and erroneous computations are identified and dumped. Using additional multiplier data-path with flip-flops, the dropped computations are compensated in the next CMAC unit without additional clock-cycle penalty. Various bit-precisions of error compensations are analyzed to efficiently tradeoff DNN accuracy and hardware overhead. To improve the DNN accuracy even at low bit-precision of compensation, two types of rounding techniques are presented to effectively reflect the actual distribution of DNN computation results. The low-voltage DNN accelerator based on the proposed error compensation scheme has been implemented using 65-nm CMOS. Post-layout simulations show that the proposed DNN accelerator for ResNet-18 achieves about 47% and 24% energy savings compared with baseline and state-of-the-art error resilient DNN accelerators, respectively.
KW - Error compensation
KW - low-voltage DNN accelerator
KW - timing error resilient accelerator
KW - voltage scaling
UR - http://www.scopus.com/inward/record.url?scp=85098746753&partnerID=8YFLogxK
U2 - 10.1109/TVLSI.2020.3041517
DO - 10.1109/TVLSI.2020.3041517
M3 - Article
AN - SCOPUS:85098746753
SN - 1063-8210
VL - 29
SP - 397
EP - 408
JO - IEEE Transactions on Very Large Scale Integration (VLSI) Systems
JF - IEEE Transactions on Very Large Scale Integration (VLSI) Systems
IS - 2
M1 - 9293011
ER -