M. Safarpour, T. Z. Deng, J. Massingham, L. Xun, M. Sabokrou and O. Silvén, "Low-Voltage Energy Efficient Neural Inference by Leveraging Fault Detection Techniques," 2021 IEEE Nordic Circuits and Systems Conference (NorCAS), 2021, pp. 1-5, doi: 10.1109/NorCAS53631.2021.9599648
Low-voltage energy efficient neural inference by leveraging fault detection techniques
|Author:||Safarpour, Mehdi1; Deng, Tommy Z.2; Massingham, John2;|
1Center for Machine Vision and Signal Analysis, University of Oulu, Finland
2Huawei Technologies Sweden AB, Stockholm, Sweden
3University of Southampton, Southampton, UK
4Institute for Research in Fundamental Sciences (IPM), Tehran, Iran
|Persistent link:|| http://urn.fi/urn:nbn:fi-fe2021120959917
|Publisher:|| Institute of Electrical and Electronics Engineers
|Publish Date:|| 2021-12-09
Operating at reduced voltages offers substantial energy efficiency improvements, but at the expense of an increased probability of computational errors due to hardware faults. In this context, we targeted Deep Neural Networks (DNNs) as emerging energy-hungry building blocks in embedded applications. Without an error feedback mechanism, blind voltage downscaling will result in degraded accuracy or total system failure. To enable safe voltage downscaling, two solutions based on Self-Supervised Learning (SSL) and Algorithm-Based Fault Tolerance (ABFT) were developed in this paper. A DNN model trained on the MNIST dataset was deployed on a Field Programmable Gate Array (FPGA) that operated at reduced voltages and employed the proposed schemes. The SSL approach provides extremely low-overhead (≈0.2%) fault detection at the cost of lower error coverage and extra training, while ABFT incurs less than 8% run-time overhead with a close to 100% error detection rate. By using these solutions, substantial energy savings of up to 40.3% were achieved without compromising the accuracy of the model.
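To make the ABFT idea in the abstract concrete, the following is a minimal sketch of the classic checksum scheme for matrix multiplication, the core operation of DNN layers. It is not taken from the paper: the function names (`matmul`, `abft_matmul`, `faulty_matmul`) are hypothetical, integer arithmetic is assumed so that checksums can be compared exactly, and the injected bit flip merely stands in for a low-voltage hardware fault.

```python
def matmul(a, b):
    """Plain matrix product for lists of lists (stands in for the hardware)."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def abft_matmul(a, b, multiply=matmul):
    """Return (product, fault_detected) using row and column checksums.

    `multiply` is the possibly unreliable multiplier (an assumption of this
    sketch, used here only to allow fault injection).
    """
    n_rows, n_cols = len(a), len(b[0])
    # Append a row of column sums to a and a column of row sums to b.
    a_ext = a + [[sum(col) for col in zip(*a)]]
    b_ext = [row + [sum(row)] for row in b]
    c_ext = multiply(a_ext, b_ext)
    c = [row[:n_cols] for row in c_ext[:n_rows]]
    # In a fault-free product, each data row sums to its checksum entry ...
    rows_ok = all(sum(c[i]) == c_ext[i][n_cols] for i in range(n_rows))
    # ... and each data column sums to its checksum entry.
    cols_ok = all(sum(c[i][j] for i in range(n_rows)) == c_ext[n_rows][j]
                  for j in range(n_cols))
    return c, not (rows_ok and cols_ok)


def faulty_matmul(a, b):
    """Multiplier with one injected bit flip, mimicking a hardware fault."""
    c = matmul(a, b)
    c[0][0] ^= 1 << 3
    return c


a = [[1, 2], [3, 4]]
b = [[5, 6], [7, 8]]
clean, clean_flag = abft_matmul(a, b)
corrupt, corrupt_flag = abft_matmul(a, b, multiply=faulty_matmul)
print(clean, clean_flag)
print(corrupt, corrupt_flag)
```

A detection flag of this kind is the sort of error feedback that voltage downscaling needs: the supply voltage can be lowered until faults start to be reported. With floating-point data, the exact equality tests would have to be replaced by a tolerance-based comparison.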
|Pages:||1 - 5|
2021 IEEE Nordic Circuits and Systems Conference (NorCAS)
IEEE Nordic Circuits and Systems Conference
|Type of Publication:|| A4 Article in conference proceedings
|Field of Science:|| 113 Computer and information sciences; 213 Electronic, automation and communications engineering, electronics
Part of this work was supported by the 6G Flagship Research Programme under Academy of Finland Grant 318927.
Support from the Walter Ahlström foundation and Riitta ja Jarmo Takasen foundation is acknowledged by the first author.
|Academy of Finland Grant Number:|| 318927 (Academy of Finland Funding decision)
Source codes used to generate models in this study are available from
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.