kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Applying Reinforcement Learning to Protect Deep Neural Networks from Soft Errors
KTH, School of Industrial Engineering and Management (ITM), Engineering Design, Mechatronics and Embedded Control Systems.ORCID iD: 0000-0002-8028-3607
KTH, School of Industrial Engineering and Management (ITM), Engineering Design.
Hong Kong Univ Sci & Technol Guangzhou, Thrust Microelect, Guangzhou 511458, Peoples R China.ORCID iD: 0000-0003-0061-3475
KTH, School of Industrial Engineering and Management (ITM), Engineering Design, Mechatronics and Embedded Control Systems.ORCID iD: 0000-0001-7048-0108
2025 (English)In: Sensors, E-ISSN 1424-8220, Vol. 25, no 13, article id 4196Article in journal (Refereed) Published
Abstract [en]

With the advance of Artificial Intelligence, Deep Neural Networks are widely employed in various sensor-based systems to analyze operational conditions. However, due to the inherently nondeterministic and probabilistic natures of neural networks, the assurance of overall system performance could become a challenging task. In particular, soft errors could weaken the robustness of such networks and thereby threaten the system's safety. Conventional fault-tolerant techniques by means of hardware redundancy and software correction mechanisms often involve a tricky trade-off between effectiveness and scalability in addressing the extensive design space of Deep Neural Networks. In this work, we propose a Reinforcement-Learning-based approach to protect neural networks from soft errors by addressing and identifying the vulnerable bits. The approach consists of three key steps: (1) analyzing layer-wise resiliency of Deep Neural Networks by a fault injection simulation; (2) generating layer-wise bit masks by a Reinforcement-Learning-based agent to reveal the vulnerable bits and to protect against them; and (3) synthesizing and deploying bit masks across the network with guaranteed operation efficiency by adopting transfer learning. As a case study, we select several existing neural networks to test and validate the design. The performance of the proposed approach is compared with the performance of other baseline methods, including Hamming code and the Most Significant Bits protection schemes. The results indicate that the proposed method exhibits a significant improvement. Specifically, we observe that the proposed method achieves a significant performance gain of at least 10% to 15% over on the test network. The results indicate that the proposed method dynamically and efficiently protects the vulnerable bits compared with the baseline methods.

Place, publisher, year, edition, pages
MDPI AG , 2025. Vol. 25, no 13, article id 4196
Keywords [en]
reinforcement learning, soft errors protect, fault injection
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-371941DOI: 10.3390/s25134196ISI: 001527628100001PubMedID: 40648450Scopus ID: 2-s2.0-105010329024OAI: oai:DiVA.org:kth-371941DiVA, id: diva2:2008149
Note

QC 20251022

Available from: 2025-10-22 Created: 2025-10-22 Last updated: 2025-10-22Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textPubMedScopus

Authority records

Su, PengLi, YuhangChen, DeJiu

Search in DiVA

By author/editor
Su, PengLi, YuhangLu, ZhonghaiChen, DeJiu
By organisation
Mechatronics and Embedded Control SystemsEngineering Design
In the same journal
Sensors
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 8 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf