Embedding Fault-Tolerance with Dual-Level Agents in Many-Core Systems
2012 (English)In: First MEDIAN Workshop (MEDIAN'12), 2012Conference paper, Presentation (Other academic)
Dual-level fault-tolerance is presented on many-core systems, provided by the software-based system agent and hardware-based local agents. The system agent performs fault-triggered energy-aware remapping with bandwidth constraints, addressing coarse-grained processor failures. The local agents achieve fine-grained link-level fault tolerance against transient and permanent errors. The paper concisely presents the architecture, dual-level fault-tolerant techniques and experiment results.
Place, publisher, year, edition, pages
Engineering and Technology
IdentifiersURN: urn:nbn:se:kth:diva-109402OAI: oai:DiVA.org:kth-109402DiVA: diva2:581837
First MEDIAN Workshop (MEDIAN'12)
QC 201201082013-01-022013-01-022013-01-08Bibliographically approved