Abstract
With the continuous scaling of semiconductor technology, failure rate is increasing significantly so that reliability becomes an important issue in multiprocessor system-on-chip (MPSoC) design. We propose an optimal checkpoint selection with task duplication hardening to tolerate transient faults. A target application is specified in a task graph, and the schedule/checkpoint placements are determined at design time. The proposed optimal algorithm minimizes the checkpoint overhead with a latency constraint. Experimental results show that the proposed algorithm effectively reduces the minimum end-to-end latency to perform a fault-tolerant schedule. In addition, the proposed algorithm dramatically decreases the checkpointing overhead on uniprocessor and multiprocessor systems compared with a greedy approach and an equidistant algorithm.
| Original language | English |
|---|---|
| Article number | 6880324 |
| Pages (from-to) | 2036-2048 |
| Number of pages | 13 |
| Journal | IEEE Transactions on Computers |
| Volume | 64 |
| Issue number | 7 |
| DOIs | |
| State | Published - 2015.07.1 |
Keywords
- Checkpoint, task graph
- multiprocessor
- optimal algorithm
- reliability
Quacquarelli Symonds(QS) Subject Topics
- Computer Science & Information Systems
Fingerprint
Dive into the research topics of 'Optimal Checkpoint Selection with Dual-Modular Redundancy Hardening'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver