Optimal Checkpoint Selection with Dual-Modular Redundancy Hardening

  • Shin Haeng Kang
  • , Hae Woo Park
  • , Sungchan Kim
  • , Hyunok Oh
  • , Soonhoi Ha

    Research output: Contribution to journalJournal articlepeer-review

    Abstract

    With the continuous scaling of semiconductor technology, failure rate is increasing significantly so that reliability becomes an important issue in multiprocessor system-on-chip (MPSoC) design. We propose an optimal checkpoint selection with task duplication hardening to tolerate transient faults. A target application is specified in a task graph, and the schedule/checkpoint placements are determined at design time. The proposed optimal algorithm minimizes the checkpoint overhead with a latency constraint. Experimental results show that the proposed algorithm effectively reduces the minimum end-to-end latency to perform a fault-tolerant schedule. In addition, the proposed algorithm dramatically decreases the checkpointing overhead on uniprocessor and multiprocessor systems compared with a greedy approach and an equidistant algorithm.

    Original languageEnglish
    Article number6880324
    Pages (from-to)2036-2048
    Number of pages13
    JournalIEEE Transactions on Computers
    Volume64
    Issue number7
    DOIs
    StatePublished - 2015.07.1

    Keywords

    • Checkpoint, task graph
    • multiprocessor
    • optimal algorithm
    • reliability

    Quacquarelli Symonds(QS) Subject Topics

    • Computer Science & Information Systems

    Fingerprint

    Dive into the research topics of 'Optimal Checkpoint Selection with Dual-Modular Redundancy Hardening'. Together they form a unique fingerprint.

    Cite this