Adaptive Checkpoint Interval Algorithm Considering Task Deadline and Lifetime Reliability for Real-Time System☆
5
TL;DR: An adaptive checkpoint interval placement algorithm (ADeLiRACI) that meets all tasks deadline and is able to meet the hard deadline along with enhancing lifetime reliability of the system is proposed.
read more
About: This article is published in Procedia Computer Science. The article was published on 01 Jan 2015. and is currently open access. The article focuses on the topics: Task (project management).
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Instantaneous Mean-Time-To-Failure (MTTF)estimation for checkpoint interval computation at run time
TL;DR: The proposed MTTF estimation technique has been incorporated in the life-time reliability-aware checkpointing mechanism and it has been shown to work excellently without violating the task deadlines in all cases.
5
Communication Induced Checkpointing based Fault Tolerance Mechanism – A Review and CIAC-FTM Framework in IoT Environment
A. Sowjanya Lakshmi,Ch Vani Priya,Gaurav Gupta +2 more
- 04 Nov 2022
TL;DR: In this paper , a communication-induced adaptive checkpointing fault-tolerant mechanism (CIAC-FTM) is proposed to address software faults at application level in IoT. This mechanism places checkpoints at required nodes depending on the type of fault detected, in turn reducing checkpoints and storage overheads.
1
Integrating request replication into FaaS platforms: an experimental evaluation
TL;DR: In this paper , the authors propose the integration of a Request Replication mechanism in FaaS platforms and describe how this integration was implemented in Fission, a well-known, open-source platform.
Lifetime Reliability-Aware Checkpointing Mechanism: Modelling and Analysis: 2013 International Symposium on Electronic System Design
Dhiraj K. Pradhan,Jimson Mathew +1 more
- 01 Jan 2013
Abstract: Check pointing mechanism is used to tolerate the impact of transient faults through roll-back operation to a previously saved system state. In this paper, we propose a novel check pointing mechanism that considers fault tolerance in a duplex system in the presence of both transient and permanent faults. The main objective of our proposed mechanism is to extend the lifetime reliability of the duplex system by avoiding or even tolerating permanent faults in microprocessors. In addition, we also propose to migrate tasks from a 'near-to-die' processor to a spare processor under a condition where the current Mean-Time-To-Failure (MTTF) value is less or equal to a pre-determined threshold MTTF value. We validate our proposed mechanism and perform overhead analysis using various case studies. Later, we compare it with one of the most popular existing check pointing mechanism, namely the roll-forward check pointing scheme [9]. We show that unlike roll-back or roll-forward mechanisms, our proposed mechanism gives significantly higher lifetime reliability with reasonable system overheads.
Communication Induced Checkpointing based Fault Tolerance Mechanism – A Review and CIAC-FTM Framework in IoT Environment
04 Nov 2022
TL;DR: In this paper , a communication-induced adaptive checkpointing fault-tolerant mechanism (CIAC-FTM) is proposed to address software faults at application level in IoT. This mechanism places checkpoints at required nodes depending on the type of fault detected, in turn reducing checkpoints and storage overheads.
References
A first order approximation to the optimum checkpoint interval
TL;DR: It is standard practice to save periodically sufficient information to enable the job to be restarted at the previous point at which information was saved, and the saving of such information at these points is called checkpointing.
693
•Book
Architecture Design for Soft Errors
Shubu Mukherjee
- 07 Mar 2008
TL;DR: This book provides a comprehensive description of the architetural techniques to tackle the soft error problem, and covers the new methodologies for quantitative analysis of soft errors as well as novel, cost-effective architectural techniques to mitigate them.
Lifetime reliability-aware task allocation and scheduling for MPSoC platforms
Lin Huang,Feng Yuan,Qiang Xu +2 more
- 20 Apr 2009
TL;DR: An analytical model is proposed to estimate the lifetime reliability of multiprocessor platforms when executing periodical tasks, and a novel lifetime reliability-aware task allocation and scheduling algorithm based on simulated annealing technique is presented.
130
Roll-forward error recovery in embedded real-time systems
Jie Xu,Brian Randell +1 more
- 03 Jun 1996
TL;DR: This paper presents and discusses two alternative roll-forward recovery schemes, especially for time-critical and interaction-intensive applications, that deliver correct, timely results even when checkpoint validation is required.
26
•Posted Content
Energy-aware checkpointing of divisible tasks with soft or hard deadlines
TL;DR: This paper aims at minimizing the energy consumption when executing a divisible workload under a bound on the total execution time, while resilience is provided through checkpointing, and proposes either an exact solution, or a function that can be optimized numerically.