Recovery point selection on a reverse binary tree task model

doi:10.1109/32.31353

Journal Article10.1109/32.31353

Recovery point selection on a reverse binary tree task model

S.-K. Chen, +2 more

- 01 Aug 1989

- IEEE Transactions on Software Engineerin...

- Vol. 15, Iss: 8, pp 963-976

7

TL;DR: An analysis is conducted of the complexity of placing recovery points where the computation is modeled as a reverse binary tree task model, and algorithms are devised for solving the recovery point placement problem.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1007/BF02703630

A survey of checkpointing algorithms for parallel and distributed computers

S. Kalaiselvi, +1 more

- 01 Oct 2000

- Sadhana-academy Proceedings in Engineeri...

TL;DR: This paper surveys the algorithms which have been reported in the literature for checkpointing parallel/distributed systems and concludes that in development of parallel programs the user has to do a fair amount of work in distributing tasks and this information can be effectively used to simplify checkpointing and rollback recovery.

...read moreread less

70

System Structure for Software Fault Tolerance

Brian Randell

- 01 Jan 1975

TL;DR: The aim is to facilitate the provision of dependable error detection and recovery facilities which can cope with errors caused by residual design inadequacies, particularly in the system software, rather than merely the occasional malfunctioning of hardware components.

...read moreread less

67

•Journal Article

Backward error recovery in distributed systems

A Antola, +1 more

- 01 Jan 1986

- Automatic Control and Computer Sciences

4

Journal Article•10.1109/32.83909

Efficient algorithms for selection of recovery points in tree task models

Subhada K. Mishra, +2 more

- 01 Jul 1991

- IEEE Transactions on Software Engineerin...

TL;DR: An algorithm to minimize the expected computation time of the task system under a uniprocessor environment has been developed for the binary tree model.

...read moreread less

4

•Dissertation

Checkpointing Algorithms for Parallel Computers

S Kalaiselvi

- 01 Feb 1997

TL;DR: Dedicated to m y beloved P a r e n t s a n d m y dear Uncle.

...read moreread less

1

References

Journal Article•10.1145/390016.808467

System structure for software fault tolerance

Brian Randell

- 01 Apr 1975

- Sigplan Notices

TL;DR: In this article, the authors present a method for structuring complex computing systems by the use of what they term "recovery blocks", "conversations", and "fault-tolerant interfaces".

...read moreread less

1.8K

Proceedings Article•10.1145/800027.808467

System structure for software fault tolerance

Brian Randell

- 01 Jan 1975

TL;DR: In this article, the authors present a method for structuring complex computing systems by the use of what they term "recovery blocks", "conversations", and "fault-tolerant interfaces".

...read moreread less

1.1K

Journal Article•10.1145/361147.361115

A first order approximation to the optimum checkpoint interval

John W. Young

- 01 Sep 1974

- Communications of The ACM

TL;DR: It is standard practice to save periodically sufficient information to enable the job to be restarted at the previous point at which information was saved, and the saving of such information at these points is called checkpointing.

...read moreread less

693

Journal Article•10.1112/BLMS/1.3.431

Introduction to combinatorial mathematics

S. G. Williamson

- 01 Nov 1969

- Bulletin of The London Mathematical Soci...

425

Book•10.1007/978-3-642-82470-8

Reliable Computer Systems

Santosh K. Shrivastava

- 01 Oct 1985

TL;DR: The terms fault, error and failure are carefully defined and distinguished in the hope that an agreed terminology will emerge in the fault tolerance community.

...read moreread less

222