Open Access
Interactive debugging in a distributed computational
R. D. Schiffenbauer
- 01 Aug 1981
12
TL;DR: The notion of probable simulation is defined, and the debugging facility is shown to achieve it: the facility simulates a particular computation of the program being debugged that is likely to occur.
Abstract: This thesis describes an implementation of a facility for interactively debugging distributed programs. These distributed programs consist of groups of cooperating processes concurrently executing on an arbitrarily extensive network of processors. The facility allows the user to monitor and control, at his leisure, the interprocess communications that occur through message passing while execution of the distributed program proceeds. It presents the user with the ability to simulate transmission errors and delays, to alter and create packets, and to precisely control the pattern of such communications. The facility serves as a tool for the detection of lurking bugs, those errors, peculiar to parallel processing, which may or may not appear during the course of any particular execution. The facility possesses a high degree of transparency towards the program being debugged. That is, it has a minimal effect on the events that define the execution of that program. Transparency is a desirable property for any debugger to possess. To achieve such transparency, the processes of the distributed program are made to execute in a logical time environment, reading logical, rather than physical, clocks. We show that the facility obeys a clock condition, with which any logical time system must comply in order to be correct. We also show that the facility actually simulates the program it is being used to debug. Finally, we show that the facility simulates a particular computation of the program that is likely to occur. The notion of probable simulation is defined, and our debugging facility is shown to achieve it.
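The clock condition mentioned in the abstract is presumably the one associated with Lamport-style logical clocks: if event a happens before event b, then C(a) < C(b). The abstract does not describe the thesis's actual mechanism, so the following is only a minimal sketch of a generic logical clock that satisfies that condition for message passing. The class name `LogicalClock` and its methods are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class LogicalClock:
    """Lamport-style logical clock for one process (illustrative, not the thesis's design)."""
    time: int = 0

    def tick(self) -> int:
        # A local event advances the logical clock by one.
        self.time += 1
        return self.time

    def send(self) -> int:
        # Sending is itself an event; the message carries the sender's timestamp.
        return self.tick()

    def receive(self, msg_timestamp: int) -> int:
        # Move past the message timestamp so the receive is ordered after the send.
        self.time = max(self.time, msg_timestamp) + 1
        return self.time


p, q = LogicalClock(), LogicalClock()
p.tick()                          # local event on process p
sent_at = p.send()                # p sends a message
q.tick()                          # unrelated local event on process q
received_at = q.receive(sent_at)  # q receives p's message
assert sent_at < received_at      # clock condition holds for send -> receive
```

Because processes read only these logical values, a debugger could in principle hold a message for as long as the user likes without the delay being visible to the program, which is one way to read the transparency claim in the abstract.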
Citations
Debugging heterogeneous distributed systems using event-based models of behavior
Peter Bates
- 01 Nov 1988
TL;DR: Event Based Behavioral Abstraction evolves naturally as a cooperative distributed program that can take better advantage of computational power available in a network computer system to enhance debugging tool transparency, reduce latency and uncertainty for fundamental debugging activities and accommodate diverse, heterogeneous architectures.
200
Monitoring and debugging distributed real-time programs
TL;DR: This paper describes how the monitor can be used to debug distributed and parallel applications by deterministic execution replay and presents a novel approach to monitoring shared variable references that provides transparent monitoring with low overhead.
86
Interactive blackbox debugging for concurrent languages
Germán S. Goldszmidt, Shmuel Katz, Shaula Yemini +2 more
- 01 Nov 1988
TL;DR: Describes a novel approach to designing portable, integrated debugging tools for concurrent languages by partitioning the tool set into two categories, along with the implementation of a prototype debugging facility for OCCAM.
37
Distributed debugging tools for heterogeneous distributed systems
P. Bates
- 13 Jun 1988
TL;DR: The toolset is a collection of components that together form a distributed system for debugging distributed systems; the components can be combined in varying ways to provide levels of debugging service appropriate for the resources available at individual nodes.
36
High-level language debugging for concurrent programs
TL;DR: Describes an integrated system design for debugging distributed programs written in concurrent high-level languages, along with the implementation of a debugging facility for OCCAM.
34
A bibliography of parallel debuggers, 1993 edition
Cherri M. Pancake, Robert H. B. Netzer +1 more
- 01 Dec 1993
TL;DR: A bibliography of over three hundred technical reports, journal and conference papers, and Ph.D. dissertations dealing with parallel and distributed debuggers covering treatments of programming environments, debugging methodology, and techniques for program analysis.
31