Transparent logging as a technique for debugging complex distributed systems

M. Satyanarayanan, David C. Steere, Masashi Kudo, Hank Mashburn
1992 Proceedings of the 5th workshop on ACM SIGOPS European workshop Models and paradigms for distributed systems structuring - EW 5  
In this position paper, we put forth the thesis that logging is a flexible, powerful, and convenient tool for debugging complex distributed systems. We substantiate this thesis in three steps.  ...  First, we argue that logging is particularly well suited for debugging distributed systems. Next, we observe that logging is already used in distributed systems for reasons independent of debugging.  ...  Logging as a Paradigm for Building Distributed Systems Independent of its value for debugging, logging is valuable in building distributed systems [7].  ... 
doi:10.1145/506378.506421 dblp:conf/sigopsE/SatyanarayananSKM92 fatcat:trgliyuqendzfjs7txrqhhhy6q

Transparent logging as a technique for debugging complex distributed systems

M. Satyanarayanan, David C. Steere, Masashi Kudo, Hank Mashburn
1992 Proceedings of the 5th workshop on ACM SIGOPS European workshop Models and paradigms for distributed systems structuring - EW 5  
In this position paper, we put forth the thesis that logging is a flexible, powerful, and convenient tool for debugging complex distributed systems. We substantiate this thesis in three steps.  ...  First, we argue that logging is particularly well suited for debugging distributed systems. Next, we observe that logging is already used in distributed systems for reasons independent of debugging.  ...  Logging as a Paradigm for Building Distributed Systems Independent of its value for debugging, logging is valuable in building distributed systems [7].  ... 
doi:10.1145/506379.506421 fatcat:qmpzra3vifd4nlulifmyx5ypbe

Monitoring and debugging distributed realtime programs

Paul S. Dodd, Chinya V. Ravishankar
1992 Software, Practice & Experience  
In this paper we describe the design and implementation of an integrated monitoring and debugging system for a distributed real-time computer system.  ...  The monitor is designed to support tasks such as debugging real-time applications, aiding real-time task scheduling, and measuring system performance.  ...  Kang Shin and Dilip Kandlur of the Real-Time Computing Laboratory at the University of Michigan for useful discussions during the progress of this work, and to one of the anonymous reviewers for bringing  ... 
doi:10.1002/spe.4380221005 fatcat:on5qs7vtnbdv5nhb7zjk6mehba

Shedding light on distributed system executions

Jenny Abrahamson, Ivan Beschastnikh, Yuriy Brun, Michael D. Ernst
2014 Companion Proceedings of the 36th International Conference on Software Engineering - ICSE Companion 2014  
We present two tools: ShiVector to transparently add vector timestamps to distributed system logs, and ShiViz to help developers understand distributed system logs by visualizing them as space-time diagrams  ...  In a distributed system, the hosts execute concurrently, generating asynchronous logs that are challenging to comprehend.  ...  CONCLUSION Logging is a common debugging technique, though the traces generated by distributed systems are often difficult to reason about manually.  ... 
doi:10.1145/2591062.2591134 dblp:conf/icse/AbrahamsonBBE14 fatcat:ucoym7ucuffq5e45tx4wwwlvvm

NUbugger: A Visual Real-Time Robot Debugging System [chapter]

Brendan Annable, David Budden, Alexandre Mendes
2014 Lecture Notes in Computer Science  
The system implementation facilitates simple and rapid extension or modification, making it a useful utility for debugging any similar complex robotic framework.  ...  As modern autonomous robots have improved in their ability to demonstrate human-like motor skills and reasoning, the size and complexity of software systems have increased proportionally, with developers  ...  In recent weeks, it has proven critical to the identification of low-level issues that were plaguing system performance, but near-impossible to identify using traditional debugging methodologies.  ... 
doi:10.1007/978-3-662-44468-9_48 fatcat:m3bz3vl5hnegnp6dsytd7l4epe

Debugging Distributed Systems with Causal Nets

Gian Luigi Ferrari, Roberto Guanciale, Daniele Strollo, Emilio Tuosto
2009 Electronic Communications of the EASST  
Formal methods for deciding the properties of service oriented systems are of paramount importance. However, they may require to master sophisticated techniques that programmers may lack.  ...  Here, we propose to use causal nets to define a few debugging primitives to drive the analysis of system developed with SC (after Signal Calculus), a process calculus featuring event-notification communication  ...  We first give a hint of JSCL so that we can ground our contention on its suitability as a model for a distributed debugging framework.  ... 
doi:10.14279/tuj.eceasst.14.190.181 dblp:journals/eceasst/FerrariGST08 fatcat:5ktex2bz7fe53pzla2ljhy677i

Debugging distributed systems

Ivan Beschastnikh, Patty Wang, Yuriy Brun, Michael D. Ernst
2016 Communications of the ACM  
The complexity of distributed systems has inspired work on visualization of such systems to make them more transparent to developers.  ...  Visualizing Distributed-System Executions As noted earlier, the ability to visualize distributed-system executions can help developers understand and debug their distributed systems.  ... 
doi:10.1145/2909480 fatcat:3agouv7vazaoxpcybc7nzto7me

A Multi-Core Debug Platform for NoC-Based Systems

Shan Tang, Qiang Xu
2007 2007 Design, Automation & Test in Europe Conference & Exhibition  
As traditional debug architecture for bus-based systems is not readily applicable to identify bugs in NoC-based systems, in this paper, we present a novel debug platform that supports concurrent debug access  ...  Experimental results show that the design-for-debug cost for the proposed technique in terms of area and traffic requirements is moderate.  ...  Under such circumstances, we propose a debug platform for NoC-based systems as shown in Fig. 1.  ... 
doi:10.1109/date.2007.364402 dblp:conf/date/TangX07 fatcat:jczs6phuprgsbbtntrfop73ss4

Diagnosing Distributed Systems through Log Data Analysis [article]

K. R. Chowdhary, Rajendra Purohit
2020 arXiv   pre-print
However, for parallel and distributed systems where happen-before relations are not directly available between the events, it becomes a challenge to fully depend on log-based analysis in such instances.  ...  solutions for performance analysis in distributed and parallel systems.  ...  Debug Log The debug log (/var/log/debug) provides messages from Linux system and applications running under this, which are helpful for debugging the system configuration as well as for debugging the applications  ... 
arXiv:2010.03960v1 fatcat:enpgmvypgjccrbtfuxx2rz5y5q

FixD: Fault Detection, Bug Reporting, and Recoverability for Distributed Applications

Cristian Tapus, David A. Noblet
2007 2007 IEEE International Parallel and Distributed Processing Symposium  
This paper presents the design of a hybrid tool, FixD, that attempts to address the deficiencies of these tools with respect to their application to distributed systems by using a novel composition of  ...  Model checking, logging, debugging, and checkpointing/recovery are great tools to identify bugs in small sequential programs.  ...  Logging, Checkpointing and Playback of Distributed Applications The observation of the deficiencies of these techniques as they apply to distributed systems is not new.  ... 
doi:10.1109/ipdps.2007.370413 dblp:conf/ipps/TapusN07 fatcat:pm52ct7mkjebzo3ai2xkknu3qa

Transition Watchpoints: Teaching Old Debuggers New Tricks

Kapil Arya, Tyler Denniston, Ariel Rabkin, Gene Cooperman
2017 The Art, Science, and Engineering of Programming  
This allows for debugging of subtle bugs that appear only after minutes or more of program execution. When a bug occurs within seconds of program startup, repeated debugging sessions suffice.  ...  Reversible debugging is preferred for bugs seen only after minutes. This architecture allows for an efficient and easy-to-write snapshot-based reversible debugger on top of a conventional debugger.  ...  For the checkpointing system, FReD uses Distributed MultiThreaded CheckPointing (DMTCP), a transparent user-level checkpointing package.  ... 
doi:10.22152/programming-journal.org/2017/1/16 fatcat:2tvi33gbpjfp5lzpzdcvmjlale

Troubleshooting blackbox SDN control software with minimal causal sequences

Colin Scott, Sam Whitlock, H.B. Acharya, Kyriakos Zarifis, Scott Shenker, Andreas Wundsam, Barath Raghavan, Aurojit Panda, Andrew Or, Jefferson Lai, Eugene Huang, Zhi Liu (+1 others)
2014 Computer communication review  
In this paper we discuss how to improve control software troubleshooting by presenting a technique for automatically identifying a minimal sequence of inputs responsible for triggering a given bug, without  ...  We apply our technique to five open source SDN control platforms-Floodlight, NOX, POX, Pyretic, ONOS-and illustrate how the minimal causal sequences our system found aided the troubleshooting process.  ...  We thank our shepherd Nate Foster and the anonymous reviewers for their comments.  ... 
doi:10.1145/2740070.2626304 fatcat:osnxr4ntjndjtdemprlrqhc76a

Troubleshooting blackbox SDN control software with minimal causal sequences

Colin Scott, Sam Whitlock, H.B. Acharya, Kyriakos Zarifis, Scott Shenker, Andreas Wundsam, Barath Raghavan, Aurojit Panda, Andrew Or, Jefferson Lai, Eugene Huang, Zhi Liu (+1 others)
2014 Proceedings of the 2014 ACM conference on SIGCOMM - SIGCOMM '14  
In this paper we discuss how to improve control software troubleshooting by presenting a technique for automatically identifying a minimal sequence of inputs responsible for triggering a given bug, without  ...  We apply our technique to five open source SDN control platforms-Floodlight, NOX, POX, Pyretic, ONOS-and illustrate how the minimal causal sequences our system found aided the troubleshooting process.  ...  We thank our shepherd Nate Foster and the anonymous reviewers for their comments.  ... 
doi:10.1145/2619239.2626304 dblp:conf/sigcomm/ScottWRPOLHLEWAZS14 fatcat:hb7x2genzba3npsm4q63bzbm7e

Unified debugging of distributed systems with Recon

Kyu Hyung Lee, Nick Sumner, Xiangyu Zhang, Patrick Eugster
2011 2011 IEEE/IFIP 41st International Conference on Dependable Systems & Networks (DSN)  
As related systems, Recon promotes SQL-like queries for debugging distributed systems.  ...  To scale to today's complex distributed software systems, debugging and replaying techniques mostly focus on single facets of software, e.g., local concurrency, distributed messaging, or data representation  ...  ACKNOWLEDGMENT We would like to thank the anonymous reviewers for their insightful comments.  ... 
doi:10.1109/dsn.2011.5958209 dblp:conf/dsn/LeeSZE11 fatcat:wjocxzlc5fbd7kjm3iibo22qqq

Jockey

Yasushi Saito
2005 Proceedings of the sixth international symposium on Automated analysis-driven debugging - AADEBUG'05  
Jockey is implemented as a user-space library that runs as a part of the target process. While this design is the key for achieving Jockey's goal of safety and ease of use, it also poses challenges.  ...  Jockey has been applied extensively to debug real-world programs. We describe our experiences using Jockey as well.  ...  Jockey is designed to help debug programs that communicate with the operating system or other computers in a complex fashion.  ... 
doi:10.1145/1085130.1085139 dblp:conf/aadebug/Saito05 fatcat:xcjmvhzjlncbxhrt35zgxawcgq