Thursday, January 31, 2008
D3S: Debugging Deployed Distributed Systems
Testing large-scale distributed systems is a challenge, because some errors manifest themselves only after a distributed sequence of events that involves machine and network failures. D3S is a checker that allows developers to specify predicates on distributed properties of a deployed system, and that checks these predicates while the system is running. When D3S finds a problem it produces the sequence of state changes that led to the problem, allowing developers to quickly find the root cause. ...
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment