The Willow Architecture: Comprehensive Survivability for Large-Scale Distributed Applications
ReportThe Willow architecture is a comprehensive approach to survivability in critical distributed applications. Survivability is achieved in a deployed system using a unique combination of (a) fault avoidance by disabling vulnerable network elements intentionally when a threat is detected or predicted, (b) fault elimination by replacing system software elements when faults are discovered, and (c) fault tolerance by reconfiguring the system if non-maskable damage occurs. The key to the architecture is a powerful reconfiguration mechanism that is combined with a general control structure in which network state is sensed, analyzed, and required changes effected. The architecture can be used to deploy software functionality enhancements as well as survivability. Novel aspects include: node configuration control mechanisms; a workflow
system for resolving conflicting configurations; communications based on widearea event notification; tolerance for wide-area, hierarchic and sequential faults; and secure, scalable and delegatable trust models.
Note: Abstract extracted from PDF file via OCR
All rights reserved (no additional license for public reuse)
English
Knight, John, Dennis Heimbigner, Alexander Wolf, Antonio Carzaniga, Jonathan Hill, Premkumar Devanbu, and Michael Gertz. "The Willow Architecture: Comprehensive Survivability for Large-Scale Distributed Applications." University of Virginia Dept. of Computer Science Tech Report (2001).
University of Virginia, Department of Computer Science
2001