首页> 外文会议>ACM symposium on Operating systems principles >Using model checker and replay facility to debug complex distributed system
【24h】

Using model checker and replay facility to debug complex distributed system

机译:使用模型检查器和重放工具调试复杂的分布式系统

获取原文

摘要

A correct system is only derived from a correct implementation of a correct specification. Unfortunately, this imposes a heavy burden in the development process, especially for complex, distributed system ranging from machine room computing and storage services as well as large-scale P2P applications. A specification, if authored in formal language such as TLA+, Spec#, SPIN etc., is ready for model checking. The state explosion problem, however, prohibits all specification states to be thoroughly traversed. Often ad hoc heuristics are applied to drastically reduce the scale so as to make the model checking phase tractable. A correct implementation can be even more challenging, especially when we encounter non-deterministic bugs that are hard to reproduce. The gap between spec and implementation often leaves one to wonder whether the implementation or the spec is faulty, or even both. Motivated by our experiences in developing several complete large scale distributed systems, we are designing and implementing a suite of testing and debugging facility on top of our previously developed WiDS platform.
机译:正确的系统仅来自 correct 规范的 correct 实现。不幸的是,这给开发过程带来了沉重的负担,尤其是对于机房计算和存储服务以及大规模P2P应用程序等复杂的分布式系统而言。如果规范以TLA +,Spec#,SPIN等正式语言编写,则可以进行模型检查。但是,状态爆炸问题禁止彻底遍历所有规范状态。通常采用临时启发式方法来大大减小规模,以使模型检查阶段变得容易处理。正确的实现可能更具挑战性,尤其是当我们遇到难以复制的不确定性错误时。规范和实现之间的鸿沟常常使人们想知道实现或规范是否错误,或者甚至两者都是错误的。基于我们在开发多个完整的大规模分布式系统中的经验的激励,我们正在我们先前开发的WiDS平台之上设计和实施一套测试和调试工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号