首页> 外文会议>Proceedings of the Sixth Symposium on Operating Systems Design and Implementation(OSDI'04) >Understanding and Dealing with Operator Mistakes in Internet Services
【24h】

Understanding and Dealing with Operator Mistakes in Internet Services

机译:了解并处理互联网服务中的运营商错误

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Operator mistakes are a significant source of unavailability in modern Internet services. In this paper, we first characterize these mistakes by performing an extensive set of experiments using human operators and a realistic three-tier auction service. The mistakes we observed range from software misconfiguration, to fault misdiag-nosis, to incorrect software restarts. We next propose to validate operator actions before they are made visible to the rest of the system. We demonstrate how to accomplish this task via the creation of a validation environment that is an extension of the online system, where components can be validated using real workloads before they are migrated into the running service. We show that our prototype validation system can detect 66% of the operator mistakes that we have observed.
机译:操作员错误是现代Internet服务无法使用的重要原因。在本文中,我们首先通过使用人工操作员和现实的三层拍卖服务进行大量实验来表征这些错误。我们观察到的错误范围从软件配置错误到错误的诊断错误,再到错误的软件重启。接下来,我们建议在使操作员操作对系统其余部分可见之前对其进行验证。我们演示了如何通过创建作为在线系统扩展的验证环境来完成此任务,在该环境中,可以在将组件迁移到正在运行的服务中之前使用实际工作负载对组件进行验证。我们证明了我们的原型验证系统可以检测到我们观察到的66%的操作员错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号