【24h】

Can Linux be Rejuvenated without Reboots?

机译:无需重新引导就能使Linux焕发青春吗?

获取原文
获取原文并翻译 | 示例

摘要

Operating systems (OSes) are crucial for achieving high availability of computer systems. Even if the applications running on the operating system are highly available, a bug inside the kernel may result in a failure of the entire software stack. Rejuvenating OSes is a promising approach to prevent and recover from transient errors. Unfortunately, OS rejuvenation takes a lot of time because we do not have any method other than rebooting the entire OS. In this paper we explore the possibility of rejuvenating Linux without reboots. In our previous research, we investigated the scope of error propagation in Linux. The propagation scope is process-local if the error is confined in the process context that activated it. The scope is kernel-global if the error propagates to other processes'' contexts or global data structures. If most errors are process- local, we can rejuvenate the Linux kernel without rebooting the entire kernel because the kernel goes back to a consistent and clean state simply by killing and revoking the resources of the faulting process. Our conclusion is that Linux can be rejuvenated without reboots with high probability. Linux is coded in a defensive way and thus, most of the manifested errors (96%) were process-local and only one error was kernel- global.
机译:操作系统(OSes)对于实现计算机系统的高可用性至关重要。即使在操作系统上运行的应用程序具有很高的可用性,内核内部的错误也可能导致整个软件堆栈的故障。使操作系统恢复活力是防止瞬态错误并从中恢复的一种有前途的方法。不幸的是,使操作系统恢复活力需要大量时间,因为除了重启整个操作系统之外,我们没有其他方法。在本文中,我们探讨了无需重启即可恢复Linux活力的可能性。在我们以前的研究中,我们研究了Linux中错误传播的范围。如果错误限制在激活它的过程上下文中,则传播范围是局部过程的。如果错误传播到其他进程的上下文或全局数据结构,则作用域为全局内核。如果大多数错误是进程本地的,那么我们可以使Linux内核恢复活力,而无需重新启动整个内核,因为仅通过杀死并吊销故障进程的资源即可使内核返回到一致且干净的状态。我们的结论是,无需重启就可以使Linux焕发青春。 Linux是采用防御性方式进行编码的,因此,大多数显示的错误(96%)是局部过程的,而只有一个错误是内核全局的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号