首页> 外文会议>IEEE Network Operations and Management Symposium >TOSS: TONICS for operation support systems: system management using the world wide web and intelligent software agents
【24h】

TOSS: TONICS for operation support systems: system management using the world wide web and intelligent software agents

机译:折腾:用于操作支持系统的调节:系统管理使用万维网和智能软件代理

获取原文

摘要

An enterprise wide distributed computing environment consists of a variety of hardware/OS platforms running a wide range of mission critical applications. These platforms may be interconnected over local area networks, wide area networks, even the public Internet. The availability of the applications, platforms and associated computing resources (e.g. cpu, disk space, memory, databases, middlewares, etc.) is critical to the business mission of the enterprise. This poses demanding requirements on the design of system management procedures for such critical distributed systems. Those requirements include continuous monitoring of the health of these resources, detection of potential problems, problem notifications, and timely corrective actions. The traditional approach to meet these requirements has been mostly manual intervention from a system administrator who logs on to individual machines and issues a set of commands. However, this practice neither provides continuous monitoring, nor does it assure timely detections, notifications, or corrective actions. Moreover, it is time consuming, error prone, and insecure. We present a novel approach towards building a system management framework for mission critical distributed systems. Our framework, TOSS, offers several advantages over the existing approaches such as: Continuous monitoring of critical computing resources by intelligent software agents; Problem detection and notification by intelligent software agents; Centralized control of applications through a Web browser; Centralized configuration management through a Web browser; Security; Flexibility to monitor wide range of platforms and applications; Interoperability with third party management platforms.
机译:企业宽分布式计算环境包括运行各种任务关键应用程序的各种硬件/操作系统平台。这些平台可以互连在局域网,广域网,甚至是公共互联网上。应用程序,平台和关联的计算资源(例如CPU,磁盘空间,内存,数据库,跨方等)的可用性对企业的业务使命至关重要。这造成了对这种关键分布式系统的系统管理程序设计要求的要求。这些要求包括持续监测这些资源的健康,检测潜在问题,问题通知和及时纠正措施。符合这些要求的传统方法主要是从登录各个计算机的系统管理员手动干预,并发出一组命令。但是,这种做法既不提供持续监控,也没有确保及时检测,通知或纠正措施。而且,它是耗时的,易于出错的,不安全。我们提出了一种为支持任务关键分布式系统构建系统管理框架的新方法。我们的框架折腾,提供了现有方法的几个优势,如:智能软件代理持续监控关键计算资源;智能软件代理的问题检测和通知;通过Web浏览器集中控制应用程序;通过Web浏览器集中配置管理;安全;灵活地监控广泛的平台和应用;与第三方管理平台的互操作性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号