首页> 外文会议>International conference on product-focused software process improvement >Making Runtime Data Useful for Incident Diagnosis: An Experience Report
【24h】

Making Runtime Data Useful for Incident Diagnosis: An Experience Report

机译:使运行时数据对事件诊断有用:一份经验报告

获取原文
获取外文期刊封面目录资料

摘要

Important and critical aspects of technical debt often surface at runtime only and are difficult to measure statically. This is a particular challenge for cloud applications because of their highly distributed nature. Fortunately, mature frameworks for collecting runtime data exist but need to be integrated. In this paper, we report an experience from a project that implements a cloud application within Kubernetes on Azure. To analyze the runtime data of this software system, we instrumented our services with Zipkin for distributed tracing; with Prometheus and Grafana for analyzing metrics; and with fluentd, Elasticsearch and Kibana for collecting, storing and exploring log files. However, project team members did not utilize these runtime data until we created a unified and simple access using a chat bot. We argue that even though your project collects runtime data, this is not sufficient to guarantee its usage: In order to be useful, a simple, unified access to different data sources is required that should be integrated into tools that are commonly used by team members.
机译:技术债务的重要和关键方面通常仅在运行时浮出水面,并且很难进行静态衡量。由于云应用程序的高度分布式性质,这是一个特殊的挑战。幸运的是,已经存在用于收集运行时数据的成熟框架,但需要对其进行集成。在本文中,我们报告了一个项目的经验,该项目在Azure上的Kubernetes中实现了云应用程序。为了分析该软件系统的运行时数据,我们使用Zipkin对服务进行了检测,以进行分布式跟踪。与Prometheus和Grafana一起分析指标;并与流利的Elasticsearch和Kibana一起用于收集,存储和浏览日志文件。但是,直到我们使用聊天机器人创建了统一而简单的访问权限之后,项目团队成员才利用这些运行时数据。我们认为,即使您的项目收集了运行时数据,也不足以保证其使用:为了有用,需要对不同数据源进行简单,统一的访问,并且应将其集成到团队成员常用的工具中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号