首页> 外文会议>Asian Symposium on Programming Languages and Systems >Automatically Generating Descriptive Texts in Logging Statements: How Far Are We?
【24h】

Automatically Generating Descriptive Texts in Logging Statements: How Far Are We?

机译:在日志记录语句中自动生成描述性文本:我们有多远?

获取原文

摘要

In most cases, logs are the only accurate information available for administrators to understand system behavior and diagnose failure root causes. However, due to the lack of well-defined logging guidance, it is challenging for developers to decide what to log, especially logging statements that contain descriptive texts and variables. In this paper, we explore automatically generation of descriptive texts in logging statements and evaluate the effectiveness of various automatic generation methods. We propose that to generate descriptive texts in logging statements can be transferred as a retrieval-based Q&A task. According to the roles of query and answer, we design two retrieval strategies including Code&Code and Code&Log. To measure the similarity between the query and answer, we utilize two types of retrieval algorithms including Information retrieval-based and neural networks-based algorithms. We conduct a systematic analysis of various retrieval algorithms under different retrieval strategies in terms of their effectiveness, and assess their accuracy using the automatic metrics and human evaluation during which 5 instructive findings are presented. We believe that these findings can provide potential implications for both researchers and practitioners for relevant research. Moreover, we construct and release a log text dataset containing over 138K valid log texts from 85 Java projects in Apache ecosystem for future logging statement analysis and generation.
机译:在大多数情况下,日志是管理员可以理解系统行为和诊断失败根原因的唯一可用的准确信息。但是,由于缺乏定义明确的日志记录指导,开发人员对开发人员来说是挑战,以确定要记录的内容,尤其是记录包含描述性文本和变量的日期语句。在本文中,我们在日志记录语句中自动生成描述性文本,并评估各种自动生成方法的有效性。我们建议在记录语句中生成描述性文本,可以作为基于检索的Q&A任务传输。根据查询和答案的角色,我们设计了两种检索策略,包括代码和代码和代码和日志。为了测量查询和答案之间的相似性,我们利用了两种类型的检索算法,包括信息检索和基于神经网络的算法。在其有效性方面,我们对不同检索策略进行了各种检索算法的系统分析,并使用自动度量和人类评估来评估其准确性,在此期间提出了5个有效性调查结果。我们认为,这些调查结果可以为相关研究的研究人员和从业者提供潜在的影响。此外,我们构建和释放一个日志文本数据集,其中包含超过85个Java项目中的超过138k的有效日志文本,以获取未来的日志记录语句分析和生成。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号