首页> 外文会议>IEEE/ACM International Conference on Automated Software Engineering >Automatically generating commit messages from diffs using neural machine translation
【24h】

Automatically generating commit messages from diffs using neural machine translation

机译:使用神经机器翻译从差异自动生成提交消息

获取原文

摘要

Commit messages are a valuable resource in comprehension of software evolution, since they provide a record of changes such as feature additions and bug repairs. Unfortunately, programmers often neglect to write good commit messages. Different techniques have been proposed to help programmers by automatically writing these messages. These techniques are effective at describing what changed, but are often verbose and lack context for understanding the rationale behind a change. In contrast, humans write messages that are short and summarize the high level rationale. In this paper, we adapt Neural Machine Translation (NMT) to automatically "translate" diffs into commit messages. We trained an NMT algorithm using a corpus of diffs and human-written commit messages from the top 1k Github projects. We designed a filter to help ensure that we only trained the algorithm on higher-quality commit messages. Our evaluation uncovered a pattern in which the messages we generate tend to be either very high or very low quality. Therefore, we created a quality-assurance filter to detect cases in which we are unable to produce good messages, and return a warning instead.
机译:提交消息是理解软件发展的宝贵资源,因为它们提供了更改记录,例如功能添加和错误修复。不幸的是,程序员经常忽略编写好的提交消息。已经提出了不同的技术来通过自动编写这些消息来帮助程序员。这些技术可以有效地描述更改的内容,但通常很冗长,并且缺乏用于理解更改背后原因的上下文。相反,人类编写的消息简短,并概括了高层次的基本原理。在本文中,我们采用了神经机器翻译(NMT)来将差异自动“翻译”为提交消息。我们使用了来自前1k个Github项目的差异和人工编写的提交消息的语料库训练了NMT算法。我们设计了一个过滤器,以帮助确保仅对更高质量的提交消息进行算法训练。我们的评估发现了一种模式,在这种模式下,我们生成的消息往往是非常高或非常低的质量。因此,我们创建了一个质量保证筛选器,以检测无法生成良好消息的情况,并返回警告。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号