首页> 外文会议>IEEE/ACM International Conference on Automated Software Engineering >Automatically generating commit messages from diffs using neural machine translation
【24h】

Automatically generating commit messages from diffs using neural machine translation

机译:使用神经机翻译自动生成来自Diff的提交消息

获取原文

摘要

Commit messages are a valuable resource in comprehension of software evolution, since they provide a record of changes such as feature additions and bug repairs. Unfortunately, programmers often neglect to write good commit messages. Different techniques have been proposed to help programmers by automatically writing these messages. These techniques are effective at describing what changed, but are often verbose and lack context for understanding the rationale behind a change. In contrast, humans write messages that are short and summarize the high level rationale. In this paper, we adapt Neural Machine Translation (NMT) to automatically "translate" diffs into commit messages. We trained an NMT algorithm using a corpus of diffs and human-written commit messages from the top 1k Github projects. We designed a filter to help ensure that we only trained the algorithm on higher-quality commit messages. Our evaluation uncovered a pattern in which the messages we generate tend to be either very high or very low quality. Therefore, we created a quality-assurance filter to detect cases in which we are unable to produce good messages, and return a warning instead.
机译:提交消息是理解软件演变的宝贵资源,因为它们提供了更改的记录,例如功能添加和错误维修。不幸的是,程序员经常忽略编写良好的提交消息。已经提出了通过自动编写这些消息来帮助程序员来帮助编程器。这些技术在描述改变的情况下是有效的,但通常是冗长的,并且缺乏了解改变背后理由的背景。相比之下,人类写出短暂并总结高级理由的信息。在本文中,我们将神经机器翻译(NMT)调整为自动“转换”差异为提交消息。我们使用来自顶级1K GitHub项目的Diffs和Humb-Lored Comment消息的语料库培训了NMT算法。我们设计过滤器,以帮助确保我们只在更高质量的提交消息上培训算法。我们的评估发现了一种模式,其中我们产生的消息往往是非常高或非常低的质量。因此,我们创建了一个质量保证滤波器,以检测我们无法生成好消息的案例,并返回警告。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号