【24h】

Modeling changeset topics for feature location

机译:为功能集建模变更集主题

获取原文

摘要

Feature location is a program comprehension activity in which a developer inspects source code to locate the classes or methods that implement a feature of interest. Many feature location techniques (FLTs) are based on text retrieval models, and in such FLTs it is typical for the models to be trained on source code snapshots. However, source code evolution leads to model obsolescence and thus to the need to retrain the model from the latest snapshot. In this paper, we introduce a topic-modeling-based FLT in which the model is built incrementally from source code history. By training an online learning algorithm using changesets, the FLT maintains an up-to-date model without incurring the non-trivial computational cost associated with retraining traditional FLTs. Overall, we studied over 600 defects and features from 4 open-source Java projects. We also present a historical simulation that demonstrates how the FLT performs as a project evolves. Our results indicate that the accuracy of a changeset-based FLT is similar to that of a snapshot-based FLT, but without the retraining costs.
机译:功能位置是程序理解活动,开发人员在其中检查源代码以找到实现感兴趣功能的类或方法。许多特征定位技术(FLT)基于文本检索模型,在此类FLT中,通常需要在源代码快照上训练模型。但是,源代码的演变导致模型过时,因此需要从最新快照中重新训练模型。在本文中,我们介绍了一个基于主题建模的FLT,其中该模型是从源代码历史中逐步构建的。通过使用变更集训练在线学习算法,FLT可以保持最新模型,而不会产生与重新训练传统FLT相关的不小的计算成本。总体而言,我们研究了来自4个开源Java项目的600多个缺陷和功能。我们还提供了一个历史模拟,该模拟演示了FLT在项目发展过程中的表现。我们的结果表明,基于变更集的FLT的准确性类似于基于快照的FLT的准确性,但是没有重新培训的成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号