【24h】

Modeling changeset topics for feature location

机译:用于特征位置的模型主题建模

获取原文

摘要

Feature location is a program comprehension activity in which a developer inspects source code to locate the classes or methods that implement a feature of interest. Many feature location techniques (FLTs) are based on text retrieval models, and in such FLTs it is typical for the models to be trained on source code snapshots. However, source code evolution leads to model obsolescence and thus to the need to retrain the model from the latest snapshot. In this paper, we introduce a topic-modeling-based FLT in which the model is built incrementally from source code history. By training an online learning algorithm using changesets, the FLT maintains an up-to-date model without incurring the non-trivial computational cost associated with retraining traditional FLTs. Overall, we studied over 600 defects and features from 4 open-source Java projects. We also present a historical simulation that demonstrates how the FLT performs as a project evolves. Our results indicate that the accuracy of a changeset-based FLT is similar to that of a snapshot-based FLT, but without the retraining costs.
机译:特征位置是一个程序理解活动,其中开发人员检查源代码以找到实现感兴趣的特征的类别或方法。许多特征定位技术(FLTS)基于文本检索模型,并且在这种FLTS中,它是在源代码快照上培训的模型的典型。但是,源代码演进导致模型过时,因此需要从最新快照中恢复模型。在本文中,我们介绍了一个基于主题建模的FLT,其中模型从源代码历史记录逐步构建。通过使用变频器培训在线学习算法,FLT维护最新模型,而不会产生与刷新传统FLTS相关的非琐碎的计算成本。总体而言,我们研究了4个开源Java项目的600多个缺陷和功能。我们还提出了一个历史模拟,展示了FLT如何随着项目的发展而表现。我们的结果表明,基于混乱的FLT的准确性类似于基于快照的FLT的精度,但没有润滑成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号