首页> 外文OA文献 >MSR4SM: Using topic models to effectively mining software repositories for software maintenance tasks
【2h】

MSR4SM: Using topic models to effectively mining software repositories for software maintenance tasks

机译:MSR4SM:使用主题模型有效地挖掘软件存储库以进行软件维护任务

摘要

Abstract Context Mining software repositories has emerged as a research direction over the past decade, achieving substantial success in both research and practice to support various software maintenance tasks. Software repositories include bug repository, communication archives, source control repository, etc. When using these repositories to support software maintenance, inclusion of irrelevant information in each repository can lead to decreased effectiveness or even wrong results. Objective This article aims at selecting the relevant information from each of the repositories to improve effectiveness of software maintenance tasks. Method For a maintenance task at hand, maintainers need to implement the maintenance request on the current system. In this article, we propose an approach, MSR4SM, to extract the relevant information from each software repository based on the maintenance request and the current system. That is, if the information in a software repository is relevant to either the maintenance request or the current system, this information should be included to perform the current maintenance task. MSR4SM uses the topic model to extract the topics from these software repositories. Then, relevant information in each software repository is extracted based on the topics. Results MSR4SM is evaluated for two software maintenance tasks, feature location and change impact analysis, which are based on four subject systems, namely jEdit, ArgoUML, Rhino and KOffice. The empirical results show that the effectiveness of traditional software repositories based maintenance tasks can be greatly improved by MSR4SM. Conclusions There is a lot of irrelevant information in software repositories. Before we use them to implement a maintenance task at hand, we need to preprocess them. Then, the effectiveness of the software maintenance tasks can be improved.
机译:摘要上下文采矿软件存储库已成为过去十年的研究方向,在支持各种软件维护任务的研究和实践中均取得了巨大成功。软件存储库包括错误存储库,通信存储库,源代码控制存储库等。使用这些存储库支持软件维护时,在每个存储库中包含不相关的信息可能会导致有效性降低甚至错误的结果。目的本文旨在从每个存储库中选择相关信息,以提高软件维护任务的效率。方法对于即将进行的维护任务,维护人员需要在当前系统上实施维护请求。在本文中,我们提出了一种方法MSR4SM,它基于维护请求和当前系统从每个软件存储库中提取相关信息。也就是说,如果软件存储库中的信息与维护请求或当前系统相关,则应包括此信息以执行当前维护任务。 MSR4SM使用主题模型从这些软件存储库中提取主题。然后,根据主题提取每个软件存储库中的相关信息。结果对MSR4SM进行了两项软件维护任务(功能部件定位和变更影响分析)的评估,这些任务基于jEdit,ArgoUML,Rhino和KOffice这四个主题系统。实验结果表明,MSR4SM可以大大提高基于传统软件存储库的维护任务的效率。结论软件存储库中有许多不相关的信息。在使用它们执行手头的维护任务之前,我们需要对其进行预处理。然后,可以提高软件维护任务的效率。

著录项

  • 作者

    Sun X; Li B; Leung H; Li Y;

  • 作者单位
  • 年度 2015
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号