首页> 外文期刊>The Journal of Systems and Software >Enabling improved IR-based feature location
【24h】

Enabling improved IR-based feature location

机译:启用改进的基于IR的特征定位

获取原文
获取原文并翻译 | 示例
           

摘要

Recent solutions to software engineering problems have incorporated tools and techniques from information retrieval (IR). The use of IR requires choosing an appropriate retrieval model and deciding on a query that best captures a particular information need. Taking feature location as a representative example, three research questions are investigated: (1) the impact of query preprocessing, (2) the impact that different scraping techniques for queries have on retrieval performance, (3) the performance impact that the underlying retrieval model has on identifying the correct source-code functions (the correct documents). These research questions are addressed using the five open source projects released as part of the SEMERU dataset. In the experiments, five methods of scraping queries from modification requests and seven retrieval model instances are considered. Using the standard evaluation metric Mean Reciprocal Rank (MRR), the experimental analysis reveals that better retrieval models are not the ones commonly used by software engineering researchers. Results find that models based on query-likelihood perform about twice as well as models in common use in software engineering such as LSI and thus deserve greater attention. Furthermore, corpus preprocessing has a significant impact as the top performing setting is over 100% better than the average.
机译:针对软件工程问题的最新解决方案已结合了信息检索(IR)的工具和技术。使用IR需要选择适当的检索模型,并确定最能满足特定信息需求的查询。以特征位置为代表,研究了三个研究问题:(1)查询预处理的影响;(2)不同的抓取技术对查询的影响对检索性能的影响;(3)基础检索模型对性能的影响可以识别正确的源代码功能(正确的文档)。这些研究问题通过SEMERU数据集的一部分发布的五个开源项目得到解决。在实验中,考虑了从修改请求中抓取查询的五种方法和七个检索模型实例。使用标准评估度量平均均值排名(MRR),实验分析表明,较好的检索模型不是软件工程研究人员常用的模型。结果发现,基于查询可能性的模型的性能大约是LSI等软件工程中常用模型的两倍,因此值得关注。此外,语料库预处理具有显着影响,因为表现最佳的设置比平均水平高出100%以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号