首页> 外文会议>2011 27th IEEE International Conference on Software Maintenance >A comparison of stemmers on source code identifiers for software search
【24h】

A comparison of stemmers on source code identifiers for software search

机译:词干比较器在源代码标识符上进行软件搜索

获取原文

摘要

As the popularity of text-based source code analysis grows, the use of stemmers to strip suffixes has increased. Stemmers have been used to more accurately determine relevance between a keyword query and methods in source code for search, exploration, and bug localization. In this paper, we investigate which traditional stemmers perform best on the domain of software, specifically, Java source code. We compare the stemmers using two case studies: a comparative analysis of the unified word classes in terms of accuracy and completeness, as well as an investigation into the effectiveness of stemming for software search. Our results indicate that relative stemmer effectiveness varies with a software engineering tool such as search, justifying further research into this area.
机译:随着基于文本的源代码分析的流行,使用词干分析器剥离后缀的使用也越来越多。词干分析器已被用来更准确地确定关键字查询与源代码中用于搜索,探索和错误本地化的方法之间的相关性。在本文中,我们研究了哪些传统词干分析器在软件领域(特别是Java源代码)上表现最佳。我们使用两个案例研究来比较词干:在准确性和完整性方面对统一词类进行比较分析,以及对词干对软件搜索的有效性进行调查。我们的结果表明,相对词干提取器的有效性随诸如搜索之类的软件工程工具的不同而不同,因此有理由对该领域进行进一步的研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号