...
首页> 外文期刊>Empirical Software Engineering >Wikifying software artifacts
【24h】

Wikifying software artifacts

机译:Wikify软件工件

获取原文
获取原文并翻译 | 示例
           

摘要

Context The computational linguistics community has developed tools, called wikifiers, to identify links to Wikipedia articles from free-form text. Software engineering research can leverage wikifiers to add semantic information to software artifacts. However, no empirically-grounded basis exists to choose an effective wikifier and to configure it for the software domain, on which wikifiers were not specifically trained. Objective We conducted a study to guide the selection of a wikifier and its configuration for applications in the software domain, and to measure what performance can be expected of wikifiers. Method We applied six wikifiers, with multiple configurations, to a sample of 500 Stack Overflow posts. We manually annotated the 41 124 articles identified by the wikifiers as correct or not to compare their precision and recall. Results Each wikifier, in turn, achieved the highest precision, between 13% and 82%, for different thresholds of recall, from 60% to 5%. However, filtering the wikifiers' output with a whitelist can considerably improve the precision above 79% for recall up to 30%, and above 47% for recall up to 60%. Conclusions Results reported in each wikifier's original article cannot be generalized to software-specific documents. Given that no wikifier performs universally better than all others, we provide empirically grounded insights to select a wikifier for different scenarios, and suggest ways to further improve their performance for the software domain.
机译:背景信息计算语言学社区已开发出名为Wikifiers的工具,从自由形式文本中识别与维基百科文章的链接。软件工程研究可以利用Wikifiers向软件工件添加语义信息。但是,没有经验接地的基础是为了选择有效的Wikifier并为软件域配置它,其中无线电器没有专门培训。目的我们进行了一项研究来指导选择Wikifier及其配置在软件领域中的应用程序,并测量可以预期Wikifiers的表现。方法我们应用了六个Wikifiers,具有多种配置,到500堆栈溢出柱的样本。我们手动注释了由Wikifiers确定的41个124条制品,如纠正,或不进行比较他们的精确和召回。结果每个Wikifier又实现了最高精度,而不同的召回阈值达到了最高精度,从60%到5%。然而,过滤使用白名单的Wikifiers的输出可以大大提高79%的精度,召回高于30%,高于47%,召回高达60%。结论在每个Wikifier原始文章中报告的结果不能推广到特定于软件的文件。考虑到Wikifier普遍优于所有其他Wikifier,我们提供了针对不同方案的Wikifier提供了经验接地的洞察,并建议进一步提高软件域的性能的方法。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号