首页> 外文期刊>IEEE Transactions on Software Engineering >Bridging Semantic Gaps between Natural Languages and APIs with Word Embedding
【24h】

Bridging Semantic Gaps between Natural Languages and APIs with Word Embedding

机译:桥接自然语言和API之间的语义间隙,用词嵌入

获取原文
获取原文并翻译 | 示例
           

摘要

Developers increasingly rely on text matching tools to analyze the relation between natural language words and APIs. However, semantic gaps, namely textual mismatches between words and APIs, negatively affect these tools. Previous studies have transformed words or APIs into low-dimensional vectors for matching; however, inaccurate results were obtained due to the failure of modeling words and APIs simultaneously. To resolve this problem, two main challenges are to be addressed: the acquisition of massive words and APIs for mining and the alignment of words and APIs for modeling. Therefore, this study proposes Word2API to effectively estimate relatedness of words and APIs. Word2API collects millions of commonly used words and APIs from code repositories to address the acquisition challenge. Then, a shuffling strategy is used to transform related words and APIs into tuples to address the alignment challenge. Using these tuples, Word2API models words and APIs simultaneously. Word2API outperforms baselines by 10-49.6 percent of relatedness estimation in terms of precision and NDCG. Word2API is also effective on solving typical software tasks, e.g., query expansion and API documents linking. A simple system with Word2API-expanded queries recommends up to 21.4 percent more related APIs for developers. Meanwhile, Word2API improves comparison algorithms by 7.9-17.4 percent in linking questions in Question&Answer communities to API documents.
机译:开发人员越来越依赖文本匹配工具来分析自然语言单词和API之间的关系。然而,语义间隙,即文字和API之间的文本不匹配,对这些工具产生负面影响。以前的研究将单词或API转化为用于匹配的低维向量;然而,由于同时建模词和API的失败而获得了不准确的结果。为了解决这个问题,要解决两个主要挑战:收购挖掘和对齐的挖掘和API的建模和API。因此,本研究提出了Word2API,从而有效地估计了单词和API的相关性。 Word2api收集数百万常用的单词和API从代码存储库来解决收购挑战。然后,使用次组策略用于将相关的单词和API转换为元组以解决对齐挑战。使用这些元组,同时使用Word2api模型单词和API。在精确和NDCG方面,Word2api以10-49.6%的相关性估计优于基线。 Word2api还有效地解决典型的软件任务,例如查询扩展和API文档链接。具有Word2api-admobed查询的简单系统建议为开发人员提供高达21.4%的相关API。同时,Word2api将比较算法提高7.9-17.4%,以将问题和应答社区的问题与API文件联系起来。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号