首页> 外文会议>International Conference on Software Maintenance and Evolution >Developing a model of loop actions by mining loop characteristics from a large code corpus
【24h】

Developing a model of loop actions by mining loop characteristics from a large code corpus

机译:通过从大型代码语料库中挖掘循环特征来开发循环动作模型

获取原文

摘要

Some high level algorithmic steps require more than one statement to implement, but are not large enough to be a method on their own. Specifically, many algorithmic steps (e.g., count, compare pairs of elements, find the maximum) are implemented as loop structures, which lack the higher level abstraction of the action being performed, and can negatively affect both human readers and automatic tools. Additionally, in a study of 14,317 projects, we found that less than 20% of loops are documented to help readers. In this paper, we present a novel automatic approach to identify the high level action implemented by a given loop. We leverage the available, large source of high-quality open source projects to mine loop characteristics and develop an action identification model. We use the model and feature vectors extracted from loop code to automatically identify the high level actions implemented by loops. We have evaluated the accuracy of the loop action identification and coverage of the model over 7159 open source programs. The results show great promise for this approach to automatically insert internal comments and provide additional higher level naming for loop actions to be used by tools such as code search.
机译:一些高级算法步骤需要一个以上的语句才能实现,但它们本身不足以成为一个方法。具体地,许多算法步骤(例如,计数,比较元素对,找到最大值)被实现为循环结构,其缺少对所执行的动作的更高级别的抽象,并且可能对人类阅读器和自动工具产生负面影响。另外,在对14,317个项目的研究中,我们发现记录不到20%的循环可以帮助读者。在本文中,我们提出了一种新颖的自动方法来识别由给定循环实现的高级操作。我们利用高质量开源项目的大量可用资源来挖掘循​​环特征并开发动作识别模型。我们使用从循环代码中提取的模型和特征向量来自动识别由循环实现的高级动作。我们评估了7159个开源程序中循环动作识别和模型覆盖的准确性。结果表明,这种方法有望自动插入内部注释,并为诸如代码搜索之类的工具所使用的循环操作提供更多高级命名。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号