首页> 外文会议>2016 IEEE/ACM 38th IEEE International Conference on Software Engineering >Automatically Learning Semantic Features for Defect Prediction
【24h】

Automatically Learning Semantic Features for Defect Prediction

机译:自动学习语义特征以进行缺陷预测

获取原文
获取原文并翻译 | 示例

摘要

Software defect prediction, which predicts defective code regions, can help developers find bugs and prioritize their testing efforts. To build accurate prediction models, previous studies focus on manually designing features that encode the characteristics of programs and exploring different machine learning algorithms. Existing traditional features often fail to capture the semantic differences of programs, and such a capability is needed for building accurate prediction models. To bridge the gap between programs' semantics and defect prediction features, this paper proposes to leverage a powerful representation-learning algorithm, deep learning, to learn semantic representation of programs automatically from source code. Specifically, we leverage Deep Belief Network (DBN) to automatically learn semantic features from token vectors extracted from programs' Abstract Syntax Trees (ASTs). Our evaluation on ten open source projects shows that our automatically learned semantic features significantly improve both within-project defect prediction (WPDP) and cross-project defect prediction (CPDP) compared to traditional features. Our semantic features improve WPDP on average by 14.7% in precision, 11.5% in recall, and 14.2% in F1. For CPDP, our semantic features based approach outperforms the state-of-the-art technique TCA+ with traditional features by 8.9% in F1.
机译:预测缺陷代码区域的软件缺陷预测可以帮助开发人员发现错误并确定测试工作的优先级。为了建立准确的预测模型,以前的研究集中于手动设计对程序特征进行编码的特征,并探索不同的机器学习算法。现有的传统功能通常无法捕获程序的语义差异,因此需要这种功能来构建准确的预测模型。为了弥合程序语义和缺陷预测功能之间的差距,本文提出利用功能强大的表示学习算法深度学习从源代码自动学习程序的语义表示。具体来说,我们利用深度信任网络(DBN)从从程序的抽象语法树(AST)中提取的令牌向量中自动学习语义特征。我们对十个开源项目的评估表明,与传统功能相比,我们自动学习的语义功能显着改善了项目内缺陷预测(WPDP)和跨项目缺陷预测(CPDP)。我们的语义特征将WPDP的平均精度提高了14.7%,召回率提高了11.5%,F1提升了14.2%。对于CPDP,我们基于语义特征的方法在F1中的表现优于传统特征的最新技术TCA +(8.9%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号