...
首页> 外文期刊>Intelligent data analysis >Source code-based defect prediction using deep learning and transfer learning
【24h】

Source code-based defect prediction using deep learning and transfer learning

机译:基于源代码的缺陷预测使用深度学习和转移学习

获取原文
获取原文并翻译 | 示例

摘要

Ensuring the quality of software products is important for them to be successful. Discovering errors and fixing defective software modules early in the project lifecycle (e.g. in the testing phase) can save resources and enhance software quality. Developers should prioritize testing procedures and continuously maintain their software projects; however, when there are few instances of a new project, it is hard to build an accurate defect prediction model. Different information about software projects is available and can be utilized through open repositories. Developers can leverage the labeled defect information to build a defect prediction model. The abundance of historical software information in similar domains can assist in transferring the knowledge gained from training this information to other domains for cross-project defect prediction models. Deep learning is a promising machine learner. Deep Belief network (DBN) is a deep learning algorithm that can discover latent relationships between input features by training them through multi-hidden layers; however, it is difficult to build a good prediction model from a dataset with few modules or instances. In this research, we utilized auxiliary datasets to initialize a DBN model and transfer the obtained knowledge to train the DBN model using a source project in a cross-project combination. The expressive features generated from the DBN model are used to build a classical classifier from the source class label and test it on other target project instances. Our evaluation of 13 open Java projects from the PROMISE repository shows that our proposed model achieves improvements based on F-measures (3.6%, 4.9%, and 5.1%) for the three settings of the DBN model measured against the best used benchmark model of TCA/TCA+ techniques. Moreover, T_DBN and DBN_Only models achieve improvement in terms of F-measure by (11.1% and 6.2%) against the best used benchmark model of TCA/TCA+ on Relink validation dataset.
机译:确保软件产品的质量对于他们来说很重要。在项目生命周期中早期发现错误和修复缺陷的软件模块(例如,在测试阶段)可以节省资源并提高软件质量。开发人员应优先考虑测试程序并不断维护其软件项目;但是,当新项目的情况下很少时,很难建立一个准确的缺陷预测模型。有关软件项目的不同信息可用,可通过开放存储库使用。开发人员可以利用标记的缺陷信息来构建缺陷预测模型。类似域中的历史软件信息的丰富可以帮助将从培训此信息的知识转移到跨项目缺陷预测模型的其他域中。深度学习是一个有前途的机器学习者。深度信仰网络(DBN)是一种深入学习算法,可以通过通过多隐藏层训练它们之间的输入特征之间的潜在关系;但是,很难从具有少数模块或实例的数据集构建一个良好的预测模型。在本研究中,我们利用辅助数据集来初始化DBN模型,并将所获得的知识传输以使用跨项目组合中的源项目训练DBN模型。从DBN模型生成的表达功能用于从源类标签构建一个经典分类器,并在其他目标项目实例上测试它。我们对Promise存储库的13个Open Java项目的评估表明,我们的拟议模型基于F-D测量(3.6%,4.9%和5.1%)实现了针对最佳使用基准模型的DBN模型的三种设置的改进TCA / TCA +技术。此外,T_DBN和DBN_ONLY模型通过(11.1%和6.2%)在Relink验证数据集上的最佳使用基准模型(11.1%和6.2%)的F-Measure(11.1%和6.2%)实现改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号