首页> 外文会议>Asia-Pacific Software Engineering Conference >An Empirical Study on Real Bugs for Machine Learning Programs
【24h】

An Empirical Study on Real Bugs for Machine Learning Programs

机译:机器学习程序中的实际错误的实证研究

获取原文

摘要

Due to the availability of various open source Machine Learning (ML) tools and libraries, developers nowadays can easily implement their purposes by just invoking machine learning APIs without knowing the details of the algorithm. However, the owners of ML tools and libraries usually pay more attention to the correctness and functionality of their algorithm, while spending much less effort on maintaining their code and keeping their code at a high quality level. Considering the popularity of machine learning in today's world, low quality ML tools and libraries can have a huge impact on the software products that use ML algorithms. So in this paper, we conduct an empirical study on real machine learning bugs to examine their patterns and how they evolve over time. We collect three popular machine learning projects on Github, and manually analyzed 329 closed bugs from the perspectives of their bug category, fix pattern, fix scale, fix duration, and type of software maintenance. The results show that (1) there are seven categories of bugs in machine learning programs; (2) twelve different fix patterns are commonly used to fix the bugs; (3) 63.83% of the patches belong to micro-scale-fix and small-scale-fix, and 68.39% of the bugs are fixed within one month; (4) 47.77% of the bug fixes belong to corrective activity from the view of software maintenance.
机译:由于各种开源机器学习(ML)工具和库的可用性,如今的开发人员仅通过调用机器学习API即可轻松实现其目的,而无需了解算法的细节。但是,ML工具和库的所有者通常会更加关注其算法的正确性和功能,而在维护代码和保持代码高质量方面的工作却要少得多。考虑到当今世界机器学习的普及,低质量的机器学习工具和库可能会对使用机器学习算法的软件产品产生巨大影响。因此,在本文中,我们对真实的机器学习错误进行了实证研究,以检查它们的模式以及它们随着时间的演变。我们在Github上收集了三个流行的机器学习项目,并从它们的错误类别,修复模式,修复规模,修复持续时间和软件维护类型的角度手动分析了329个已关闭的bug。结果表明:(1)机器学习程序中有七类错误; (2)通常使用十二种不同的修复模式来修复错误; (3)63.83 \%的补丁属于小规模修复和小规模修复,其中68.39 \%的错误在一个月内得到修复; (4)从软件维护的角度来看,47.77%的错误修复属于更正活动。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号