首页> 外文会议>Asia-Pacific Software Engineering Conference >An Empirical Study on Real Bugs for Machine Learning Programs
【24h】

An Empirical Study on Real Bugs for Machine Learning Programs

机译:机器学习计划真正虫子的实证研究

获取原文

摘要

Due to the availability of various open source Machine Learning (ML) tools and libraries, developers nowadays can easily implement their purposes by just invoking machine learning APIs without knowing the details of the algorithm. However, the owners of ML tools and libraries usually pay more attention to the correctness and functionality of their algorithm, while spending much less effort on maintaining their code and keeping their code at a high quality level. Considering the popularity of machine learning in today's world, low quality ML tools and libraries can have a huge impact on the software products that use ML algorithms. So in this paper, we conduct an empirical study on real machine learning bugs to examine their patterns and how they evolve over time. We collect three popular machine learning projects on Github, and manually analyzed 329 closed bugs from the perspectives of their bug category, fix pattern, fix scale, fix duration, and type of software maintenance. The results show that (1) there are seven categories of bugs in machine learning programs; (2) twelve different fix patterns are commonly used to fix the bugs; (3) 63.83% of the patches belong to micro-scale-fix and small-scale-fix, and 68.39% of the bugs are fixed within one month; (4) 47.77% of the bug fixes belong to corrective activity from the view of software maintenance.
机译:由于各种开源机器学习(ML)工具和库的可用性,现在开发人员可以通过仅在不知道算法的细节的情况下通过调用机器学习API来轻松实现目的。然而,ML工具和库车主平时多注意自己的算法的正确性和功能,同时花费更少精力维护他们的代码,并保持他们的代码在一个较高的质量水平。考虑到当今世界的机器学习的普及,低质量的ML工具和图书馆对使用ML算法的软件产品产生巨大影响。因此,在本文中,我们对真实机器学习虫进行了实证研究,以检查其模式以及它们如何随着时间的推移而发展。我们收集在Github上三种流行的机器学习项目,并手动分析了他们的错误分类的角度来看,修复模式,修复规模,修复时间,以及软件维护型329个修正的bug。结果表明(1)机器学习计划中有七个类别的错误; (2)十二种不同的修复模式通常用于解决错误; (3)63.83 %的补丁属于微尺度 - 修复和小规模 - 修复,68.39%的错误在一个月内修复; (4)47.77 %的错误修复属于软件维护视图中的纠正活动。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号