首页> 外文会议>IEEE International Conference on Software Testing, Validation and Verification >Callisto: Entropy-based Test Generation and Data Quality Assessment for Machine Learning Systems
【24h】

Callisto: Entropy-based Test Generation and Data Quality Assessment for Machine Learning Systems

机译:Callisto:用于机器学习系统的基于熵的测试生成和数据质量评估

获取原文

摘要

Machine Learning (ML) has seen massive progress in the last decade and as a result, there is a pressing need for validating ML-based systems. To this end, we propose, design and evaluate CALLISTO– a novel test generation and data quality assessment framework. To the best of our knowledge, CALLISTO is the first black box framework to leverage the uncertainty in the prediction and systematically generate new test cases for ML classifiers. Our evaluation of CALLISTO on four real world data sets reveals thousands of errors. We also show that leveraging the uncertainty in prediction can increase the number of erroneous test cases up to a factor of 20, as compared to when no such knowledge is used for testing.CALLISTO has the capability to detect low quality data in the datasets that may contain mislabelled data. We conduct and present an extensive user study to validate the results of CALLISTO on identifying low quality data from four state-of-the-art real world datasets.
机译:在过去的十年中,机器学习(ML)取得了长足的进步,因此,迫切需要验证基于ML的系统。为此,我们提出,设计和评估CALLISTO –一种新颖的测试生成和数据质量评估框架。据我们所知,CALLISTO是第一个利用预测中的不确定性并为ML分类器生成新测试用例的黑盒框架。我们对四个真实数据集的CALLISTO评估显示出数千个错误。我们还表明,与不使用此类知识进行测试的情况相比,利用预测中的不确定性可以将错误测试用例的数量增加到20倍。包含标签错误的数据。我们进行并进行了广泛的用户研究,以验证CALLISTO从四个最新的现实世界数据集中识别出低质量数据的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号