首页> 外文会议>Safety-Critical Systems Symposium >Quantifying Dataset Properties for Systematic Artificial Neural Network Classifier Verification
【24h】

Quantifying Dataset Properties for Systematic Artificial Neural Network Classifier Verification

机译:量化数据集属性,用于系统人工神经网络分类器验证

获取原文

摘要

Autonomous systems make use of a suite of algorithms for understanding the environment in which they are deployed. These algorithms typically solve one or more classic problems, such as classification, prediction and detection. This is a key step in making independent decisions in order to accomplish a set of objectives. Artificial neural networks (ANNs) are one such class of algorithms, which have shown great promise in view of their apparent ability to learn the complicated patterns underlying high-dimensional data. The decision boundary approximated by such networks is highly non-linear and difficult to interpret, which is particularly problematic in cases where these decisions can compromise the safety of either the system itself, or people. Furthermore, the choice of data used to prepare and test the network can have a dramatic impact on performance (e.g. misclassification) and consequently safety. In this paper, we introduce a novel measure for quantifying the difference between the datasets used for training ANN-based object classification algorithms, and the test datasets used for verifying and evaluating classifier performance. This measure allows performance metrics to be placed into context by characterizing the test datasets employed for evaluation. A system requirement could specify the permitted form of the functional relationship between ANN classifier performance and the dissimilarity between training and test datasets. The novel measure is empirically assessed using publicly available datasets.
机译:自主系统利用一套算法,以了解部署它们的环境。这些算法通常求解一个或多个经典问题,例如分类,预测和检测。这是制定独立决策以实现一系列目标的关键步骤。人工神经网络(ANNS)是一种这样的类别算法,鉴于他们的明显能力学习了高维数据的复杂模式的表观能力,这是非常好的。由这种网络近似的决策边界是高度非线性的并且难以解释,在这些决策可以损害系统本身或人的安全性的情况下,这是特别问题的。此外,用于准备和测试网络的数据的选择可以对性能(例如错误分类)具有显着影响,从而产生安全性。在本文中,我们介绍了一种用于量化用于训练基于ANN的对象分类算法的数据集之间的差异的新测量,以及用于验证和评估分类器性能的测试数据集。该措施允许通过表征用于评估的测试数据集来将性能度量放入上下文中。系统要求可以指定ANN分类器性能与训练和测试数据集之间的不同不相似性的允许形式。使用公共数据集进行经验评估新的措施。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号