首页> 外文会议>Discovery science >Controlled Permutations for Testing Adaptive Classifiers
【24h】

Controlled Permutations for Testing Adaptive Classifiers

机译:用于测试自适应分类器的受控排列

获取原文
获取原文并翻译 | 示例

摘要

We study evaluation of online classifiers that are designed to adapt to changes in data distribution over time (concept drift). A standard procedure to evaluate such classifiers is the test-then-train, which iteratively uses the incoming instances for testing and then for updating a classifier. Comparing classifiers based on such a test risks to give biased results, since a dataset is processed only once in a fixed sequential order. Such a test concludes how well classifiers adapt when changes happen at fixed time points, while the ultimate goal is to assess how well they would adapt when changes of a similar type happen unexpectedly. To reduce the risk of biased evaluation we propose to run multiple tests with permuted data. A random permutation is not suitable, as it makes the data distribution uniform over time and destroys the adaptive learning problem. We develop three permutation techniques with theoretical control mechanisms that ensure that different distributions in data are preserved while perturbing the data order. The idea is to manipulate blocks of data keeping individual instances close together. Our permutations reduce the risk of biased evaluation by making it possible to analyze sensitivity of classifiers to variations in the data order.
机译:我们研究在线分类器的评估,这些分类器旨在适应数据分布随时间的变化(概念漂移)。评估此类分类器的标准过程是测试然后训练,它反复使用传入的实例进行测试,然后更新分类器。基于此类测试比较分类器可能会产生偏差结果,因为数据集仅按固定的顺序处理一次。这样的测试总结了分类器在固定时间点发生更改时的适应性,而最终目标是评估当相似类型的更改意外发生时它们将如何适应。为了减少评估偏倚的风险,我们建议对排列的数据运行多个测试。随机排列是不合适的,因为它会使数据随时间分布均匀并且破坏自适应学习问题。我们开发了三种具有理论控制机制的置换技术,可确保在扰动数据顺序的同时保留数据的不同分布。这个想法是要操纵数据块,使各个实例保持在一起。我们的排列可以通过分析分类器对数据顺序变化的敏感性来降低评估偏差的风险。

著录项

  • 来源
    《Discovery science》|2011年|p.365-379|共15页
  • 会议地点 Espoo(FI);Espoo(FI)
  • 作者

    Indre Zliobaite;

  • 作者单位

    Smart Technology Research Center, Bournemouth University, Poole, UK;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 人工智能理论;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号