首页> 外文期刊>電子情報通信学会技術研究報告. 情報論的学習理論と機械学習 >Coping with Class Balance Change in Classification: Class-Prior Estimation with Energy Distance
【24h】

Coping with Class Balance Change in Classification: Class-Prior Estimation with Energy Distance

机译:应对分类中的班级平衡变化:具有能量距离的班级优先估计

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Due to sample selection bias or non-stationarity of the environment, the class balance often changes between training and test datasets. Naive classifier training under such a situation yields a biased solution. This bias can be corrected by weighted training according to the test class balance, but this test class balance is often unknown in practice. In this paper, we consider a semi-supervised learning setup where labeled training samples and unlabeled test samples are available, and address the problem of class balance estimation. It was shown that the test class balance can be estimated by fitting a mixture of class-wise training input distributions to the test input distribution, and class balance estimators were developed under, e.g., the Kullback-Leibler divergence and the L_2 distance. In this paper, we propose a simple class balance estimator based on the energy distance and demonstrate, its usefulness through experiments.
机译:由于样本选择的偏见或环境的不稳定,课程平衡通常在训练和测试数据集之间发生变化。在这种情况下进行朴素的分类器训练会产生有偏差的解决方案。可以通过根据测试班级余额进行加权训练来纠正此偏差,但是在实践中通常不知道该测试班级余额。在本文中,我们考虑了一种半监督学习设置,其中提供了标记的训练样本和未标记的测试样本,并解决了课程平衡估计的问题。结果表明,可以通过将类训练输入分布与测试输入分布进行混合来估计测试班平衡,并且在例如Kullback-Leibler散度和L_2距离下开发了班平衡估计量。在本文中,我们提出了一种基于能量距离的简单类平衡估计器,并通过实验证明了其有用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号