...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Bootstrap FDA for counting positives accurately in imprecise environments
【24h】

Bootstrap FDA for counting positives accurately in imprecise environments

机译:Bootstrap FDA可在不精确的环境中准确计数阳性

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Many real-world classification tasks involve discriminations between two unbalanced classes in imprecise environments, in which either the training data do not represent a random sample of the target population or the class distribution may shift over time in the target population. In such situations, in order to minimize the misclassification costs, the class distribution in target population must be known for selecting the optimal threshold. Forman has presented a method, based on the distribution generated on training data and the distribution on unlabeled test data, for estimating the number of positives in target population. However, when the data size is small, it is difficult to reliably generate these distributions for estimating the number of positives. This paper presents a novel algorithm to generate these distributions based on the bootstrap and Fisher discriminant analysis. Experiment results on five UCI data sets demonstrate its effectiveness. (c) 2007 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
机译:许多现实世界中的分类任务涉及在不精确的环境中区分两个不平衡的班级,其中训练数据不能代表目标人群的随机样本,或者班级分布可能随时间推移在目标人群中变化。在这种情况下,为了最小化错误分类的成本,必须知道目标人群中的类别分布以选择最佳阈值。 Forman提出了一种基于训练数据生成的分布和未标记测试数据的分布的方法,用于估计目标人群中阳性人数。然而,当数据大小小时,难以可靠地生成这些分布以估计正数。本文提出了一种基于Bootstrap和Fisher判别分析生成这些分布的新颖算法。在五个UCI数据集上的实验结果证明了其有效性。 (c)2007模式识别学会。由Elsevier Ltd.出版。保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号