...
首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Large Margin Distribution Learning with Cost Interval and Unlabeled Data
【24h】

Large Margin Distribution Learning with Cost Interval and Unlabeled Data

机译:具有成本间隔和未标记数据的大利润分配学习

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In many real-world applications, different types of misclassification usually suffer from different costs, but the accurate cost is often hard to be determined and usually one can only get an interval-estimation like that one type of mistake is about 5 to 10 times more serious than the other type. On the other hand, there are usually abundant unlabeled data available, leading to great research effort about semi-supervised learning. It is noticeable that cost interval and unlabeled data usually appear simultaneously in practice tasks; however, there is rare study tackling them together. In this paper, we propose the cisLDM approach which is able to handle cost interval and exploit unlabeled data in a principled way. Rather than maximizing the minimum margin like traditional large margin classifiers, cisLDM tries to optimize the margin distribution on both labeled and unlabeled data when minimizing the worst-case total-cost and the mean total-cost simultaneously according to the cost interval. Experiments on a broad range of datasets and cost settings exhibit the impressive performance of cisLDM. In particular, cisLDM is able to reduce 47 percent more total-cost than standard SVM and 27 percent more total-cost than cost-sensitive semi-supervised SVM which assumes the true cost value is known in advance.
机译:在许多实际应用中,不同类型的错误分类通常会遭受不同的代价,但是准确的代价通常很难确定,通常只能得到一个区间估计,就像一种错误类型大约高5至10倍。比其他类型严重。另一方面,通常有大量未标记的数据可用,从而导致有关半监督学习的大量研究工作。值得注意的是,成本间隔和未标记数据通常在练习任务中同时出现。但是,很少有研究将它们结合在一起。在本文中,我们提出了cisLDM方法,该方法能够以原则性的方式处理成本间隔并利用未标记的数据。 cisLDM并没有像传统的大利润率分类器那样使最小利润率最大化,而是在根据成本间隔同时最小化最坏情况的总成本和平均总成本时,尝试优化标记数据和未标记数据的利润率分布。在广泛的数据集和成本设置上进行的实验显示了cisLDM令人印象深刻的性能。特别是,cisLDM能够比标准SVM降低47%的总成本,并且比成本敏感的半监督SVM降低27%的总成本,前提是假定真正的成本值是事先已知的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号