首页> 外文会议>2016 IEEE International Conference on Big Data Analysis >Active learning for cost-sensitive classification using logistic regression model
【24h】

Active learning for cost-sensitive classification using logistic regression model

机译:使用Logistic回归模型主动学习成本敏感分类

获取原文
获取原文并翻译 | 示例

摘要

Active learning aims to selectively label the most informative examples to save the data collection cost. While active learning has been well studied for balanced classification problems, limited research is performed in cost-sensitive scenario. In this paper, we investigate the problem of active learning for cost-sensitive classification. We first propose a general active learning framework named GEM, which chooses examples leading to the minimum generalization error. Then we incorporate the misclassification cost into expected loss calculation under the proposed framework, and derive a model estimation rule with the Newton-Raphson method using logistic regression as the base model. Finally, we present the complete active learning algorithm for cost-sensitive classification. Extensive experiments on various benchmark data sets from the UCI repository have demonstrated the effectiveness of the proposed algorithm.
机译:主动学习旨在有选择地标记最具信息价值的示例,以节省数据收集成本。尽管针对平衡的分类问题已经对主动学习进行了很好的研究,但在成本敏感的情况下进行的研究却很少。在本文中,我们研究了针对成本敏感分类的主动学习问题。我们首先提出一个通用的主动学习框架GEM,该框架选择导致最小化泛化误差的示例。然后,在提出的框架下,将错误分类的成本纳入预期损失的计算中,并以逻辑回归为基础,​​采用牛顿-拉夫森法推导模型估计规则。最后,我们提出了针对成本敏感分类的完整主动学习算法。对UCI资料库中各种基准数据集的大量实验证明了该算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号