首页> 外文会议>European Conference on Information Retrieval >Re-assessing the 'Classify and Count' Quantification Method
【24h】

Re-assessing the 'Classify and Count' Quantification Method

机译:重新评估“分类和计数”量化方法

获取原文

摘要

Learning to quantify (a.k.a. quantification) is a task concerned with training unbiased estimators of class prevalence via supervised learning. This task originated with the observation that "Classify and Count" (CC), the trivial method of obtaining class prevalence estimates, is often a biased estimator, and thus delivers suboptimal quantification accuracy. Following this observation, several methods for learning to quantify have been proposed and have been shown to outperform CC. In this work we contend that previous works have failed to use properly optimised versions of CC. We thus reassess the real merits of CC and its variants, and argue that, while still inferior to some cutting-edge methods, they deliver near-state-of-the-art accuracy once (a) hyperparameter optimisation is performed, and (b) this optimisation is performed by using a truly quantification-oriented evaluation protocol. Experiments on three publicly available binary sentiment classification datasets support these conclusions.
机译:学习量化(A.K.A.量化)是一项有关涉及通过监督学习培训的课程普遍性估计的任务。该任务源于观察“分类和计数”(CC),获得类普遍估计的琐碎方法通常是偏置估计器,因此提供了次优的量化精度。在该观察之后,已经提出了几种学习量化的方法,并已显示出优于CC。在这项工作中,我们认为以前的作品未能使用正确优化的CC版本。因此,我们重新评估了CC及其变体的真正优点,并争辩说,虽然仍然不如某种尖端方法,但它们在执行(a)封路计优化(B )通过使用真正定量的评估协议来执行这种优化。三个公开的二元情绪分类数据集的实验支持这些结论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号