首页> 外文会议>Computer processing of oriental languages : Language technology for the Knowledge-based economy >A Density-Based Re-ranking Technique for Active Learning for Data Annotations
【24h】

A Density-Based Re-ranking Technique for Active Learning for Data Annotations

机译:基于密度的重新排序技术,用于数据标注的主动学习

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

One of the popular techniques of active learning for data annotations is uncertainty sampling, however, which often presents problems when outliers are selected. To solve this problem, this paper proposes a density-based re-ranking technique, in which a density measure is adopted to determine whether an unlabeled example is an outlier. The motivation of this study is to prefer not only the most informative example in terms of uncertainty measure, but also the most representative example in terms of density measure. Experimental results of active learning for word sense disambiguation and text classification tasks using six real-world evaluation data sets show that our proposed density-based re-ranking technique can improve uncertainty sampling.
机译:主动学习中用于数据注释的流行技术之一是不确定性采样,但是,在选择离群值时通常会出现问题。为了解决这个问题,本文提出了一种基于密度的重排序技术,其中采用密度度量来确定未标记的示例是否是异常值。这项研究的动机是,不仅在不确定性度量方面更喜欢提供最多信息的示例,而且在密度度量方面也更喜欢具有代表性的示例。使用六个真实世界的评估数据集进行主动学习以进行词义消歧和文本分类任务的实验结果表明,我们提出的基于密度的重排序技术可以改善不确定性采样。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号