首页> 外文会议>Information Retrieval Technology >A Novel Reliable Negative Method Based on Clustering for Learning from Positive and Unlabeled Examples
【24h】

A Novel Reliable Negative Method Based on Clustering for Learning from Positive and Unlabeled Examples

机译:一种新颖的基于聚类的可靠负值方法,用于从正例和未标记示例中学习

获取原文

摘要

This paper investigates a new approach for training text classifiers when only a small set of positive examples is available together with a large set of unlabeled examples. The key feature of this problem is that there are no negative examples for learning. Recently, a few techniques have been reported are based on building a classifier in two steps. In this paper, we introduce a novel method for the first step, which cluster the unlabeled and positive examples to identify the reliable negative document, and then run SVM iteratively. We perform a comprehensive evaluation with other two methods, and show experimentally that it is efficient and effective.
机译:当只有一小部分积极的例子和大量未标记的例子可用时,本文研究了一种训练文本分类器的新方法。这个问题的关键特征是没有负面的例子可供学习。最近,已经报道了一些基于两步构建分类器的技术。在本文中,我们为第一步引入了一种新方法,该方法将未标记的正样本与正样本聚类,以识别可靠的负文档,然后迭代运行SVM。我们使用其他两种方法进行了综合评估,并通过实验证明了它的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号