A More Accurate Text Classifier for Positive and Unlabeled data

机译：用于肯定和未标记数据的更准确的文本分类器

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Almost all LPU algorithms rely heavily on two steps: exploiting reliable negative dataset and supplementing positive dataset. For above two steps, this paper originally proposes a two-step approach, that is, CoTrain-Active. The first step, employing CoTrain algorithm, iterates to purify the unlabeled set with two individual SVM base classifiers. The second step, adopting active-learning algorithm, further expands the positive set effectively by request the true label for the "suspect positive" examples. Comprehensive experiments demonstrate that our approach is superior to Biased-SVM which is said to be previous best. Moreover, CoTrain-Active is especially suitable for those situations where the given positive dataset P is extremely insufficient.

机译：几乎所有的LPU算法都严重依赖两个步骤：开发可靠的负数据集和补充正数据集。对于以上两步，本文最初提出了一种两步方法，即CoTrain-Active。第一步，采用CoTrain算法，使用两个单独的SVM基本分类器进行迭代以纯化未标记的集合。第二步，采用主动学习算法，通过为“可疑肯定”示例请求真实标签，进一步有效地扩展肯定集。全面的实验表明，我们的方法要优于Biased-SVM，后者据说是以前最好的。此外，CoTrain-Active特别适用于给定正数据集P非常不足的情况。

著录项

来源
《International Conference on Adaptive and Natural Computing Algorithms; 2005; Coimbra(PT)》|2005年|P.401-404|共4页
会议地点 Coimbra(PT)
作者
Rur Ming Xin; Wan li Zuo;
展开▼
作者单位

Key Laboratory of Symbol Computation and Knowledge Engineering of the Ministry of Educating, Colloege of Computer Science, JiLin University of China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算机网络;
关键词

相似文献

外文文献
中文文献
专利

1. Classifying networked text data with positive and unlabeled examples [J] . Li Mei, Pan Shirui, Zhang Yang, Pattern recognition letters . 2016,第jula1期

机译：使用肯定和未标记的示例对网络文本数据进行分类
2. Building text classifiers using positive, unlabeled and ‘outdated’ examples [J] . Han Jiayu, Zuo Wanli, Liu Lu, Concurrency and computation: practice and experience . 2016,第13期

机译：使用肯定的，未标记的和“过时的”示例来构建文本分类器
3. Dynamic classifier ensemble for positive unlabeled text stream classification [J] . Shirui Pan, Yang Zhang, Xue Li Knowledge and information systems . 2012,第2期

机译：动态分类器集成，用于积极的未标记文本流分类
4. A More Accurate Text Classifier for Positive and Unlabeled data [C] . Rur Ming Xin, Wan li Zuo International Conference on Adaptive and Natural Computing Algorithms . 2005

机译：用于正和未标记数据的更准确的文本分类器
5. Using unlabeled data to improve text classification. [D] . Nigam, Kanal Paul. 2001

机译：使用未标记的数据来改善文本分类。
6. Accurate Determination of Imaging Modality using an Ensemble of Text- and Image-Based Classifiers [O] . Charles E. Kahn Jr., Jayashree Kalpathy-Cramer, Cesar A. Lam, 2012

机译：使用基于文本和图像的分类器的组合准确确定成像模态
7. Building Text Classifiers using Positive and Unlabeled Examples [O] . Bing Liu, Yang Dai, Xiaoli Li, 2003

机译：使用正例和未标记示例构建文本分类器
8. Using EM to Classify Text from Labeled and Unlabeled Documents [R] . Nigam, K. , McCallum, A. , Thrun, S. , 1998

机译：使用Em从标记和未标记文档中分类文本

A More Accurate Text Classifier for Positive and Unlabeled data

摘要

著录项

相似文献

相关主题

期刊订阅