A Novel Reliable Negative Method Based on Clustering for Learning from Positive and Unlabeled Examples

Abstract

This paper investigates a new approach for training text classifiers when only a small set of positive examples is available together with a large set of unlabeled examples. The key feature of this problem is that there are no negative examples for learning. Recently, a few techniques have been reported that build a classifier in two steps. In this paper, we introduce a novel method for the first step, which clusters the unlabeled and positive examples to identify reliable negative documents, and then run SVM iteratively. We perform a comprehensive evaluation against two other methods and show experimentally that the proposed method is efficient and effective.
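The first step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the selection rule, so the one used here is an assumption: documents are clustered with a simple bisecting k-means (the clustering named in the keywords), and the unlabeled documents in any cluster that contains no positive example are treated as reliable negatives. The function names, the parameter `k`, the Euclidean distance, and the deterministic seeding are all hypothetical choices for illustration, not the authors' implementation. The second step (iterative SVM training) is not shown.

```python
from math import dist


def mean(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return tuple(sum(c) / n for c in zip(*vectors))


def two_means(points, iters=20):
    """Split (vector, tag) pairs into two groups with 2-means.

    Seeding is deterministic (an assumption made for reproducibility):
    the point farthest from the centroid, then the point farthest from it.
    """
    vecs = [v for v, _ in points]
    c0 = max(vecs, key=lambda v: dist(v, mean(vecs)))
    c1 = max(vecs, key=lambda v: dist(v, c0))
    for _ in range(iters):
        a = [p for p in points if dist(p[0], c0) <= dist(p[0], c1)]
        b = [p for p in points if dist(p[0], c0) > dist(p[0], c1)]
        if not a or not b:
            break  # degenerate split, e.g. all points identical
        new0 = mean([v for v, _ in a])
        new1 = mean([v for v, _ in b])
        if (new0, new1) == (c0, c1):
            break  # centroids stopped moving
        c0, c1 = new0, new1
    return a, b


def bisecting_kmeans(points, k):
    """Repeatedly bisect the largest cluster until k clusters exist."""
    clusters = [list(points)]
    while len(clusters) < k:
        clusters.sort(key=len)
        biggest = clusters.pop()
        if len(biggest) < 2:
            clusters.append(biggest)
            break
        a, b = two_means(biggest)
        if not a or not b:
            clusters.append(biggest)
            break
        clusters += [a, b]
    return clusters


def reliable_negatives(positives, unlabeled, k=3):
    """Return sorted indices of unlabeled docs in clusters with no positives.

    Assumed rule: a cluster made up only of unlabeled documents is taken
    as a source of reliable negatives.
    """
    points = [(tuple(v), ("P", i)) for i, v in enumerate(positives)]
    points += [(tuple(v), ("U", i)) for i, v in enumerate(unlabeled)]
    selected = []
    for cluster in bisecting_kmeans(points, k):
        if all(tag[0] == "U" for _, tag in cluster):
            selected.extend(tag[1] for _, tag in cluster)
    return sorted(selected)


# Toy 2-D "document vectors"; real use would start from TF-IDF features.
positives = [(0, 0), (0, 1), (1, 0)]
unlabeled = [(0.5, 0.5), (1, 1), (9, 9), (9, 10), (10, 9)]
print(reliable_negatives(positives, unlabeled, k=2))
```

In this toy run the three unlabeled points far from every positive example form their own cluster and are returned as reliable negatives, while the two unlabeled points that share a cluster with the positives stay unlabeled for the later classification step.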

Bibliographic record

Source: Information Retrieval Technology, 2008, pp. 385-392 (8 pages)
Authors: Bangzuo Zhang; Wanli Zuo
Format: PDF
CLC classification: Computer equipment security
Keywords: semi-supervised learning; text classification; bisecting k-means clustering; learning from positive and unlabeled examples (LPU)

Similar literature

1. Reliable Negative Extracting Based on kNN for Learning from Positive and Unlabeled Examples [J]. Bangzuo Zhang, Wanli Zuo. Journal of Computers, 2009, No. 1.
2. Automatic Detection of Mis-Spelled Japanese Expressions Using a New Method for Automatic Extraction of Negative Examples Based on Positive Examples [J]. Masaki Murata, Hitoshi Isahara. IEICE Transactions on Information and Systems, 2002, No. 9.
3. Effectively Identifying Compound-Protein Interactions by Learning from Positive and Unlabeled Examples [J]. Zhanzhan Cheng, Shuigeng Zhou, Yang Wang, et al. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2018, No. 6.
4. A Novel Reliable Negative Method Based on Clustering for Learning from Positive and Unlabeled Examples [C]. Bangzuo Zhang, Wanli Zuo. Asia Information Retrieval Symposium, 2008.
5. Shape Theoretic and Machine Learning Based Methods for Automatic Clustering and Classification of Cardiomyocytes Based on Action Potential Morphology [D]. Giann Gorospe. 2018.
6. ProDiGe: Prioritization Of Disease Genes with multitask machine learning from positive and unlabeled examples [O]. Fantine Mordelet, Jean-Philippe Vert. 2011.
7. Estimating the F1 Score for Learning from Positive and Unlabeled Examples [O]. Seyed Amin Tabatabaei, Jan Klein, Mark Hoogendoorn. 2020.
