Exploiting Unlabeled Data in Ensemble Methods

Abstract

An adaptive semi-supervised ensemble method, ASSEMBLE, is proposed that constructs classification ensembles based on both labeled and unlabeled data. ASSEMBLE alternates between assigning "pseudo-classes" to the unlabeled data using the existing ensemble and constructing the next base classifier using both the labeled and pseudo-labeled data. Mathematically, this intuitive algorithm corresponds to maximizing the classification margin in hypothesis space as measured on both the labeled and unlabeled data. Unlike alternative approaches, ASSEMBLE does not require a semi-supervised learning method for the base classifier. ASSEMBLE can be used in conjunction with any cost-sensitive classification algorithm for both two-class and multi-class problems. ASSEMBLE using decision trees won the NIPS 2001 Unlabeled Data Competition. In addition, strong results on several benchmark datasets using both decision trees and neural networks support the proposed method.
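
The alternation described above can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes one-dimensional inputs, labels in {-1, +1}, weighted decision stumps as the base learner, nearest-labeled-neighbor initialization of the pseudo-classes, and AdaBoost-style weights exp(-y F(x)). All function names (`fit_stump`, `semi_supervised_boost`, `predict`) are hypothetical.

```python
import math

def stump_predict(t, pol, x):
    # Decision stump: predicts pol above the threshold, -pol otherwise.
    return pol if x > t else -pol

def fit_stump(xs, ys, ws):
    # Return (weighted_error, threshold, polarity) of the best stump.
    sx = sorted(set(xs))
    cands = [sx[0] - 1.0] + [(a + b) / 2 for a, b in zip(sx, sx[1:])]
    best = (float("inf"), 0.0, 1)
    for t in cands:
        for pol in (1, -1):
            err = sum(w for x, y, w in zip(xs, ys, ws)
                      if stump_predict(t, pol, x) != y)
            if err < best[0]:
                best = (err, t, pol)
    return best

def ensemble_score(ensemble, x):
    # Weighted vote F(x) of all stumps built so far.
    return sum(a * stump_predict(t, pol, x) for a, t, pol in ensemble)

def predict(ensemble, x):
    return 1 if ensemble_score(ensemble, x) >= 0 else -1

def semi_supervised_boost(x_lab, y_lab, x_unl, rounds=10):
    # Assumption: initial pseudo-classes come from the nearest labeled point.
    pseudo = [y_lab[min(range(len(x_lab)), key=lambda i: abs(x_lab[i] - x))]
              for x in x_unl]
    xs = list(x_lab) + list(x_unl)
    ws = [1.0 / len(xs)] * len(xs)
    ensemble = []
    for _ in range(rounds):
        # Step 1: build the next base classifier on labeled + pseudo-labeled data.
        ys = list(y_lab) + pseudo
        err, t, pol = fit_stump(xs, ys, ws)
        if err >= 0.5:
            break  # no better than chance on the weighted data
        perfect = err <= 1e-10
        err = max(err, 1e-10)  # avoid log of infinity
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, t, pol))
        if perfect:
            break  # the stump already fits every current label
        # Step 2: re-assign pseudo-classes using the updated ensemble.
        pseudo = [predict(ensemble, x) for x in x_unl]
        ys = list(y_lab) + pseudo
        # Margin-based weights: points with small y * F(x) get more weight.
        raw = [math.exp(-y * ensemble_score(ensemble, x)) for x, y in zip(xs, ys)]
        z = sum(raw)
        ws = [r / z for r in raw]
    return ensemble

# Usage: four labeled points, four unlabeled points between them.
ens = semi_supervised_boost([0.0, 1.0, 9.0, 10.0], [-1, -1, 1, 1],
                            [2.0, 3.0, 7.0, 8.0])
print(predict(ens, 2.5), predict(ens, 7.5))  # prints: -1 1
```

The base learner here only needs to accept per-example weights, which mirrors the abstract's point that no semi-supervised base classifier is required. A stump could be swapped for any weight-aware or cost-sensitive learner.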

Bibliographic details

Source
2002, pp. 289-296 (8 pages)
Authors
Kristin P. Bennett; Ayhan Demiriz; Richard Maclin
Original format: PDF
Chinese Library Classification: Automation technology, computer technology
Keywords
boosting; semi-supervised learning; ensemble learning; classification

Similar literature

1. Exploiting unlabeled data to enhance ensemble diversity [J]. Min-Ling Zhang, Zhi-Hua Zhou. Data Mining and Knowledge Discovery. 2013, No. 1
2. Semi-supervised text categorization: Exploiting unlabeled data using ensemble learning algorithms [J]. Mohammad Reza Keyvanpour, Maryam Bahojb Imani. Intelligent Data Analysis. 2013, No. 3
3. Exploiting unlabeled data in ensemble methods [C]. Kristin P. Bennett, Ayhan Demiriz, Richard Maclin. Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2002). 2002
4. Exploitation of unlabeled data and related tasks in semi-supervised learning [D]. Liu, Qiuhua. 2007
5. Exploiting the noise: improving biomarkers with ensembles of data analysis methodologies [O]. Maud HW Starmans, Melania Pintilie, Thomas John, 2012
6. Exploiting Unlabeled Data in Ensemble Methods [O]. Kristin Bennett, Ayhan Demiriz. 2007
7. Techniques for Exploiting Unlabeled Data [R]. Rwebangira, M. R. 2008
