Graph-based boosting algorithm to learn labeled and unlabeled data

Liu Zheng; Jin Wei; Mu Ying

首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Graph-based boosting algorithm to learn labeled and unlabeled data

【24h】

Graph-based boosting algorithm to learn labeled and unlabeled data

机译：基于图形的促进算法，用于学习标记和未标记的数据

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Ensemble learning is an effective technique to learn the information of data by combining multiple models. But usually the combined models are supervised learning algorithms which need a lot of labeled data to tune their parameters. Some ensemble learning algorithms were proposed to exploit the information of unlabeled data. These methods had to learn the samples with pseudo-labels due to the scarcity of labeled data. But it's inevitable for the samples with pseudo-labels to bring wrong information during training process. In this paper, we will propose a novel graph-based boosting (GBB) algorithm to learn labeled and unlabeled data. GBB is a framework combining many models linearly. And pseudo-labels will not occur during training process. GBB will assign a new weighting vector for the labeled samples and a transformed similarity matrix for all samples to train the combined model at each iteration. We also extend GBB, termed as weighted GBB (WGBB), to learn imbalanced data by adding a weighting vector for the labeled data. Finally, 14 relatively balanced datasets and 22 imbalanced datasets are used to validate the performances of GBB and WGBB respectively. Experimental results illustrate that GBB can achieve a competitive performance and WGBB has an obvious advantage to handle classification problem of imbalanced data, comparing with other related algorithms. (C) 2020 Elsevier Ltd. All rights reserved.

机译：集合学习是一种通过组合多种模型来学习数据信息的有效技术。但通常，组合模型是监督学习算法，需要大量标记的数据来调整其参数。建议一些集合学习算法利用未标记数据的信息。由于标记数据的稀缺性，这些方法必须使用伪标签来学习样品。但是对于伪标签的样本是不可避免的，以便在培训过程中带来错误的信息。在本文中，我们将提出一种基于图形的促进（GBB）算法来学习标记和未标记的数据。 GBB是一种框架，即在线性地结合了许多型号。在培训过程中不会发生伪标签。 GBB将为标记的样本和变换的相似性矩阵为所有样本分配一个新的加权向量，以便在每次迭代中培训组合模型。我们还扩展GBB称为加权GBB（WGBB），通过为标记数据添加加权向量来学习不平衡数据。最后，使用14个相对平衡的数据集和22个不平衡数据集分别用于验证GBB和WGBB的性能。实验结果表明，GBB可以实现竞争性能，而WGBB具有明显的优势，可以处理不平衡数据的分类问题，与其他相关算法相比。（c）2020 elestvier有限公司保留所有权利。

著录项

来源
《Pattern Recognition: The Journal of the Pattern Recognition Society》 |2020年第1期|共11页
作者
Liu Zheng; Jin Wei; Mu Ying;
展开▼
作者单位

Zhejiang Univ Res Ctr Analyt Instrumentat Inst Cyber Syst &

Control State Key Lab Ind Control Technol Hangzhou 310027 Peoples R China;

Zhejiang Univ Res Ctr Analyt Instrumentat Inst Cyber Syst &

Control State Key Lab Ind Control Technol Hangzhou 310027 Peoples R China;

Zhejiang Univ Res Ctr Analyt Instrumentat Inst Cyber Syst &

Control State Key Lab Ind Control Technol Hangzhou 310027 Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Graph; Boosting; Semi-supervised learning; Imbalance learning;

机译：图;提升;半监督学习;不平衡学习;

相似文献

外文文献
中文文献
专利

1. Graph-based boosting algorithm to learn labeled and unlabeled data [J] . Liu Zheng, Jin Wei, Mu Ying Pattern Recognition: The Journal of the Pattern Recognition Society . 2020,第1期

机译：基于图形的促进算法，用于学习标记和未标记的数据
2. A mixture model and EM-based algorithm for class discovery, robust classification, and outlier rejection in mixed labeled/unlabeled data sets [J] . Miller D.J., Browning J. IEEE Transactions on Pattern Analysis and Machine Intelligence . 2003,第11期

机译：混合模型和基于EM的算法，用于混合标记/未标记数据集中的类发现，鲁棒分类和异常剔除
3. A fuzzy method to learn text classifier from labeled and unlabeled examples [J] . LIU Hong, HUANG Shang-teng Journal of Harbin Institute of Technology . 2004,第1期

机译：从标记和未标记示例中学习文本分类器的模糊方法
4. Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots [C] . Chen Chao-Yeh, Grauman Kristen IEEE Conference on Computer Vision and Pattern Recognition . 2013

机译：观看未贴标签的视频有助于从很少贴标签的快照中学习新的人为操作
5. Boosting algorithms for mining biomedical and biological data. [D] . Krishnaraj, Yazhene. 2009

机译：促进生物医学和生物数据挖掘的算法。
6. Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction [O] . Dokyoon Kim, Je-Gun Joung, Kyung-Ah Sohn, 2015

机译：知识增强：基于图的整合方法结合多组学数据和基因组知识可预测癌症临床结果
7. Boosting Statistical Word Alignment Using Labeled and Unlabeled Data [O] . Hua Wu, Haifeng Wang, Zhanyi Liu 2009

机译：使用标记和未标记的数据促进统计词对齐
8. Cognitive Study of Learning with Labeled and Unlabeled Data. [R] . Zhu, X., Rogers, T. T. 2012

机译：标记和未标记数据学习的认知研究。

Graph-based boosting algorithm to learn labeled and unlabeled data

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅