Classification from Pairwise Similarity and Unlabeled Data

Han Bao; Gang Niu; Masashi Sugiyama

首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Classification from Pairwise Similarity and Unlabeled Data

【24h】

Classification from Pairwise Similarity and Unlabeled Data

机译：从成对相似性和未标记数据分类

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Supervised learning needs a huge amount of labeled data, which can be a big bottleneck under the situation where there is a privacy concern or labeling cost is high. To overcome this problem, we propose a new weakly-supervised learning setting where only similar (S) data pairs (two examples belong to the same class) and unlabeled (U) data points are needed instead of fully labeled data, which is called SU classification. We show that an unbiased estimator of the classification risk can be obtained only from SU data, and the estimation error of its empirical risk minimizer achieves the optimal parametric convergence rate. Finally, we demonstrate the effectiveness of the proposed method through experiments.

机译：监督学习需要大量标记数据，这可能是一个隐私问题或标签成本的情况下的大瓶颈。为了克服这个问题，我们提出了一个新的弱监督学习设置，其中只需要类似（S）数据对（两个示例属于同一类）和未标记的（U）数据点而不是完全标记的数据，称为SU分类。我们表明，只能从SU数据获得分类风险的无偏估计，并且其经验风险最小化器的估计误差实现了最佳的参数收敛速率。最后，我们通过实验证明了所提出的方法的有效性。

著录项

来源
《JMLR: Workshop and Conference Proceedings》 |2018年第2010期|共10页
作者
Han Bao; Gang Niu; Masashi Sugiyama;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Classification From Pairwise Similarities/Dissimilarities and Unlabeled Data via Empirical Risk Minimization [J] . Takuya Shimada, Han Bao, Issei Sato, Neural computation . 2021,第5期

机译：通过经验风险最小化分组与成对相似性/异化和未标记数据的分类
2. Generate pairwise constraints from unlabeled data for semi-supervised clustering [J] . Masud Md Abdul, Huang Joshua Zhexue, Zhong Ming, Data & Knowledge Engineering . 2019,第Sepa期

机译：从未标记的数据生成成对约束以进行半监督聚类
3. Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data [J] . Tomoya Sakai, Marthinus Christoffel Plessis, Gang Niu, JMLR: Workshop and Conference Proceedings . 2017,第4期

机译：基于来自正数据和未标记数据的分类的半监督分类
4. Binary Classification Only from Unlabeled Data by Iterative Unlabeled-unlabeled Classification [C] . Hirotaka Kaji, Masashi Sugiyama IEEE International Conference on Acoustics, Speech and Signal Processing . 2019

机译：仅通过迭代未标记-未标记分类从未标记数据中进行二进制分类
5. Using unlabeled data to improve text classification. [D] . Nigam, Kanal Paul. 2001

机译：使用未标记的数据来改善文本分类。
6. Learning Pairwise-Similarity Guided Sparse Functional Connectivity Network for MCI Classification [O] . Xiaobo Chen, Han Zhang, Yu Zhang, -1

机译：学习成对相似性指导的稀疏功能连接网络以进行MCI分类
7. Utilizing Unlabeled Documents in Automatic Classification with Inter-document Similarities [O] . Pan-Jun Kim, Jae-Yun Lee 2007

机译：利用未标记的文档在自动分类中与文档间相似之处

Classification from Pairwise Similarity and Unlabeled Data

摘要

著录项

相似文献

相关主题

期刊订阅