Triplex Transfer Learning: Exploiting Both Shared and Distinct Concepts for Text Classification

Zhuang F.; Luo P.; Du C.; He Q.; Shi Z.; Xiong H.

首页> 外文期刊>Cybernetics, IEEE Transactions on >Triplex Transfer Learning: Exploiting Both Shared and Distinct Concepts for Text Classification

【24h】

Triplex Transfer Learning: Exploiting Both Shared and Distinct Concepts for Text Classification

机译：三重传递学习：利用文本分类的共享和独特概念

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Transfer learning focuses on the learning scenarios when the test data from target domains and the training data from source domains are drawn from similar but different data distributions with respect to the raw features. Along this line, some recent studies revealed that the high-level concepts, such as word clusters, could help model the differences of data distributions, and thus are more appropriate for classification. In other words, these methods assume that all the data domains have the same set of shared concepts, which are used as the bridge for knowledge transfer. However, in addition to these shared concepts, each domain may have its own distinct concepts. In light of this, we systemically analyze the high-level concepts, and propose a general transfer learning framework based on nonnegative matrix trifactorization, which allows to explore both shared and distinct concepts among all the domains simultaneously. Since this model provides more flexibility in fitting the data, it can lead to better classification accuracy. Moreover, we propose to regularize the manifold structure in the target domains to improve the prediction performances. To solve the proposed optimization problem, we also develop an iterative algorithm and theoretically analyze its convergence properties. Finally, extensive experiments show that the proposed model can outperform the baseline methods with a significant margin. In particular, we show that our method works much better for the more challenging tasks when there are distinct concepts in the data.

机译：当从目标域的测试数据和源域的训练数据是从原始特征的相似但不同的数据分布中提取时，转移学习侧重于学习场景。沿着这条线，最近的一些研究表明，高级概念（例如单词簇）可以帮助对数据分布的差异进行建模，因此更适合分类。换句话说，这些方法假定所有数据域都具有相同的共享概念集，这些概念用作知识传递的桥梁。但是，除了这些共享的概念之外，每个域可能都有其自己独特的概念。有鉴于此，我们系统地分析了高级概念，并提出了一个基于非负矩阵三因子分解的通用转移学习框架，该框架允许同时探索所有领域中共享和不同的概念。由于此模型在拟合数据方面提供了更大的灵活性，因此可以导致更好的分类准确性。此外，我们建议对目标域中的流形结构进行规范化以提高预测性能。为了解决所提出的优化问题，我们还开发了一种迭代算法，并从理论上分析了其收敛性。最后，大量实验表明，所提出的模型可以大大优于基线方法。特别是，我们表明，当数据中存在不同的概念时，我们的方法对于更具挑战性的任务效果更好。

著录项

来源
《Cybernetics, IEEE Transactions on》 |2014年第7期|1191-1203|共13页
作者
Zhuang F.; Luo P.; Du C.; He Q.; Shi Z.; Xiong H.;
展开▼
作者单位

Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China|c|;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Common concept; distinct concept; distribution mismatch; nonnegative matrix trifactorization; triplex transfer learning;

机译：通用概念;区别概念;分布不匹配;负矩阵三因子分解;三重传递学习;

相似文献

外文文献
中文文献
专利

1. Quadruple Transfer Learning: Exploiting both shared and non-shared concepts for text classification [J] . Pan Jianhan, Hu Xuegang, Zhang Yuhong, Knowledge-Based Systems . 2015,第DECa期

机译：四重转移学习：利用共享和非共享概念进行文本分类
2. Data and systems for medication-related text classification and concept normalization from Twitter: insights from the Social Media Mining for Health (SMM4H)-2017 shared task [J] . Abeed Sarker, Maksim Belousov, Jasper Friedrichs, Journal of the American Medical Informatics Association : . 2018,第10期

机译：Twitter的药物相关文本分类和概念标准化的数据和系统：来自社交媒体挖掘的洞察力（SMM4H） - 2017年共享任务
3. Learning transferable features in meta-learning for few-shot text classification [J] . Xu Jincheng, Du Qingfeng Pattern recognition letters . 2020,第Jula期

机译：学习Meta-Learning中的可转让功能，用于几次文本分类
4. Transfer classification for distinct manifestations with shared information [C] . Lu Qi, Peijie Yin, Xiayuan Huang, World Congress on Intelligent Control and Automation . 2016

机译：通过共享信息为不同的表现进行转移分类
5. Mining a Shared Concept Space for Domain Adaptation in Text Mining. [D] . Chen, Bo. 2011

机译：在文本挖掘中挖掘用于域适应的共享概念空间。
6. Data and systems for medication-related text classification and concept normalization from Twitter: insights from the Social Media Mining for Health (SMM4H)-2017 shared task [O] . Abeed Sarker, Maksim Belousov, Jasper Friedrichs, 2018

机译：Twitter上与药物有关的文本分类和概念归一化的数据和系统：来自社交媒体健康促进会（SMM4H）-2017的共享任务的见解
7. Data and systems for medication-related text classification and concept normalization from Twitter: insights from the Social Media Mining for Health (SMM4H)-2017 shared task [O] . Abeed Sarker, Maksim Belousov, Jasper Friedrichs, 2018

机译：与Twitter相关的药物相关文本分类和概念标准化的数据和系统：来自社交媒体挖掘的洞察力 - 2217分享任务

Triplex Transfer Learning: Exploiting Both Shared and Distinct Concepts for Text Classification

摘要

著录项

相似文献

相关主题

期刊订阅