首页> 外文期刊>Cybernetics, IEEE Transactions on >Triplex Transfer Learning: Exploiting Both Shared and Distinct Concepts for Text Classification
【24h】

Triplex Transfer Learning: Exploiting Both Shared and Distinct Concepts for Text Classification

机译:三重传递学习:利用文本分类的共享和独特概念

获取原文
获取原文并翻译 | 示例
           

摘要

Transfer learning focuses on the learning scenarios when the test data from target domains and the training data from source domains are drawn from similar but different data distributions with respect to the raw features. Along this line, some recent studies revealed that the high-level concepts, such as word clusters, could help model the differences of data distributions, and thus are more appropriate for classification. In other words, these methods assume that all the data domains have the same set of shared concepts, which are used as the bridge for knowledge transfer. However, in addition to these shared concepts, each domain may have its own distinct concepts. In light of this, we systemically analyze the high-level concepts, and propose a general transfer learning framework based on nonnegative matrix trifactorization, which allows to explore both shared and distinct concepts among all the domains simultaneously. Since this model provides more flexibility in fitting the data, it can lead to better classification accuracy. Moreover, we propose to regularize the manifold structure in the target domains to improve the prediction performances. To solve the proposed optimization problem, we also develop an iterative algorithm and theoretically analyze its convergence properties. Finally, extensive experiments show that the proposed model can outperform the baseline methods with a significant margin. In particular, we show that our method works much better for the more challenging tasks when there are distinct concepts in the data.
机译:当从目标域的测试数据和源域的训练数据是从原始特征的相似但不同的数据分布中提取时,转移学习侧重于学习场景。沿着这条线,最近的一些研究表明,高级概念(例如单词簇)可以帮助对数据分布的差异进行建模,因此更适合分类。换句话说,这些方法假定所有数据域都具有相同的共享概念集,这些概念用作知识传递的桥梁。但是,除了这些共享的概念之外,每个域可能都有其自己独特的概念。有鉴于此,我们系统地分析了高级概念,并提出了一个基于非负矩阵三因子分解的通用转移学习框架,该框架允许同时探索所有领域中共享和不同的概念。由于此模型在拟合数据方面提供了更大的灵活性,因此可以导致更好的分类准确性。此外,我们建议对目标域中的流形结构进行规范化以提高预测性能。为了解决所提出的优化问题,我们还开发了一种迭代算法,并从理论上分析了其收敛性。最后,大量实验表明,所提出的模型可以大大优于基线方法。特别是,我们表明,当数据中存在不同的概念时,我们的方法对于更具挑战性的任务效果更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号