...
首页> 外文期刊>Neurocomputing >Transfer learning using a nonparametric sparse topic model
【24h】

Transfer learning using a nonparametric sparse topic model

机译:使用非参数稀疏主题模型进行转移学习

获取原文
获取原文并翻译 | 示例
           

摘要

In many domains data items are represented by vectors of counts; count data arises, for example, in bioinformatics or analysis of text documents represented as word count vectors. However, often the amount of data available from an interesting data source is too small to model the data source well. When several data sets are available from related sources, exploiting their similarities by transfer learning can improve the resulting models compared to modeling sources independently. We introduce a Bayesian generative transfer learning model which represents similarity across document collections by sparse sharing of latent topics controlled by an Indian buffet process. Unlike a prominent previous model, hierarchical Dirichlet process (HDP) based multi-task learning, our model decouples topic sharing probability from topic strength, making sharing of low-strength topics easier. In experiments, our model outperforms the HDP approach both on synthetic data and in first of the two case studies on text collections, and achieves similar performance as the HDP approach in the second case study.
机译:在许多领域中,数据项都是由计数向量表示的。例如,计数数据出现在生物信息学或以单词计数向量表示的文本文档的分析中。但是,有趣的数据源中可用的数据量通常太少,无法很好地对数据源进行建模。当可以从相关来源获得多个数据集时,与单独建模源相比,通过转移学习来利用它们的相似性可以改善所得模型。我们引入贝叶斯生成转移学习模型,该模型通过稀疏共享由印度自助餐过程控制的潜在主题来表示文档集合之间的相似性。与以前的著名模型不同,我们的模型与基于分层Dirichlet过程(HDP)的多任务学习模型不同,它使主题共享概率与主题强度脱钩,从而使低强度主题的共享更加容易。在实验中,我们的模型在合成数据上和在文本集合的两个案例研究中均优于HDP方法,并且在第二个案例研究中均具有与HDP方法相似的性能。

著录项

  • 来源
    《Neurocomputing》 |2013年第18期|124-137|共14页
  • 作者单位

    Helsinki Institute for Information Technology HIIT, Department of Information and Computer Science, Aalto University, P.O. Box 15400, FI-00076 Aalto, Finland;

    Helsinki Institute for Information Technology HIIT, Department of Information and Computer Science, Aalto University, P.O. Box 15400, FI-00076 Aalto, Finland;

    Helsinki Institute for Information Technology HIIT, Department of Information and Computer Science, Aalto University, P.O. Box 15400, FI-00076 Aalto, Finland;

    Helsinki Institute for Information Technology HIIT, Department of Information and Computer Science, Aalto University, P.O. Box 15400, FI-00076 Aalto, Finland;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Transfer learning; Latent Dirichlet allocation; Nonparametric Bayesian inference; Sparsity; Small sample size; Topic models;

    机译:转移学习;潜在Dirichlet分配;非参数贝叶斯推理;稀疏性样本量小;主题模型;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号