首页> 外文会议>International conference on neural information processing >Topic Model Kernel: An Empirical Study towards Probabilistically Reduced Features for Classification
【24h】

Topic Model Kernel: An Empirical Study towards Probabilistically Reduced Features for Classification

机译:主题模型内核:对概率归约特征进行分类的实证研究

获取原文

摘要

Probabilistic topic models have become a standard in modern machine learning with wide applications in organizing and summarizing 'documents' in high-dimensional data such as images, videos, texts, gene expression data, and so on. Representing data by dimensional reduction of mixture proportion extracted from topic models is not only richer in semantics than bag-of-word interpretation, but also more informative for classification tasks. This paper describes the Topic Model Kernel (TMK), a high dimensional mapping for Support Vector Machine classification of data generated from probabilistic topic models. The applicability of our proposed kernel is demonstrated in several classification tasks from real world datasets. We outperform existing kernels on the distributional features and give the comparative results on non-probabilistic data types.
机译:概率主题模型已成为现代机器学习的标准,在组织和汇总高维数据(例如图像,视频,文本,基因表达数据等)中的“文档”方面具有广泛的应用。通过从主题模型中提取的混合比例降维来表示数据,不仅在语义上比单词袋解释更丰富,而且对于分类任务也更具参考价值。本文介绍了主题模型内核(TMK),这是一种用于从概率主题模型生成的数据的支持向量机分类的高维映射。我们提出的内核的适用性在来自现实世界数据集的几个分类任务中得到了证明。我们在分布特征方面优于现有内核,并给出了非概率数据类型的比较结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号