Experiments with Non-parametric Topic Models

Wray Buntine; Swapnil Mishra

首页> 外文期刊>SIGKDD explorations >Experiments with Non-parametric Topic Models

【24h】

Experiments with Non-parametric Topic Models

机译：非参数主题模型实验

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In topic modelling, various alternative priors have been developed, for instance asymmetric and symmetric priors for the document-topic and topic-word matrices respectively, the hierarchical Dirichlet process prior for the document- topic matrix and the hierarchical Pitman-Yor process prior for the topic-word matrix. For information retrieval, language models exhibiting word burstiness are important. Indeed, this burstiness effect has been show to help topic models as well, and this requires additional word probability vectors for each document. Here we show how to combine these ideas to develop high-performing non-parametric topic models exhibiting burstiness based on standard Gibbs sampling. Experiments are done to explore the behavior of the models under different conditions and to compare the algorithms with previously published. The full non-parametric topic models with burstiness are only a small factor slower than standard Gibbs sampling for LDA and require double the memory, making them very competitive. We look at the comparative behaviour of different models and present some experimental insights.

机译：在主题建模中，已经开发了各种替代先验，例如分别针对文档主题和主题词矩阵的非对称先验和对称先验，针对文档主题矩阵的分层Dirichlet过程和针对文档主题矩阵的分层Pitman-Yor过程。主题词矩阵。对于信息检索，表现出单词突发性的语言模型很重要。确实，这种突发性效果也已显示出对主题模型的帮助，并且每个文档都需要附加的单词概率向量。在这里，我们展示了如何结合这些思想来开发基于标准Gibbs采样的表现出突发性的高性能非参数主题模型。已进行实验以探索模型在不同条件下的行为，并将算法与以前发布的算法进行比较。具有突发性的完整非参数主题模型仅比用于LDA的标准Gibbs采样慢一小部分，并且需要两倍的内存，这使其具有很高的竞争力。我们研究了不同模型的比较行为，并提出了一些实验见解。

著录项

来源
《SIGKDD explorations》 |2014年第cdarom期|共10页
作者
Wray Buntine; Swapnil Mishra;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类 TP274.2;
关键词
Topic modelling; Experimental results; Non-parametric prior; Text;

机译：主题建模;实验结果;非参数先验;文本;

相似文献

外文文献
中文文献
专利

1. Experiments with Non-parametric Topic Models [J] . Wray Buntine, Swapnil Mishra SIGKDD explorations . 2014,第CDaROM期

机译：非参数主题模型实验
2. NON-PARAMETRIC TOPIC MODEL FOR DISCOVERING GEOGRAPHICAL TOPIC VARIATIONS [J] . Qi Xiang, Huang Yu, Song Jun, 电子科学学刊（英文版） . 2014,第006期

机译：发现地理主题变化的非参数主题模型
3. Dynamic non-parametric joint sentiment topic mixture model [J] . Fu Xianghua, Yang Kun, Huang Joshua Zhexue, Knowledge-Based Systems . 2015,第jula期

机译：动态非参数联合情感话题混合模型
4. Non-parametric Method of Topic Identification Using Granularity Concept and Graph-Based Modeling [C] . Isha Ganguli, Jaya Sil, Nandita Sengupta IEEE International Conference on Soft Computing and Machine Intelligence . 2019

机译：基于粒度概念和图模型的主题识别非参数方法
5. New Models and Methods for Applied Statistics: Topics in Computer Experiments and Time Series Analysis [D] . Zhao, Yibo. 2017

机译：应用统计的新模型和方法：计算机实验和时间序列分析中的主题
6. Two general methods for population pharmacokinetic modeling: non-parametric adaptive grid and non-parametric Bayesian [O] . Tatiana Tatarinova, Michael Neely, Jay Bartroff, -1

机译：人口药代动力学建模的两种通用方法：非参数自适应网格和非参数贝叶斯
7. A non-parametric mixture model for topic modeling over time [O] . Dubey, Avinava, Hefny, Ahmed, Williamson, Sinead, 2012

机译：随时间进行主题建模的非参数混合模型
8. Technical Topic 3.2.2.d Bayesian and Non-Parametric Statistics: Integration of Neural Networks with Bayesian Networks for Data Fusion and Predictive Modeling. [R] . Bell, S. 2016

机译：技术主题3.2.2.d贝叶斯和非参数统计：神经网络与贝叶斯网络的集成，用于数据融合和预测建模。

Experiments with Non-parametric Topic Models

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅