Scalable Moment-Based Inference for Latent Dirichlet Allocation

机译：潜在狄利克雷分配的基于可伸缩矩的推理

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Topic models such as Latent Dirichlet Allocation have been useful text analysis methods of wide interest. Recently, moment-based inference with provable performance has been proposed for topic models. Compared with inference algorithms that approximate the maximum likelihood objective, moment-based inference has theoretical guarantee in recovering model parameters. One such inference method is tensor orthogonal decomposition, which requires only mild assumptions for exact recovery of topics. However, it suffers from scalability issue due to creation of dense, high-dimensional tensors. In this work, we propose a speedup technique by leveraging the special structure of the tensors. It is efficient in both time and space, and only requires scanning the corpus twice. It improves over the state-of-the-art inference algorithm by one to three orders of magnitude, while preserving equal inference ability.

机译：诸如潜在狄利克雷分配等主题模型已成为广泛关注的有用的文本分析方法。最近，针对主题模型提出了具有可证明性能的基于矩的推理。与近似最大似然目标的推理算法相比，基于矩的推理在恢复模型参数方面具有理论上的保证。一种这样的推理方法是张量正交分解，它只需要适度的假设就可以准确地恢复主题。但是，由于创建密集的高维张量，因此存在可伸缩性问题。在这项工作中，我们提出了一种利用张量的特殊结构的加速技术。它在时间和空间上都是高效的，并且只需要扫描语料库两次即可。在保持相同的推理能力的同时，它比最新的推理算法提高了一个到三个数量级。

著录项

来源
《European conference on machine learning and knowledge discovery in databases》|2014年|290-305|共16页
会议地点
作者
Chi Wang; Xueqing Liu; Yanglei Song; Jiawei Han;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Inference for the Number of Topics in the Latent Dirichlet Allocation Model via Bayesian Mixture Modeling [J] . Chen Zhe, Doss Hani Journal of computational and graphical statistics: A joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America . 2019,第3期

机译：通过贝叶斯混合建模潜在的Dirichlet分配模型中的主题次数的推断
2. Optimisation towards Latent Dirichlet Allocation: Its Topic Number and Collapsed Gibbs Sampling Inference Process [J] . Bambang Subeno, Retno Kusumaningrum, Farikhin Farikhin International Journal of Electrical and Computer Engineering . 2018,第5期

机译：潜在Dirichlet分配的优化：其主题号和折叠的Gibbs抽样推断过程
3. Collective motion pattern inference via Locally Consistent Latent Dirichlet Allocation [J] . Zou Jialing, Ye Qixiang, Cui Yanting, Neurocomputing . 2016,第apra5期

机译：通过局部一致的潜在Dirichlet分配进行集体运动模式推断
4. Scalable Moment-Based Inference for Latent Dirichlet Allocation [C] . Chi Wang, Xueqing Liu, Yanglei Song, European conference on machine learning and knowledge discovery in databases . 2014

机译：基于可伸缩的时刻基于潜在Dirichlet分配的推断
5. Comparing latent Dirichlet allocation and latent semantic analysis as classifiers [D] . Anaya, Leticia H. 2011

机译：比较潜在Dirichlet分配和潜在语义分析作为分类器
6. Latent Dirichlet allocation model for world trade analysis [O] . Diego Kozlowski, Viktoriya Semeshenko, Andrea Molinari 2021

机译：世界贸易分析潜在的Dirichlet分配模型
7. Discovering novel mutation signatures by latent Dirichlet allocation with variational Bayes inference [O] . Taro Matsutani, Yuki Ueno, Tsukasa Fukunaga, 2019

机译：通过与变分贝叶斯推论的潜在Dirichlet分配发现新的突变签名

Scalable Moment-Based Inference for Latent Dirichlet Allocation

摘要

著录项

相似文献

相关主题

期刊订阅