【24h】

Minimum Volume Topic Modeling

机译:最小卷主题建模

获取原文
           

摘要

We propose a new topic modeling procedure that takes advantage of the fact that the Latent Dirichlet Allocation (LDA) log-likelihood function is asymptotically equivalent to the logarithm of the volume of the topic simplex. This allows topic modeling to be reformulated as finding the probability simplex that minimizes its volume and encloses the documents that are represented as distributions over words. A convex relaxation of the minimum volume topic model optimization is proposed, and it is shown that the relaxed problem has the same global minimum as the original problem under the separability assumption and the sufficiently scattered assumption introduced by Arora et al. (2013) and Huang et al. (2016). A locally convergent alternating direction method of multipliers (ADMM) approach is introduced for solving the relaxed minimum volume problem. Numerical experiments illustrate the benefits of our approach in terms of computation time and topic recovery performance.
机译:我们提出了一个新的主题建模程序,该程序利用了潜在Dirichlet分配(LDA)对数似然函数渐近地等于主题单纯形的对数的事实。这使得主题建模可以重新构造为找到概率单形函数,该概率单形函数可以最大程度地减少其体积,并包含以单词分布形式表示的文档。提出了最小体积主题模型优化的凸松弛方法,证明了在Arora等人提出的可分离性假设和充分分散假设下,松弛问题与原始问题具有相同的全局最小值。 (2013)和Huang等。 (2016)。为了解决松弛最小体积问题,引入了局部收敛的乘数交变方向乘积法(ADMM)。数值实验说明了我们的方法在计算时间和主题恢复性能方面的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号