【24h】

Replicated Softmax: an Undirected Topic Model

机译:复制的Softmax:无向主题模型

获取原文

摘要

We introduce a two-layer undirected graphical model, called a "Replicated Softmax", that can be used to model and automatically extract low-dimensional latent semantic representations from a large unstructured collection of documents. We present efficient learning and inference algorithms for this model, and show how a Monte-Carlo based method, Annealed Importance Sampling, can be used to produce an accurate estimate of the log-probability the model assigns to test data. This allows us to demonstrate that the proposed model is able to generalize much better compared to Latent Dirichlet Allocation in terms of both the log-probability of held-out documents and the retrieval accuracy.
机译:我们引入了两层无方向性图形模型,称为“ Replicated Softmax”,可用于建模和自动从大型非结构化文档集中提取低维潜在语义表示。我们为该模型提供了有效的学习和推理算法,并展示了基于蒙特卡洛的方法,即退火重要性抽样,可以用于对模型分配给测试数据的对数概率进行准确估计。这使我们能够证明,相对于潜在的Dirichlet分配,所提出的模型在保留文档的对数概率和检索准确性方面都能够更好地推广。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号