Release 'Bag-of-Words' Assumption of Latent Dirichlet Allocation

机译：释放“袋子的袋子”潜在Dirichlet分配的假设

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Based on vector-based representation, topic models, like latent Dirichlet allocation (LDA), are constructed for documents with 'bag-of-words' assumption. They can discover the distribution of underlying topics in a document and the distribution of keywords in a topic, which have been proved very successful and practical in many scenarios, recently. Comparing vector-based representation of documents, graph-based representation method can preserve more semantics of documents, because not only keywords but also the relations between them in documents are considered. In this paper, a topic model for graph-represented documents (GTM) is proposed. In this model, a Bernoulli distribution is used to model the formation of the edge between two keywords in a document. The experimental results show that GTM outperforms LDA in document classification task using the unveiled topics from these two models to represent documents.

机译：基于基于向量的表示，主题模型如潜在的Dirichlet分配（LDA），用于带有“单词袋”假设的文档。他们可以发现文档中的基础主题的分发以及在一个主题中的关键字分发，最近在许多情况下被证明非常成功和实用。比较基于传感器的文档表示，基于图形的表示方法可以保留更多的文档语义，因为不仅关键字，而且考虑其中的文档之间的关系。在本文中，提出了一个图形文档（GTM）的主题模型。在该模型中，伯努利分布用于模拟文档中的两个关键字之间的边缘的形成。实验结果表明，使用来自这两个模型的揭幕主题来表示文档分类任务中的GTM优于LDA来表示文档。

著录项

来源
《ISKE 2013;International Conference on Intelligent Systems and Knowledge Engineering》|2014年||共10页
会议地点
作者
Junyu Xuan; Jie Lu; Guangquan Zhang; Xiangfeng Luo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-532;
关键词
Topic model; Latent Dirichlet allocation (LDA); Graph-based document representation; Text mining;

机译：主题模型;潜在的Dirichlet分配（LDA）;基于图形的文件表示;文本挖掘;

相似文献

外文文献
中文文献
专利

1. Exploring Symmetrical and Asymmetrical Dirichlet Priors for Latent Dirichlet Allocation [J] . Shaheen Syed, Marco Spruit International journal of semantic computing . 2018,第3期

机译：探索对称和不对称的Dirichlet Priors潜在的Dirichlet分配
2. A comparative analysis of Latent Semantic analysis and Latent Dirichlet allocation topic modeling methods using Bible data [J] . Vasantha Kumari Garbhapu, Prajna Bodapati Indian Journal of Science and Technology . 2020,第44期

机译：潜在语义分析与潜在的Dirichlet分配主题建模方法的比较分析
3. Discovering Latent Topics by Gaussian Latent Dirichlet Allocation and Spectral Clustering [J] . Yuan Bo, Gao Xinbo, Niu Zhenxing, ACM transactions on multimedia computing communications and applications . 2019,第1期

机译：通过高斯潜在Dirichlet分配和谱聚类发现潜在主题
4. Release 'Bag-of-Words' Assumption of Latent Dirichlet Allocation [C] . Junyu Xuan, Jie Lu, Guangquan Zhang, ISKE 2013 . 2014

机译：释放'袋袋子'假设潜在的Dirichlet分配
5. Comparing latent Dirichlet allocation and latent semantic analysis as classifiers [D] . Anaya, Leticia H. 2011

机译：比较潜在Dirichlet分配和潜在语义分析作为分类器
6. Latent Dirichlet allocation model for world trade analysis [O] . Diego Kozlowski, Viktoriya Semeshenko, Andrea Molinari 2021

机译：世界贸易分析潜在的Dirichlet分配模型
7. Comparing hierarchical dirichlet process with latent dirichlet allocation in bug report multiclass classification [O] . Nachai Limsettho, Hideaki Hata, Ken-ichi Matsumoto 2014

机译：将分层DireChlet进程与潜在Dirichlet分配进行比较Muglate Classification

Release 'Bag-of-Words' Assumption of Latent Dirichlet Allocation

摘要

著录项

相似文献

相关主题

期刊订阅