Combining semantic graph and probabilistic topic models for discovering coherent topics

Allahyari Mehdi; Pouriyeh Seyedamin; Kochut Krys

Abstract

Probabilistic topic models, which typically represent topics as multinomial distributions over words, have been used extensively to discover latent topics in text corpora. However, because topic models are entirely unsupervised, they may produce topics that are not understandable in applications. Recently, several knowledge-based topic models have been proposed. These primarily use word-level domain knowledge to enhance topic coherence, and they ignore the rich information carried by the entities (e.g., persons, locations, organizations) associated with the documents. Additionally, a vast amount of prior (background) knowledge is available as Linked Open Data (LOD) datasets and other ontologies, which can be incorporated into topic models to produce coherent topics. In this paper, we introduce a novel regularization entity-based topic model (RETM), which integrates an ontology with an entity-based topic model (EntLDA) to increase the coherence of the identified topics during the topic modeling process. Our experimental results demonstrate the effectiveness of the proposed model in improving topic coherence.

Bibliographic details

Source: Web Intelligence and Agent Systems, 2019, Issue 4, pp. 365-379 (15 pages)
Authors: Allahyari Mehdi; Pouriyeh Seyedamin; Kochut Krys
Affiliations:
Department of Computer Science, Georgia Southern University
Department of Information Technology, Kennesaw State University
Department of Computer Science, University of Georgia
Indexed in: Engineering Index (EI), USA
Original format: PDF
Language: English
Keywords: Statistical learning; topic modeling; topic coherence; Semantic Web; ontologies
Date added to database: 2022-08-18 05:27:05

Similar literature

1. Combine Topic Modeling with Semantic Embedding: Embedding Enhanced Topic Model [J]. Zhang Peng, Wang Suge, Li Deyu. IEEE Transactions on Knowledge and Data Engineering, 2020, Issue 12.
2. Non-Parametric Topic Model for Discovering Geographical Topic Variations [J]. Qi Xiang, Huang Yu, Song Jun. Journal of Electronics (China), 2014, Issue 6.
3. Discovering author interest evolution in order-sensitive and Semantic-aware topic modeling [J]. Yang Min, Qu Qiang, Chen Xiaojun. Information Sciences: An International Journal, 2019.
4. Discovering Coherent Topics with Entity Topic Models [C]. Mehdi Allahyari, Krys Kochut. IEEE/WIC/ACM International Conference on Web Intelligence, 2016.
5. Probabilistic Topic Models for Graph Mining [D]. Cha, Young Chul. 2014.
6. Discovering Health Topics in Social Media Using Topic Models [O]. Michael J. Paul, Mark Dredze.
7. Discovering Routines from Large-Scale Human Locations using Probabilistic Topic Models [O]. Katayoun Farrahi. 2011.