Human Perception of Enriched Topic Models

机译：人类对丰富主题模型的感知

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Topic modeling algorithms, such as LDA, find topics, hidden structures, in document corpora in an unsupervised manner. Traditionally, applications of topic modeling over textual data use the bag-of-words model, i.e. only consider words in the documents. In our previous work we developed a framework for mining enriched topic models. We proposed a bag-of-features approach, where a document consists not only of words but also of linked named entities and their related information, such as types or categories. In this work we focused on the feature engineering and selection aspects of enriched topic modeling and evaluated the results based on two measures for assessing the understandability of estimated topics for humans: model precision and topic log odds. In our 10-model experimental setup with 7 pure resource-, 2 hybrid words/resource- and one word-based model, the traditional bag-of-words models were outperformed by 5 pure resource-based models in both measures. These results show that incorporating background knowledge into topic models makes them more understandable for humans.

机译：主题建模算法（例如LDA）以无监督的方式在文档语料库中找到主题，隐藏结构。传统上，在文本数据上进行主题建模的应用程序使用词袋模型，即仅考虑文档中的词。在我们以前的工作中，我们开发了一个用于挖掘丰富主题模型的框架。我们提出了一种功能袋方法，其中文档不仅由单词组成，而且由链接的命名实体及其相关信息（例如类型或类别）组成。在这项工作中，我们专注于丰富主题建模的特征工程和选择方面，并基于两种方法对结果进行了评估，这些方法用于评估人类对估计主题的可理解性：模型精度和主题对数比。在我们的具有10个模型的实验设置中，这7个纯资源模型，2个混合词/资源模型和一个基于单词的模型，在这两个方面，传统的词袋模型均优于5个基于纯资源的模型。这些结果表明，将背景知识整合到主题模型中可以使人类更容易理解它们。

著录项

来源
《International conference on business information systems》|2018年|15-29|共15页
会议地点
作者
Wojciech Lukasiewicz; Alexandra Todor; Adrian Paschke;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. ETM: Enrichment by topic modeling for automated clinical sentence classification to detect patients' disease history [J] . Bagheri Ayoub, Sammani Arjan, van der Heijden Peter G. M., Journal of Intelligent Information Systems . 2020,第2期

机译：ETM：由自动临床句子分类进行主题建模的富集检测患者疾病史
2. TRAVELOGUE ENRICHING AND SCENIC SPOT OVERVIEW BASED ON TEXTUAL AND VISUAL TOPIC MODELS [J] . YANWEI PANG, XIN LU, YUAN YUAN, International Journal of Pattern Recognition and Artificial Intelligence . 2011,第3期

机译：基于文本和视觉主题模型的旅行丰富和风景点概览
3. The Effect of Dietary Fiber Enrichment on Taste Perception in Humans [J] . Thavaraj Pridhuvi, Gidley Mike J., Roura Eugeni Chemical Senses . 2015,第5期

机译：膳食纤维富集对人类味觉的影响
4. Human Perception of Enriched Topic Models [C] . Wojciech Lukasiewicz, Alexandra Todor, Adrian Paschke International conference on business information systems . 2018

机译：富集主题模型的人类感知
5. Collaborative Communication Interruption Management System (C-CIMS): Modeling Interruption Timings via Prosodic and Topic Modeling for Human-Machine Teams [D] . Peters, Nia S. 2017

机译：协作通信中断管理系统（C-CIMS）：通过人机团队的韵律和主题建模对中断时间进行建模
6. Topic Modeling for Analyzing Patients’ Perceptions and Concerns of Hearing Loss on Social QA Sites: Incorporating Patients’ Perspective [O] . Junghwa Bahng, Chang Heon Lee 2020

机译：分析患者对社会问答障碍损失患者的看法和担忧的主题建模：纳入患者的观点

Human Perception of Enriched Topic Models

摘要

著录项

相似文献

相关主题

期刊订阅