Evaluating Topic Modeling Interpretability Using Topic Labeled Gold-standard Sets

Biagio Palese; Gabriele Piccoli

首页> 外文期刊>Communications of the Association for Information Systems >Evaluating Topic Modeling Interpretability Using Topic Labeled Gold-standard Sets

【24h】

Evaluating Topic Modeling Interpretability Using Topic Labeled Gold-standard Sets

机译：评估主题使用标有金标准集的主题建模解释性

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The paucity of rigorous evaluation measures undermines topic modeling results’ validity and trustworthiness. Accordingly, we propose a method that researchers can use to select models when they assess topics’ human interpretability. We show how they can evaluate different topic models using gold-standard sets that humans label. Our approach ensures that the topics extracted algorithmically from an entire corpus concur with the themes humans would have identified in the same documents. By doing so, we combine human coding’s advantages for topic interpretability with algorithmic topic Modeling’s analytical efficiency and scalability. We demonstrate that one can rigorously identify optimal model parametrizations for maximum interpretability and to rigorously justify model selection. We also contribute three open access gold-standard sets in the hospitality context and make them available so other researchers can use them to benchmark their models or validate their results. Finally, we showcase a methodology for designing and developing gold-standard sets for validating topic models, which researchers interested in developing gold-standard sets in domains and contexts appropriate for their research can use.

机译：严格的评估措施的缺乏破坏了建模结果的有效性和可信度。因此，我们提出了一种方法，研究人员可以在评估主题的人类解释性时选择模型。我们展示了如何使用人类标签的金标准套装评估不同主题模型。我们的方法确保从整个语料库中提取的主题与主题人类在同一文件中识别。通过这样做，我们将人类编码的优势与算法主题建模的分析效率和可扩展性相结合。我们展示了一个人可以严格地识别最佳模型参数化，以获得最大的解释性，并严格证明模型选择。我们还在酒店上下文中提供三种开放式访问金标准集，使其可用，因此其他研究人员可以使用它们来基准测试模型或验证其结果。最后，我们展示了用于设计和开发用于验证主题模型的金标准的方法，研究人员对在适合其研究的域和上下文中开发金标准集的研究人员可以使用。

著录项

来源
《Communications of the Association for Information Systems》 |2020年第a期|共19页
作者
Biagio Palese; Gabriele Piccoli;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类 TU717;
关键词
Human Interpretable TopicsGold-standard SetText MiningTopic EvaluationTopic Interpretability MeasureTopic Modeling;

机译：人类可解释的主题标准标准Settext Minapictopic评估Topic Interpetity MeasualTopic建模;

相似文献

外文文献
中文文献
专利

1. Evaluating topic model interpretability from a primary care physician perspective [J] . Arnold Corey W., Oh Andrea, Chen Shawn, Computer Methods and Programs in Biomedicine: An International Journal Devoted to the Development, Implementation and Exchange of Computing Methodology and Software Systems in Biomedical Research and Medical Practice . 2016,第1期

机译：从初级保健医生的角度评估主题模型的可解释性
2. A Few Good Topics: Experiments in Topic Set Reduction for Retrieval Evaluation [J] . JOHN GUIVER, STEFANO MIZZARO, STEPHEN ROBERTSON ACM Transactions on Information Systems . 2009,第4期

机译：几个不错的主题：主题集约简的检索评估实验
3. Drawing openness to experience from user generated contents: An interpretable data-driven topic modeling approach [J] . Zhang Yishi, Wei Haiying, Ran Yaxuan, Expert systems with applications . 2020,第Apra期

机译：从用户生成的内容绘制开放性：可解释的数据驱动主题建模方法
4. Label-Related/Unrelated Topic Switching Model: A Partially Labeled Topic Model Handling Infinite Label-Unrelated Topics [C] . Ida Yasutoshi, Nakamura Takuma, Matsumoto Takashi 2013 2nd IAPR Asian Conference on Pattern Recognition . 2013

机译：标签相关/无关主题切换模型：处理无限标签无关主题的部分标签主题模型
5. Social Inferences from Animations in Agenesis of the Corpus Callosum: Labeled Topic Modeling [D] . Renteria-Vazquez, Tiffany A. 2019

机译：Call体再生中动画的社会推理：标记主题建模
6. Evaluating Topic Model Interpretability from a Primary Care Physician Perspective [O] . Corey W. Arnold, Andrea Oh, Shawn Chen, -1

机译：从初级保健医师的角度评估主题模型的可解释性
7. Entities as topic labels : improving topic interpretability and evaluability combining Entity Linking and Labeled LDA [O] . Nanni Federico, Ruiz Fabo Pablo 2016

机译：实体作为主题标签：结合实体链接和标记的LDA来提高主题的可解释性和可评估性

Evaluating Topic Modeling Interpretability Using Topic Labeled Gold-standard Sets

摘要

著录项

相似文献

相关主题

期刊订阅