首页> 外文会议>2011 23rd IEEE International Conference on Tools with Artificial Intelligence >TopicView: Visually Comparing Topic Models of Text Collections
【24h】

TopicView: Visually Comparing Topic Models of Text Collections

机译:TopicView:视觉比较文本集合的主题模型

获取原文

摘要

We present Topic View, an application for visually comparing and exploring multiple models of text corpora. Topic View uses multiple linked views to visually analyze both the conceptual content and the document relationships in models generated using different algorithms. To illustrate Topic View, we apply it to models created using two standard approaches: Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA). Conceptual content is compared through the combination of (i) a bipartite graph matching LSA concepts with LDA topics based on the cosine similarities of model factors and (ii) a table containing the terms for each LSA concept and LDA topic listed in decreasing order of importance. Document relationships are examined through the combination of (i) side-by-side document similarity graphs, (ii) a table listing the weights for each document's contribution to each concept/topic, and (iii) a full text reader for documents selected in either of the graphs or the table. We demonstrate the utility of Topic View's visual approach to model assessment by comparing LSA and LDA models of two example corpora.
机译:我们提出主题视图,该应用程序用于可视地比较和探索文本语料库的多种模型。主题视图使用多个链接视图,以可视方式分析使用不同算法生成的模型中的概念内容和文档关系。为了说明主题视图,我们将其应用于使用两种标准方法创建的模型:潜在语义分析(LSA)和潜在狄利克雷分配(LDA)。通过(i)基于模型因子的余弦相似度将LSA概念与LDA主题匹配的二部图和(ii)包含每个LSA概念和LDA主题的术语的列表(按重要性降序)的组合来比较概念内容。通过(i)并排的文档相似度图,(ii)列出每个文档对每个概念/主题的权重的表格以及(iii)全文阅读器中的文档组合来检查文档关系。无论是图形还是表格。通过比较两个示例语料库的LSA和LDA模型,我们演示了主题视图的可视化方法在模型评估中的实用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号