首页> 外文期刊>IEEE transactions on visualization and computer graphics >UTOPIAN: User-Driven Topic Modeling Based on Interactive Nonnegative Matrix Factorization
【24h】

UTOPIAN: User-Driven Topic Modeling Based on Interactive Nonnegative Matrix Factorization

机译:UTOPIAN:基于交互式非负矩阵分解的用户驱动主题建模

获取原文
获取原文并翻译 | 示例
       

摘要

Topic modeling has been widely used for analyzing text document collections. Recently, there have been significant advancements in various topic modeling techniques, particularly in the form of probabilistic graphical modeling. State-of-the-art techniques such as Latent Dirichlet Allocation (LDA) have been successfully applied in visual text analytics. However, most of the widely-used methods based on probabilistic modeling have drawbacks in terms of consistency from multiple runs and empirical convergence. Furthermore, due to the complicatedness in the formulation and the algorithm, LDA cannot easily incorporate various types of user feedback. To tackle this problem, we propose a reliable and flexible visual analytics system for topic modeling called UTOPIAN (User-driven Topic modeling based on Interactive Nonnegative Matrix Factorization). Centered around its semi-supervised formulation, UTOPIAN enables users to interact with the topic modeling method and steer the result in a user-driven manner. We demonstrate the capability of UTOPIAN via several usage scenarios with real-world document corpuses such as InfoVis/VAST paper data set and product review data sets.
机译:主题建模已被广泛用于分析文本文档集合。最近,各种主题建模技术取得了重大进展,尤其是以概率图形建模的形式。潜在的狄利克雷分配(LDA)等最先进的技术已成功应用于可视文本分析中。但是,大多数基于概率模型的广泛使用方法在多次运行的一致性和经验收敛方面均存在缺陷。此外,由于公式和算法的复杂性,LDA无法轻易合并各种类型的用户反馈。为了解决这个问题,我们为主题建模提出了一种可靠且灵活的可视化分析系统,称为UTOPIAN(基于交互式非负矩阵分解的用户驱动主题建模)。围绕其半监督公式,UTOPIAN使用户能够与主题建模方法进行交互,并以用户驱动的方式引导结果。我们通过几种使用场景以及实际文档语料库(例如InfoVis / VAST纸张数据集和产品评论数据集)来演示UTOPIAN的功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号