首页> 外文期刊>IEEE transactions on visualization and computer graphics >Interactive Steering of Hierarchical Clustering
【24h】

Interactive Steering of Hierarchical Clustering

机译:分层群集的互动转向

获取原文
获取原文并翻译 | 示例
       

摘要

Hierarchical clustering is an important technique to organize big data for exploratory data analysis. However, existing one-size-fits-all hierarchical clustering methods often fail to meet the diverse needs of different users. To address this challenge, we present an interactive steering method to visually supervise constrained hierarchical clustering by utilizing both public knowledge (e.g., Wikipedia) and private knowledge from users. The novelty of our approach includes 1) automatically constructing constraints for hierarchical clustering using knowledge (knowledge-driven) and intrinsic data distribution (data-driven), and 2) enabling the interactive steering of clustering through a visual interface (user-driven). Our method first maps each data item to the most relevant items in a knowledge base. An initial constraint tree is then extracted using the ant colony optimization algorithm. The algorithm balances the tree width and depth and covers the data items with high confidence. Given the constraint tree, the data items are hierarchically clustered using evolutionary Bayesian rose tree. To clearly convey the hierarchical clustering results, an uncertainty-aware tree visualization has been developed to enable users to quickly locate the most uncertain sub-hierarchies and interactively improve them. The quantitative evaluation and case study demonstrate that the proposed approach facilitates the building of customized clustering trees in an efficient and effective manner.
机译:分层聚类是组织大数据进行探索数据分析的重要技术。但是,现有的单尺寸适合所有分层聚类方法通常无法满足不同用户的不同需求。为解决这一挑战,我们通过利用来自用户的公共知识(例如,维基百科)和私人知识来介绍一个互动转向方法,以通过公共知识(例如,维基百科)和私人知识。我们的方法的新颖性包括1)使用知识(知识驱动)和内在数据分布(数据驱动)和2)自动构建分层群集的约束,并通过可视接口(用户驱动)来实现聚类的交互式转向。我们的方法首先将每个数据项映射到知识库中最相关的项目。然后使用蚁群优化算法提取初始约束树。该算法平衡树宽度和深度,并覆盖高度置信度。鉴于约束树,数据项使用进化贝叶斯玫瑰树进行分层集群。为了清楚地传达分层聚类结果,已经开发出不确定性感知树可视化以使用户能够快速定位最不确定的子层次结构并交互地改善它们。定量评估和案例研究表明,拟议的方法有助于以有效有效的方式建立定制的聚类树木。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号