首页> 外文会议>International conference on neuroinformatics >Categorical Data: An Approach to Visualization for Cluster Analysis
【24h】

Categorical Data: An Approach to Visualization for Cluster Analysis

机译:分类数据:一种用于聚类分析的可视化方法

获取原文

摘要

The problem of studying the cluster structure of a set of objects with qualitative (categorical) features is considered. We propose an approach to visualization of source data and categorical data groups in a form that is convenient for human analysis and decision-making. We generalized Andrews' idea of numeric data visualization for the case of categorical data set. The developed approach can be applied in the case when the frequency distribution of the joint appearance of feature pairs in the data sample is known. For visualization, it is proposed to use not the primary features of the data set, but new paired features that have a strong statistical relationship. In addition, we have corrected the spectral representation of Andrews curves, limiting the maximum frequency of harmonic functions. The proposed visual representation of categorical data makes it possible to estimate the number of clusters in a data set and show their differences. The technique is demonstrated on a model example in which the decision on the number of clusters is taken in conjunction with two other ways of visualizing data clusters: a silhouette and a heat map.
机译:考虑了研究具有定性(分类)特征的一组对象的簇结构的问题。我们提出一种可视化源数据和分类数据组的方法,该方法便于人类分析和决策。对于分类数据集,我们推广了安德鲁斯关于数字数据可视化的想法。在已知数据样本中特征对的联合出现的频率分布的情况下,可以应用开发的方法。为了可视化,建议不要使用数据集的主要特征,而应使用具有强大统计关系的新配对特征。另外,我们已经校正了安德鲁斯曲线的频谱表示,从而限制了谐波函数的最大频率。拟议的分类数据可视化表示使得可以估计数据集中的簇数并显示它们之间的差异。在一个模型示例中演示了该技术,在该模型示例中,对群集数量的决定与其他两种可视化数据群集的方式结合在一起:轮廓和热图。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号