首页> 外文期刊>Progress in Artificial Intelligence >A new visualization tool for data mining techniques
【24h】

A new visualization tool for data mining techniques

机译:一种新的可视化工具,用于数据挖掘技术

获取原文
获取原文并翻译 | 示例
           

摘要

Clustering techniques and classification trees are two of the main techniques used in datamining but, at present, there is still a lack of visualization methods for these tools. Many graphs associated with clustering, also with hierarchical clustering, do not give any information about the values of the centroids' attributes and the relationships among them. In classification trees, graphical procedures can also be developed to help simplify their interpretation and to obtain a better understanding, but more visualization methods to support this tool are needed. This paper presents a novel visualization technique called sectors on sectors (SonS), and an extended version called multidimensional sectors on sectors (MDSonS), for improving the interpretation of several data mining algorithms.Thesemethods are applied for visualizing the results of: (a) hierarchical clustering, whichmakes it possible to extract all the existing relationships among centroids' attributes at any hierarchy level; (b) growing hierarchical self-organizing maps (GHSOM), a variant of thewell-known self-organizing maps (SOM), by means of which it is possible to visualize, simultaneously, the data information at each hierarchy level compactly and extract relationships among variables; (c) classification trees, in which the SonS is used for representing the input data information for each class presented in each terminal node of a classification tree providing extra information for a better understanding of the problem. These methods are tested by means of several data sets (real and synthetic). The achieved results show the suitability and usefulness of the proposed approaches.
机译:聚类技术和分类树是数据挖掘中使用的两种主要技术,但是目前,这些工具仍然缺少可视化方法。许多与聚类以及层次聚类相关的图都没有提供有关质心属性值及其之间关系的任何信息。在分类树中,还可以开发图形过程来帮助简化其解释并获得更好的理解,但是需要更多的可视化方法来支持此工具。本文提出了一种新颖的可视化技术,称为扇区上的扇区(SonS),以及一种扩展的版本,称为多维扇区上的扇区(MDSonS),用于改进对几种数据挖掘算法的解释。这些方法用于可视化以下结果:层次聚类,这使得在任何层次级别提取质心属性之间的所有现有关系成为可能; (b)不断增长的分层自组织图(GHSOM),这是众所周知的自组织图(SOM)的一种变体,借助它可以同时可视化每个分层级别的数据信息并提取关系在变量之间; (c)分类树,其中SonS用于表示在分类树的每个终端节点中呈现的每个类的输入数据信息,为更好地理解问题提供了额外的信息。这些方法通过几个数据集(真实和合成)进行测试。所取得的结果表明了所提出方法的适用性和实用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号