...
首页> 外文期刊>Applied Soft Computing >A self-organizing map for transactional data and the related categorical domain
【24h】

A self-organizing map for transactional data and the related categorical domain

机译:事务数据和相关分类域的自组织映射

获取原文
获取原文并翻译 | 示例

摘要

After projecting high dimensional data into a two-dimension map via the SOM, users can easily view the inner structure of the data on the 2-D map. In the early stage of data mining, it is useful for any kind of data to inspect their inner structure. However, few studies apply the SOM to transactional data and the related categorical domain, which are usually accompanied with concept hierarchies. Concept hierarchies contain information about the data but are almost ignored in such researches. This may cause mistakes in mapping. In this paper, we propose an extended SOM model, the SOMCD, which can map the varied kinds of data in the categorical domain into a 2-D map and visualize the inner structure on the map. By using tree structures to represent the different kinds of data objects and the neurons' prototypes, a new devised distance measure which takes information embedded in concept hierarchies into consideration can properly find the similarity between the data objects and the neurons. Besides the distance measure, we base the SOMCD on a tree-growing adaptation method and integrate the U-Matrix for visualization. Users can hierarchically separate the trained neurons on the SOMCD's map into different groups and cluster the data objects eventually. From the experiments in synthetic and real datasets, the SOMCD performs better than other SOM variants and clustering algorithms in visualization, mapping and clustering.
机译:通过SOM将高维数据投影到二维地图中之后,用户可以轻松地在二维地图上查看数据的内部结构。在数据挖掘的早期,对任何类型的数据检查其内部结构都是有用的。但是,很少有研究将SOM应用于交易数据和相关的分类领域,这些通常伴随着概念层次结构。概念层次结构包含有关数据的信息,但在此类研究中几乎被忽略。这可能会导致映射错误。在本文中,我们提出了一个扩展的SOM模型SOMCD,它可以将分类域中的各种数据映射到二维地图中,并可视化地图上的内部结构。通过使用树结构表示不同种类的数据对象和神经元原型,一种新的距离度量方法(考虑了概念层次结构中嵌入的信息)可以正确地找到数据对象与神经元之间的相似性。除了距离测量之外,我们还基于树木生长的自适应方法建立SOMCD,并集成了U-Matrix进行可视化。用户可以在SOMCD的地图上将受过训练的神经元分层划分为不同的组,并最终将数据对象聚类。从合成和真实数据集中的实验来看,SOMCD在可视化,映射和聚类方面比其他SOM变体和聚类算法表现更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号