首页> 外文期刊>Neurocomputing >Projective Art With Buffers For The High Dimensional Space Clustering And An Application To Discover Stock Associations
【24h】

Projective Art With Buffers For The High Dimensional Space Clustering And An Application To Discover Stock Associations

机译:具有高维空间聚类的缓冲区的投影艺术及其在发现股票关联中的应用

获取原文
获取原文并翻译 | 示例
       

摘要

Unlike to traditional hierarchical and partitional clustering algorithms which always fail to deal with very large databases, a neural network architecture, projective adaptive resonance theory (PART), is developed for the high dimensional space clustering. However, the success of the PART algorithm depends on both accurate parameters and satisfied orders of input data sets. These disadvantages prevent PART from being applied to realtime databases. In this paper, we propose an improved method, PART with buffer management, to overcome these disadvantages. The major contributions of our method are introducing a buffer management and a new similar degree function and buffer checkout process. The buffer management mechanism allows data sets not to be immediately clustered to one cluster. The purpose of the average similar degree is to successfully work with high similar noise data sets and partly achieve an order-independent objective without correct parameters. And the average similar degree has a good attribute, the parameter-tolerance. Namely, the clustering result does not depend on the precise choice of input parameters, and different parameter values have close clustering results including dimensions associated with clusters. The buffer checkout process can handle a huge amount of input data sets by a small buffer space. Also, simulations and comparisons in high dimensional spaces are reported, and an application by using our algorithm to find stock concurrence association rules is given finally.
机译:与始终无法处理非常大的数据库的传统分层和分区聚类算法不同,神经网络体系结构(射影自适应共振理论(PART))针对高维空间聚类而开发。但是,PART算法的成功取决于准确的参数和输入数据集的满意顺序。这些缺点使PART无法应用于实时数据库。在本文中,我们提出了一种改进的方法,即具有缓冲区管理功能的PART,以克服这些缺点。我们方法的主要贡献是引入了缓冲区管理以及新的相似度函数和缓冲区检查过程。缓冲区管理机制允许数据集不立即群集到一个群集中。平均相似度的目的是成功处理高相似噪声数据集,并在没有正确参数的情况下部分实现与订单无关的目标。并且平均相似度具有良好的属性,即参数容差。即,聚类结果不取决于输入参数的精确选择,并且不同的参数值具有紧密的聚类结果,包括与聚类相关联的维度。缓冲区签出过程可以通过较小的缓冲区空间处理大量输入数据集。此外,还对高维空间中的仿真和比较进行了报道,并最终给出了使用我们的算法查找股票并发关联规则的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号