首页> 外文期刊>Indian Journal of Science and Technology >A Weighted Distance Metric Clustering Method to Cluster Small Data Points from a Projected Database Generated from a Freespan Algorithm
【24h】

A Weighted Distance Metric Clustering Method to Cluster Small Data Points from a Projected Database Generated from a Freespan Algorithm

机译:一种基于Freespan算法生成的投影数据库中的小数据点的加权距离度量聚类方法

获取原文
           

摘要

Background/Objectives: Clustering and Sequential Pattern Mining is two most important unsupervised learning algorithms. The objective is to mine small projected databases rejected by Frequent Pattern - Projected Sequential Pattern mining (FreeSpan) technique using a weighted distance metric clustering method, a process of finding the distance between the small data points and cluster it so that it cannot be rejected. Methods/Statistical Analysis: The method involves the implementation of a distance metric clustering algorithm over a FreeSpan technique to cluster the data points of small projected databases. The FreeSpan technique can be considered as an ensemble of clustering and sequential pattern mining methods. Findings: The clustering method clusters the data points resulted from the FreeSpan technique that are ignored after the scanning process as their sizes are very small. The clustered data therefore gathers the ignored data points thereby providing an accurate clustered data containing small data points which results is trustable sequential pattern for future predictions. The proposed system reduces the complexity by incorporating just a single clustering algorithm. Therefore the major operations of the algorithm remain undisturbed and give its efficient output and also the output is found to be accurate and stable. Applications/Improvements: The technique proposed in the paper can be applied to datasets that needs to be clustered for decision making. The same technique holds good and can be made applicable to high dimensional views.
机译:背景/目的:聚类和顺序模式挖掘是两个最重要的无监督学习算法。目的是使用加权距离度量聚类方法来挖掘被频繁模式-投影顺序模式挖掘(FreeSpan)技术拒绝的小型投影数据库,该过程是查找小型数据点之间的距离并将其聚类以使其无法被拒绝的过程。方法/统计分析:该方法涉及在FreeSpan技术上实施距离度量聚类算法,以对小型投影数据库的数据点进行聚类。 FreeSpan技术可以看作是聚类和顺序模式挖掘方法的集合。结果:聚类方法对FreeSpan技术产生的数据点进行聚类,这些数据点在扫描过程后会被忽略,因为它们的大小非常小。因此,聚类数据收集了被忽略的数据点,从而提供了包含小数据点的准确聚类数据,其结果是可用于未来预测的可靠顺序模式。所提出的系统通过仅合并单个聚类算法来降低复杂度。因此,该算法的主要操作保持不受干扰,并给出了有效的输出,并且发现该输出是准确且稳定的。应用/改进:本文提出的技术可以应用于需要聚类以进行决策的数据集。相同的技术效果很好,可以应用于高维视图。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号