首页> 外文会议>International symposium on methodologies for intelligent systems >Clustering Ensemble for Prioritized Sampling Based on Average and Rough Patterns
【24h】

Clustering Ensemble for Prioritized Sampling Based on Average and Rough Patterns

机译:基于平均和粗糙模式的聚类集成优先采样

获取原文

摘要

This paper proposes a clustering ensemble for prioritized sampling to tackle a big data problem. The proposal first creates separate clustering schemes of objects using different dimensions of the dataset. These clustering schemes are then combined to create a representative sample based on all the possible combinations of profiles. The resulting clustering ensemble will help system developers to reduce the number of objects that need to be analyzed while making sure that all the profile combinations are comprehensively covered. The proposal further ranks the objects in the sample based on their ability to capture important aspects of each of the criteria. The proposed approach can be used to provide a priority based analysis/modelling over an extended period of time. The prioritized analysis/models will be available for use in a reasonably short period of time. The quality of the analysis/modelling will continuously improve as more and more objects in the sample are processed according to their rank in the sample. The proposal is applied to a large set of weather stations to create a ranked sample based on hourly and monthly variations of important weather parameters, such as temperature, solar radiation, wind speed, and humidity. The experiments also demonstrate how a combination of average and rough patterns help in creating more meaningful profiles.
机译:本文提出了一种聚类集合,用于优先采样以解决大数据问题。该提案首先使用数据集的不同维度创建对象的单独聚类方案。然后,根据所有可能的配置文件组合,将这些聚类方案组合起来以创建一个有代表性的样本。由此产生的集群集成将帮助系统开发人员减少需要分析的对象的数量,同时确保全面涵盖所有配置文件组合。该提议还根据对象捕获每个标准的重要方面的能力对样本中的对象进行排名。所提出的方法可用于在延长的时间段内提供基于优先级的分析/建模。优先的分析/模型将在相当短的时间内提供使用。随着样品中越来越多的对象根据其在样品中的排名进行处理,分析/建模的质量将不断提高。该提案应用于大量的气象站,以根据重要天气参数(例如温度,太阳辐射,风速和湿度)的每小时和每月变化来创建分级样本。实验还演示了平均模式和粗糙模式的组合如何帮助创建更有意义的配置文件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号