Conference: International Conference on Mobile Software Engineering and Systems
Best of breed solution for clustering of satellite images using bigdata platform spark

Abstract

Clustering is the unsupervised process of assigning entities to groups based on the similarities among them. Image clustering is a crucial step in mining satellite images. Because satellite imagery is now generated at a higher rate than in previous decades, better solutions are needed in terms of both accuracy and performance. In this paper, we propose a solution on the big data platform Apache Spark that clusters images using three methods: Scalable K-means++, Bisecting K-means, and Gaussian Mixture. Since the number of clusters is not known in advance for any of these methods, we also propose a Best of Breed approach that validates the number of clusters using the Simple Silhouette Index algorithm and thus provides the best clustering possible.
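The idea of validating the number of clusters with a silhouette score can be illustrated with a minimal single-machine sketch. It is not the paper's Spark implementation. It assumes that "Simple Silhouette Index" refers to the centroid-based simplified silhouette, in which each point is compared with its own centroid and with the nearest other centroid. The function names (`farthest_first_centers`, `kmeans`, `simplified_silhouette`, `best_k`) and the toy 2-D points are hypothetical, and a deterministic farthest-first seeding stands in for K-means++ seeding so that the example is reproducible.

```python
import math


def dist(p, q):
    """Euclidean distance between two equal-length points."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def farthest_first_centers(points, k):
    """Deterministic seeding (assumption: a stand-in for K-means++ seeding)."""
    centers = [points[0]]
    while len(centers) < k:
        # Next center: the point farthest from all centers chosen so far.
        centers.append(max(points, key=lambda p: min(dist(p, c) for c in centers)))
    return centers


def assign(points, centers):
    """Index of the nearest center for every point."""
    return [min(range(len(centers)), key=lambda j: dist(p, centers[j])) for p in points]


def kmeans(points, k, iterations=20):
    """Plain Lloyd iterations; returns the final centers and labels."""
    centers = farthest_first_centers(points, k)
    for _ in range(iterations):
        labels = assign(points, centers)
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:  # keep the old center if a cluster is empty
                centers[j] = tuple(sum(x) / len(members) for x in zip(*members))
    return centers, assign(points, centers)


def simplified_silhouette(points, centers, labels):
    """Mean of (b - a) / max(a, b), where a is the distance to the point's own
    centroid and b is the distance to the nearest other centroid (needs k >= 2)."""
    scores = []
    for p, l in zip(points, labels):
        a = dist(p, centers[l])
        b = min(dist(p, c) for j, c in enumerate(centers) if j != l)
        m = max(a, b)
        scores.append((b - a) / m if m > 0 else 0.0)
    return sum(scores) / len(scores)


def best_k(points, candidates):
    """Score every candidate k and return the best one with all scores."""
    scores = {}
    for k in candidates:
        centers, labels = kmeans(points, k)
        scores[k] = simplified_silhouette(points, centers, labels)
    return max(scores, key=scores.get), scores


# Toy stand-in for per-pixel feature vectors: three well-separated groups.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11),
          (20, 0), (20, 1), (21, 0), (21, 1)]
k, scores = best_k(points, range(2, 6))
```

On this toy data the score peaks at the true number of groups. On Spark the same selection loop could be run over models fitted by each of the three clustering methods, keeping whichever model and cluster count scores highest.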