首页> 外文会议>International Conference on Computer Science and Education >Top-k Skyline Result Optimization Algorithm in MapReduce
【24h】

Top-k Skyline Result Optimization Algorithm in MapReduce

机译:MapReduce中的Top-k天际线结果优化算法

获取原文

摘要

Skyline is widely used in multi-objective decision-making, data visualization and other fields. With the rapid increasing of data volume, skyline of big data has also attracted more and more attention. However, skyline of big data has its own shortcomings. When the dimension increases, skyline results will be numerous, and we would like to select k points from the result sets. In this paper, we propose the top-k skyline of big data. It is a Top-k Skyline Method in MapReduce, called MR-DMKS. Firstly, we convert the multidimensional data to a single value to determine the dominance relationship of two data points. Secondly, we calculate the score by using the converted values. Thirdly, sort the data points more efficiently and accurately according to the scores using a window queue. Finally, we choose k data objects having the strongest dominating capacity. A large number of experiments show that our method is effective, and has good flexibility and scalability on real data sets as well as synthetic data sets.
机译:Skyline广泛用于多目标决策,数据可视化等领域。随着数据量的快速增长,大数据的天际线也引起了越来越多的关注。但是,大数据的天际线有其自身的缺点。当维数增加时,天际线结果将很多,我们希望从结果集中选择k个点。在本文中,我们提出了大数据的前k个天际线。它是MapReduce中的Top-k Skyline方法,称为MR-DMKS。首先,我们将多维数据转换为单个值,以确定两个数据点的优势关系。其次,我们使用转换后的值计算分数。第三,使用窗口队列根据分数更有效,更准确地对数据点进行排序。最后,我们选择具有最强控制能力的k个数据对象。大量实验表明,我们的方法是有效的,并且在真实数据集和合成数据集上均具有良好的灵活性和可伸缩性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号