Map Reduce by K-Nearest Neighbor Joins

机译：通过K最近邻居加入来减少地图

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Knowledge discovery and Data mining plays a major role in computational intensive tasks with high range of applications. With the increase of volume and dimension of data, the distributed features perform operations in a reasonable period. MapReduce programming is suitable for distributed large scale data processing that provides different ways of solutions to the same problem, that (one) has particular constraints and properties. In this paper, we give comparative analysis and its approaches for computing KNN on MapReduce[1] theoretically and experimental evaluation. Load balancing, accuracy and complexity are analyzed on each step of data preprocessing, data partitioning and computation. The experiment results in this are produced by using variety of datasets. Time and Space complexity are analyzed periodically on each dataset and gives new advantages and short comings that are discussed for each algorithm. Finally this paper can be used as a reference material to handle KNN [2] based problems in the idea of Mapreducing in Big Data.

机译：知识发现和数据挖掘在具有大量应用程序的计算密集型任务中扮演着重要角色。随着数据量和数据量的增加，分布式功能会在合理的时间内执行操作。 MapReduce编程适合于分布式大规模数据处理，该处理为同一问题提供了不同的解决方案，（一个）具有特定的约束和属性。本文对MapReduce [1]上的KNN计算进行了比较分析及其方法，从理论上和实验上进行了评估。在数据预处理，数据分区和计算的每个步骤中分析负载平衡，准确性和复杂性。通过使用各种数据集可以得出实验结果。定期分析每个数据集的时间和空间复杂度，并为每种算法提供了新的优点和缺点。最后，本文可作为参考材料，用于处理大数据中的Mapreducing概念中基于KNN [2]的问题。

著录项

来源
《International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery》|2018年|222-229|共8页
会议地点
作者
Srikanth Bethu; B Sankara Babu; S Govinda Rao; R Aruna Florence;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Partitioning algorithms; Classification algorithms; Training; Decision trees; Big Data; Clustering algorithms; Bayes methods;

机译：分区算法;分类算法;训练;决策树;大数据;聚类算法;贝叶斯方法;

相似文献

外文文献
中文文献
专利

1. Aboveground biomass mapping of La Trinidad forests in Benguet, Philippines, using Landsat Thematic Mapper data and k-nearest neighbor method. [J] . Lumbres R. I. C., Lee YoungJin Forest Science and Technology . 2014,第2期

机译：利用Landsat Thematic Mapper数据和k近邻法，在菲律宾本格特的特立尼达（La Trinidad）森林地上生物量绘图。
2. Aboveground biomass mapping of La Trinidad forests in Benguet, Philippines, using Landsat Thematic Mapper data and k-nearest neighbor method [J] . Roscinto Ian C. Lumbres, Young Jin Lee Forest Science and Technology . 2014,第2期

机译：使用Landsat Thematic Mapper数据和k近邻法，在菲律宾本格特的特立尼达森林的地上生物量图
3. Texture analysis of iodine maps and conventional images for k-nearest neighbor classification of benign and metastatic lung nodules [J] . Simon Lennartz, Alina Mager, Nils Gro?e Hokamp, Cancer Imaging . 2021,第1期

机译：碘映射和常规图像的纹理分析良性和转移性肺结节的k最近邻分类
4. Map Reduce by K-Nearest Neighbor Joins [C] . Srikanth Bethu, B Sankara Babu, S Govinda Rao, International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery . 2018

机译：由k-incelt邻居加入映射
5. Voting Nearest Neighbors: SVM Constraints Selection Algorithm Based on K-Nearest Neighbors [D] . Moreira da Costa, Leandro. 2019

机译：投票最近的邻居：基于K-Indect邻居的SVM约束选择算法
6. Texture analysis of iodine maps and conventional images for k-nearest neighbor classification of benign and metastatic lung nodules [O] . Simon Lennartz, Alina Mager, Nils Große Hokamp, 2021

机译：碘地图的纹理分析及良邻邻良肺结核k最近邻分类的常规图像
7. FML-kNN: scalable machine learning on Big Data using k-nearest neighbor joins [O] . Georgios Chatzigeorgakidis, Sophia Karagiorgou, Spiros Athanasiou, 2018

机译：FML-KNN：使用k-inceral邻接加入的大数据上可扩展机器学习

Map Reduce by K-Nearest Neighbor Joins

摘要

著录项

相似文献

相关主题

期刊订阅