Map Reduce by K-Nearest Neighbor Joins

Abstract

Knowledge discovery and data mining play a major role in computationally intensive tasks across a wide range of applications. As the volume and dimensionality of data grow, distributed processing is needed to complete these operations in a reasonable time. MapReduce programming is well suited to distributed, large-scale data processing, and it admits several different solutions to the same problem, each with its own constraints and properties. In this paper, we present a comparative analysis of approaches for computing KNN on MapReduce [1], covering both theoretical analysis and experimental evaluation. Load balancing, accuracy, and complexity are analyzed at each step: data preprocessing, data partitioning, and computation. The experimental results are produced using a variety of datasets. Time and space complexity are analyzed on each dataset, and the resulting advantages and shortcomings are discussed for each algorithm. Finally, this paper can serve as reference material for handling KNN-based [2] problems with MapReduce in Big Data.
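
For orientation, the partition-then-compute pattern the abstract refers to can be sketched in a few lines of Python. This is a minimal single-process illustration of a block-based kNN join, not the method evaluated in the paper: the names `map_phase`, `shuffle`, `reduce_local_knn`, and `knn_join`, the modulo-based block assignment, and the Euclidean distance are all assumptions made for the example.

```python
import heapq
import math
from collections import defaultdict


def map_phase(r_points, s_points, n_blocks):
    """Partitioning step: pair every R block with every S block.

    Block assignment by id modulo n_blocks is an illustrative assumption.
    """
    for rid, r in r_points.items():
        for j in range(n_blocks):
            yield (rid % n_blocks, j), ("R", rid, r)
    for sid, s in s_points.items():
        for i in range(n_blocks):
            yield (i, sid % n_blocks), ("S", sid, s)


def shuffle(pairs):
    """Group mapper output by key, as a MapReduce framework would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_local_knn(records, k):
    """Computation step: k nearest S points for each R point in one block pair."""
    r_side = [(i, p) for tag, i, p in records if tag == "R"]
    s_side = [(i, p) for tag, i, p in records if tag == "S"]
    for rid, r in r_side:
        candidates = ((math.dist(r, s), sid) for sid, s in s_side)
        yield rid, heapq.nsmallest(k, candidates)


def knn_join(r_points, s_points, k, n_blocks=2):
    """Merge the per-block partial results into the global k nearest neighbors."""
    groups = shuffle(map_phase(r_points, s_points, n_blocks))
    partial = defaultdict(list)
    for records in groups.values():
        for rid, local in reduce_local_knn(records, k):
            partial[rid].extend(local)
    return {
        rid: [sid for _, sid in heapq.nsmallest(k, cands)]
        for rid, cands in partial.items()
    }


if __name__ == "__main__":
    R = {0: (0.0, 0.0), 1: (10.0, 10.0)}
    S = {0: (1.0, 0.0), 1: (0.0, 2.0), 2: (9.0, 9.0), 3: (20.0, 20.0)}
    print(knn_join(R, S, k=2))
```

Each R point is replicated to `n_blocks` reducers in this sketch, so the block count trades shuffle volume against per-reducer work, which is the kind of load-balancing question the abstract mentions.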

Bibliographic details

Source
International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery | 2018 | 1 v. | 8 pages
Authors
Srikanth Bethu; B Sankara Babu; S Govinda Rao; R Aruna Florence
Original format: PDF
Chinese Library Classification: Computing technology, computer technology
Keywords
Partitioning algorithms; Classification algorithms; Training; Decision trees; Big Data; Clustering algorithms; Bayes methods

Similar literature

1. Aboveground biomass mapping of La Trinidad forests in Benguet, Philippines, using Landsat Thematic Mapper data and k-nearest neighbor method [J]. Lumbres R. I. C., Lee YoungJin. Forest Science and Technology. 2014, No. 2
2. Aboveground biomass mapping of La Trinidad forests in Benguet, Philippines, using Landsat Thematic Mapper data and k-nearest neighbor method [J]. Roscinto Ian C. Lumbres, Young Jin Lee. Forest Science and Technology. 2014, No. 2
3. Texture analysis of iodine maps and conventional images for k-nearest neighbor classification of benign and metastatic lung nodules [J]. Simon Lennartz, Alina Mager, Nils Große Hokamp. Cancer Imaging. 2021, No. 1
4. Map Reduce by K-Nearest Neighbor Joins [C]. Srikanth Bethu, B Sankara Babu, S Govinda Rao. International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery. 2018
5. Voting Nearest Neighbors: SVM Constraints Selection Algorithm Based on K-Nearest Neighbors [D]. Moreira da Costa, Leandro. 2019
6. Texture analysis of iodine maps and conventional images for k-nearest neighbor classification of benign and metastatic lung nodules [O]. Simon Lennartz, Alina Mager, Nils Große Hokamp. 2021
7. FML-kNN: scalable machine learning on Big Data using k-nearest neighbor joins [O]. Georgios Chatzigeorgakidis, Sophia Karagiorgou, Spiros Athanasiou. 2018
