首页> 外文期刊>International Journal on Computer Science and Engineering >Conglomeration of Instance Filtering’s k- Nearest Neighborhood and Collaborative Filtering’s Item based Recommendation on Airline Dataset System using Map-Reduce and Mahout
【24h】

Conglomeration of Instance Filtering’s k- Nearest Neighborhood and Collaborative Filtering’s Item based Recommendation on Airline Dataset System using Map-Reduce and Mahout

机译:基于Map-Reduce和Mahout的航空数据集系统实例过滤k-最近邻居和基于协同过滤项的建议的合并

获取原文
           

摘要

With the growth of variety in every industry, Customer finds it difficult from N different options available in the market. Its the business responsibility to showcase the best suggested item depending on his/her needs, Ratings of the product, Opinions/Feedbacks of different customers. To make these recommendations very close to the customer bahaviour, we need to process huge existing data to be processed. Hence MapReduce is considered which is emerging as a parallel, distributed paradigm for processing and generating large data sets. This trend combined with the growing need to run Machine Learning (ML) algorithms on massive datasets has led to an increased interest in implementing ML algorithms on MapReduce. Hence using machine learning algorithms in conjunction with big data can bring outright value for any business transformation. This research focuses on a way of analyzing large amount of data to give better recommendations for users by ML algorithms, thereby converting e-commerce site visitors to buyers. We mainly address the challenges of building an efficient and useful recommendation system given a large dataset, and discuss different approaches on identifying like-minded neighbors by making use of similarity, nearest neighborhood, Pearson Correlation higher level ML algorithms. We report probabilities of these kind algorithms with huge amount of data on hadoop clusters
机译:随着每个行业品种的增长,客户发现市场上有N种不同的选择很难。根据他/她的需求,产品评级,不同客户的意见/反馈展示最佳建议项目的业务责任。为了使这些建议非常接近客户的行为,我们需要处理大量现有数据进行处理。因此,MapReduce被认为是一种并行的,分布式的范式,用于处理和生成大型数据集。这种趋势与对在海量数据集上运行机器学习(ML)算法的需求不断增长,导致对在MapReduce上实现ML算法的兴趣增加。因此,将机器学习算法与大数据结合使用可以为任何业务转型带来绝对的价值。这项研究专注于一种分析大量数据的方法,以通过ML算法为用户提供更好的建议,从而将电子商务网站的访问者转化为购买者。我们主要解决在给定大型数据集的情况下构建高效且有用的推荐系统的挑战,并讨论通过利用相似性,最近邻,Pearson Correlation高层ML算法来识别志同道合的邻居的不同方法。我们报告了这类算法在Hadoop集群上的大量数据的概率

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号