首页> 外文会议>2017 IEEE 19th International Conference on High Performance Computing and Communications Workshops >A Survey and Recommendations for Distributed, Parallel, Single Pass, Incremental Bayesian Classification Based on MapReduce for Big Data
【24h】

A Survey and Recommendations for Distributed, Parallel, Single Pass, Incremental Bayesian Classification Based on MapReduce for Big Data

机译:基于MapReduce的分布式,并行,单遍,增量贝叶斯分类的调查和建议

获取原文
获取原文并翻译 | 示例

摘要

In the emerging digital age, massive production of data is occurred actively or passively by collecting data from users and environment via applications, sensor devices and so on. That makes it important and crucial to have the ability to process big data efficiently and effectively utilize it. The challenge to process big data is that it has high volume, velocity, variety, as well as veracity and value. In this paper, we present a survey of related work and prescribe our recommendations towards building Bayesian classification for big data environments. It is based on MapReduce and is distributed, parallel, single pass and incremental which makes it feasible to be deployed and executed on cloud computing platform We also carry out scalability analysis of the proposed solution that it can train Bayesian classifier to perform predictive analytics by processing big data with large volume, velocity and variety.
机译:在新兴的数字时代,通过应用程序,传感器设备等从用户和环境收集数据,可以主动或被动地产生大量数据。这使得具有高效处理大数据和有效利用大数据的能力变得至关重要和至关重要。处理大数据的挑战在于它具有高容量,高速度,多样化以及准确性和价值。在本文中,我们对相关工作进行了调查,并提出了针对大数据环境建立贝叶斯分类的建议。它基于MapReduce,是分布式,并行,单遍和增量的,因此可以在云计算平台上部署和执行。我们还对提出的解决方案进行了可伸缩性分析,该解决方案可以训练贝叶斯分类器通过处理来执行预测分析。大数据量大,速度快,种类繁多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号