INCREMENTAL PARALLEL CLASSIFIER FOR BIG DATA WITH CASE STUDY: NAIVE BAYES USING MAPREDUCE PATTERNS

VERONICA S. MOERTINI; MOHAMAD F. SEPTRIANTO; LIPTIA VENICA

首页> 外文期刊>Journal of Theoretical and Applied Information Technology >INCREMENTAL PARALLEL CLASSIFIER FOR BIG DATA WITH CASE STUDY: NAIVE BAYES USING MAPREDUCE PATTERNS

【24h】

INCREMENTAL PARALLEL CLASSIFIER FOR BIG DATA WITH CASE STUDY: NAIVE BAYES USING MAPREDUCE PATTERNS

机译：具有案例研究的大数据增量并行分类器：使用映射减少模式的朴素贝叶斯

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Classification methods can be used to derive values from big data in the form of models, which then can be utilized to predict new cases. Several parallel classification methods for big data have been developed based on Hadoop MapReduce as well as for Spark system. As big data keeps on coming, the models must be updated from time to time to represent the old as well as the new data. The computations must be efficient and scalable for handling big data. This research aims to enhance the existing parallel classifiers such that they will perform as incremental classifier handling batches of big data. The research results are presented as follows. First, the architecture and main concept of the enhancement is presented. Secondly, the proposed incremental parallel Na?ve Bayes classifier (NBC) based on MapReduce that handles dataset with discrete attributes is discussed in detailed. Two series of experiment were performed on Hadoop clusters with 5 and 10 nodes. The results show that the incremental parallel NBC has acceptable accuracy, is efficient and scalable.

机译：分类方法可用于以模型形式从大数据中获取值，然后可用于预测新情况。已经基于Hadoop MapReduce和Spark系统开发了几种并行的大数据分类方法。随着大数据的不断涌现，必须不时更新模型以代表新旧数据。计算必须高效且可扩展，以处理大数据。这项研究旨在增强现有的并行分类器，使其可以作为增量分类器来处理大数据批次。研究结果如下。首先，介绍了增强的体系结构和主要概念。其次，详细讨论了基于MapReduce的增量并行幼稚贝叶斯分类器（NBC），该分类器处理具有离散属性的数据集。在具有5个和10个节点的Hadoop集群上进行了两个系列的实验。结果表明，增量并行NBC具有可接受的精度，有效且可扩展。

著录项

来源
《Journal of Theoretical and Applied Information Technology》 |2019年第11期|共21页
作者
VERONICA S. MOERTINI; MOHAMAD F. SEPTRIANTO; LIPTIA VENICA;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Big Data Classification MethodIncremental Parallel ClassifierMapreduce Patterns;

机译：大数据分类方法增量并行分类器映射减少模式;

相似文献

外文文献
中文文献
专利

1. MapReduce Implementation of a Multinomial and Mixed Naive Bayes Classifier [J] . Bagui Sikha, Devulapalli Keerthi, John Sharon International Journal of Intelligent Information Technologies . 2020,第2期

机译：MapReduce多项和混合野贝雷斯分类器的实施
2. A comparative study of Naive Bayes classifier and Bayes net classifier for fault diagnosis of monoblock centrifugal pump using wavelet analysis [J] . V. Muralidharan, V. Sugumaran Applied Soft Computing . 2012,第8期

机译：基于小波分析的朴素贝叶斯分类器和贝叶斯网络分类器在整体离心泵故障诊断中的比较研究。
3. Speeding up incremental wrapper feature subset selection with Naive Bayes classifier [J] . Pablo Bermejo, Jose A. Gamez, Jose M. Puerta Knowledge-Based Systems . 2014,第jana期

机译：使用朴素贝叶斯分类器加快增量包装器特征子集的选择
4. Migration of Relational Database to MongoDB and Data Analytics using Naive Bayes Classifier based on Mapreduce Approach [C] . Ganesh B. Solanke, K. Rajeswari International Conference on Computing, Communication, Control and Automation . 2017

机译：使用基于Mapreduce方法的朴素贝叶斯分类器将关系数据库迁移到MongoDB和数据分析
5. Naive Bayes algorithm for Twitter sentiment analysis and its implementation in MapReduce [D] . Li, Zhaoyu. 2014

机译：朴素贝叶斯算法的Twitter情感分析及其在MapReduce中的实现
6. Treatment patterns of drug-naive patients with type 2 diabetes mellitus: a retrospective cohort study using a Japanese hospital database [O] . Yohei Morita, Hiroki Murayama, Masato Odawara, 2019

机译：未使用过药物的2型糖尿病患者的治疗方式：一项使用日本医院数据库的回顾性队列研究
7. Uncertainty Based Under-Sampling for Learning Naive Bayes Classifiers Under Imbalanced Data Sets [O] . Christos K. Aridas, Stamatis Karlos, Vasileios G. Kanas, 2020

机译：基于不确定性的基于非衡性贝父分类器的不确定性下滑数据集

INCREMENTAL PARALLEL CLASSIFIER FOR BIG DATA WITH CASE STUDY: NAIVE BAYES USING MAPREDUCE PATTERNS

摘要

著录项

相似文献

相关主题

期刊订阅