首页> 外文会议>International Conference on Informatics and Computing >Comparison of Multinomial Na?ve Bayes with K-Nearest Neighbors, Support Vector Machine and Random Forest for Classification of “Network Attacks” Document

【24h】

Comparison of Multinomial Na?ve Bayes with K-Nearest Neighbors, Support Vector Machine and Random Forest for Classification of “Network Attacks” Document

机译：与K-CORMATE邻居的多项式NAαve Bayes的比较，支持向量机和随机林进行“网络攻击”文档的分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The objective of this paper is to categorize English documents with the topic “Network Attack” using Multinomial Na?ve Bayes method and. It then compares with K-Nearest Neighbors (KNN), Support Vector Machine Linear (SVM Linear) and Random Forest. The classification process was conducted using some feature extraction methods, such as Term Frequency-Inverse Document Frequency (TF-IDF) extraction, Count Vector, and Document Vector (Doc2vec). The experimental result showed that MNB with TF-IDF got an accuracy of 76.00%. The TF-IDF with KNN method, SVM Linear, Random Forest results from efficiency 72.66%, 78.66% and 81.66% respectively, and using Count Vector were 60.00%, 77.00%, 70.66% and 81.00% (MNB, KNN, SVM Linear, Random Forest). The experimental was also conducted using the Random Forest method (as the classifier) and Document Vector (as the feature extraction method). Thus it is obtained the accuracy of 63.33%. The MNB method was quite better to classify the document than KNN method. However, SVM and Random Forest methods were better than the MNB and KNN methods. It can be concluded that the use of TF-IDF was generally better than using Count Vector and Doc2vec. However, the Count Vector had better result compared to TF-IDF under MNB Classifies.

机译：本文的目的是使用多项式Na ve贝雷斯方法和讨论“网络攻击”主题“网络攻击”的英文文件。然后，与K-CORMALT邻居（KNN）进行比较，支持向量机线性（SVM线性）和随机林。使用一些特征提取方法进行分类过程，例如术语频率逆文档频率（TF-IDF）提取，计数矢量和文档向量（DOC2VEC）。实验结果表明，具有TF-IDF的MNB精度为76.00％。具有KNN方法的TF-IDF，SVM线性，随机森林的效率分别产生72.66％，78.66％和81.66％，并且使用计数载体为60.00％，77.00％，70.66％和81.00％（MNB，KNN，SVM线性，随机森林）。还使用随机森林方法（作为分类器）和文件向量进行实验性（作为特征提取方法）。因此，获得63.33％的准确性。 MNB方法比knn方法更好地分类文件。然而，SVM和随机森林方法优于MNB和KNN方法。可以得出结论，TF-IDF的使用通常比使用计数矢量和DOC2VEC更好。然而，与MNB分类下的TF-IDF相比，计数载体具有更好的结果。

著录项

来源
《International Conference on Informatics and Computing》|2019年|1 v.|共6页
会议地点
作者
Bambang Harjito; Ardhi Wijayanto; Kuni Nur Aini; Budi Murtiyas;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Bayes methods; document handling; feature extraction; nearest neighbour methods; pattern classification; random forests; support vector machines; text analysis;

机译：贝叶斯方法;文件处理;特征提取;最近的邻法;模式分类;随机森林;支持矢量机;文本分析;

相似文献

外文文献
中文文献
专利

1. A Comparative Assessment of Support Vector Machines, Probabilistic Neural Networks, and K-Nearest Neighbor Algorithms for Water Quality Classification [J] . Fereshteh Modaresi, Shahab Araghinejad Water Resources Management . 2014,第12期

机译：支持向量机，概率神经网络和K近邻算法用于水质分类的比较评估
2. Direct comparison between support vector machine and multinomial naive Bayes algorithms for medical abstract classification. [J] . Stan Matwin, Vera Sazonova Journal of the American Medical Informatics Association : . 2012,第5期

机译：支持向量机与多项朴素贝叶斯算法之间的直接比较，用于医学摘要分类。
3. Basic Tenets of Classification Algorithms K-Nearest-Neighbor, Support Vector Machine, Random Forest and Neural Network: A Review [J] . Ernest Yeboah Boateng, Joseph Otoo, Daniel A. Abaye Journal of Data Analysis and Information Processing . 2020,第04期

机译：分类算法基本原则K-最近邻，支持向量机，随机森林和神经网络：综述
4. Comparison of Multinomial Naïve Bayes with K-Nearest Neighbors, Support Vector Machine and Random Forest for Classification of “Network Attacks” Document [C] . Bambang Harjito, Ardhi Wijayanto, Kuni Nur Aini, International Conference on Informatics and Computing . 2019

机译：多项式朴素贝叶斯与K最近邻，支持向量机和随机森林的比较，用于“网络攻击”文档的分类
5. Comparative classification of prostate cancer data using the Support Vector Machine, Random Forest, DualKS and k-Nearest Neighbours. [D] . Sakouvogui, Kekoura. 2015

机译：使用支持向量机，Random Forest，DualKS和k-Nearest邻居对前列腺癌数据进行比较分类。
6. Comparison of Random Forest k-Nearest Neighbor and Support Vector Machine Classifiers for Land Cover Classification Using Sentinel-2 Imagery [O] . Phan Thanh Noi, Martin Kappas 2018

机译：使用Sentinel-2影像进行土地覆盖分类的随机森林k最近邻和支持向量机分类器的比较
7. Comparison of Random Forest, k-Nearest Neighbor, and Support Vector Machine Classifiers for Land Cover Classification Using Sentinel-2 Imagery [O] . Phan Thanh Noi, Martin Kappas 2017

机译：利用sentinel-2影像进行土地覆盖分类的随机森林，k-最近邻和支持向量机分类器的比较

Comparison of Multinomial Na?ve Bayes with K-Nearest Neighbors, Support Vector Machine and Random Forest for Classification of “Network Attacks” Document

摘要

著录项

相似文献

相关主题

期刊订阅