Adaptive Naive Bayesian Classifier for Automatic Classification of Webpage from Massive Network Data

机译：基于海量网络数据的网页自动分类的自适应朴素贝叶斯分类器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents the application of Na??ve Bayesian classifier to automatic classification of webpage. The key point in this article is that massive empirical data derives from the real traffic data collected from the backbone network of certain province in China, and we apply cumulative probability to determine the optimal size of feature vector adaptively. It's proved that the adaptive method of cumulative probability threshold selection applied in this study has good robustness. This paper focus on four feature selection methods: TF-IDF (term frequency-inverse document frequency), IG (Information Gain), MOR (Multi-class Odds Ratio), CDM (Class Discriminating Measure). We find that Na??ve Bayesian classifier performs fairly well in speed and precision on big data sets, whose precision, recall and F1 metric are all above 90% in all 6 categories of webpage.

机译：本文提出了朴素贝叶斯分类器在网页自动分类中的应用。本文的重点是海量的经验数据来源于从中国某省的骨干网收集的真实交通数据，并且我们运用累积概率来自适应地确定特征向量的最佳大小。实践证明，本文所采用的自适应累积概率阈值选择方法具有很好的鲁棒性。本文重点介绍四种特征选择方法：TF-IDF（术语频率与文档频率的倒数），IG（信息增益），MOR（多类赔率），CDM（类区分度）。我们发现，朴素贝叶斯分类器在大数据集的速度和精度方面表现相当不错，其精度，召回率和F1指标在所有6个类别的网页中均超过90％。

著录项

来源
《International Conference on Intelligent Human-Machine Systems and Cybernetics》|2014年|127-130|共4页
会议地点
作者
LinBin Xu; Jun Liu; WenLi Zhou; Qing Yan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Bayes methods; Games; Market research; Measurement; Robustness; Training; Vectors; adaptive threshold selection; big data; na??ve bayes; robustness; webpage classification;

机译：贝叶斯方法;游戏;市场调查;测量;坚固性训练;向量;自适应阈值选择;大数据;朴素的贝叶斯;健壮性网页分类;

相似文献

外文文献
中文文献
专利

1. Classification of 10 m-resolution SPOT data using a combined Bayesian Network Classifier-shape adaptive neighborhood method [J] . Jingxue Yang, Yunpeng Wang ISPRS Journal of Photogrammetry and Remote Sensing . 2012,第AUGa期

机译：使用组合贝叶斯网络分类器形状自适应邻域方法对10 m分辨率SPOT数据进行分类
2. Coupling self-organizing maps with a Naive Bayesian classifier: Stream classification studies using multiple assessment data [J] . Nikolaos Fytilis, Donna M. Rizzo Water resources research . 2013,第11期

机译：将自组织地图与朴素贝叶斯分类器耦合：使用多个评估数据的流分类研究
3. A unique feature extraction using MRDWT for automatic classification of abnormal heartbeat from ECG big data with Multilayered Probabilistic Neural Network classifier [J] . Rai Hari Mohan, Chatterjee Kalyan Applied Soft Computing . 2018,第期

机译：使用MRDWT进行MRDWT的独特特征提取，通过多层概率神经网络分类器自动分类ECG大数据的异常心跳
4. Adaptive Naive Bayesian Classifier for Automatic Classification of Webpage from Massive Network Data [C] . LinBin Xu, Jun Liu, WenLi Zhou, International Conference on Intelligent Human-Machine Systems and Cybernetics . 2014

机译：自动越来越多的贝叶斯分类器，用于自动分类来自大规模网络数据的网页
5. Development of a combined GIS, neural network and Bayesian classifier methodology for classifying remotely sensed data. [D] . Schneider, Claudio Albert. 2002

机译：结合了GIS，神经网络和贝叶斯分类器方法，对遥感数据进行分类。
6. Building an Ensemble of Fine-Tuned Naive Bayesian Classifiers for Text Classification [O] . Khalil El Hindi, Hussien AlSalman, Safwan Qasem, 2018

机译：建立一个微调朴素贝叶斯分类器的集合用于文本分类
7. Classification of 10 m-resolution SPOT data using a combined Bayesian Network Classifier-shape adaptive neighborhood method [O] . Yang Jingxue, Wang Yunpeng 2012

机译：使用组合贝叶斯网络分类器形状自适应邻域方法对10 m分辨率SPOT数据进行分类
8. Bayesian Classifier Based on a Deterministic Annealing Neural Network forAircraft Fault Classification [R] . Wang, J., Chu, S. P. 1997

机译：基于确定性退火神经网络的贝叶斯分类器在航空器故障分类中的应用

Adaptive Naive Bayesian Classifier for Automatic Classification of Webpage from Massive Network Data

摘要

著录项

相似文献

相关主题

期刊订阅