Research on archives text classification based on Naive bayes

Abstract

This paper analyzes the archival data resources of Gansu Province in light of the characteristics of archive resources, and applies the Naive Bayes classification algorithm to classify them. Feature attributes suited to archive text are chosen according to the characteristics of the archive data, with the TF-IDF algorithm used for feature selection. Experimental results show that the classification model is suitable for archival text resources and achieves automatic classification of archives. Compared with the traditional Naive Bayes classification method, the proposed model improves classification efficiency on archives by 1%-2%, making it a more effective classification model for archives.
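The pipeline the abstract describes (TF-IDF-based feature selection followed by a Naive Bayes classifier) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the exact selection rule, so the sketch assumes the top-k terms by summed TF-IDF weight are kept, assumes a multinomial model with add-one (Laplace) smoothing, and uses hypothetical, already-tokenized toy records with made-up labels ("finance", "personnel").

```python
import math
from collections import Counter, defaultdict


def tfidf_select(docs, k):
    """Keep the k terms with the highest TF-IDF weight summed over all documents.

    Assumption: this top-k rule stands in for the paper's unspecified selection step.
    """
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))  # document frequency
    score = defaultdict(float)
    for doc in docs:
        for term, count in Counter(doc).items():
            score[term] += (count / len(doc)) * math.log(n / df[term])
    ranked = sorted(score, key=lambda t: (-score[t], t))
    return set(ranked[:k])


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing over a fixed vocabulary."""

    def fit(self, docs, labels, vocab):
        self.vocab = vocab
        self.total = len(labels)
        self.class_docs = Counter(labels)
        self.term_counts = defaultdict(Counter)
        for doc, label in zip(docs, labels):
            self.term_counts[label].update(t for t in doc if t in vocab)
        return self

    def predict(self, doc):
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_docs.items():
            counts = self.term_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            # log prior + sum of smoothed log likelihoods of selected terms
            score = math.log(n_docs / self.total)
            for term in doc:
                if term in self.vocab:
                    score += math.log((counts[term] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy corpus: tokenized archive records with illustrative labels.
train_docs = [
    ["salary", "budget", "report", "budget"],
    ["invoice", "budget", "payment", "report"],
    ["staff", "appointment", "transfer", "report"],
    ["staff", "promotion", "appointment", "report"],
]
train_labels = ["finance", "finance", "personnel", "personnel"]

vocab = tfidf_select(train_docs, k=8)
model = NaiveBayes().fit(train_docs, train_labels, vocab)

print(model.predict(["budget", "payment", "report"]))  # finance
print(model.predict(["staff", "transfer"]))            # personnel
```

In this toy corpus the term "report" occurs in every record, so its inverse document frequency is zero and the selection step drops it. That is the intended effect of TF-IDF-based selection: terms that do not discriminate between classes are removed before the classifier is trained.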


Bibliographic details

Source
2017 IEEE 2nd Information Technology, Networking, Electronic and Automation Control Conference, 2017, pp. 187-190 (4 pages)
Conference location: Chengdu (CN)
Authors
Peixin Liu; Hongzhi Yu; Tao Xu; Chuanqi Lan
Author affiliations

State Key Laboratory of Chinese Language and Information Technology of Ministry of Education, Northwest University for Nationalities, Lanzhou, China (all four authors)

Original format: PDF
Language: English
Keywords
Bayes methods; learning (artificial intelligence); pattern classification; text analysis


