【24h】

Research on archives text classification based on Naive bayes

机译:基于朴素贝叶斯的档案文本分类研究

获取原文
获取原文并翻译 | 示例

摘要

This paper analyzes the data resources of archives in Gansu Province by combining with the characteristics of archives resources, and combines with Naive Bayesian classification algorithm to realize the application of archives resource classification. According to the characteristics of the file data, select the attribute that matches the text of the file text, and use the TFIDF algorithm in the file text feature attribute selection. The experimental results show that the classification model is suitable for the classification of archival text resources, and the function of automatic classification of archives is realized. Compared with the traditional Naive Bayesian classification method, the classification model proposed in this paper is 1% -2% for the classification efficiency of archives, it is a more effective classification model for the archives.
机译:结合档案资源的特点,对甘肃省档案数据资源进行了分析,并结合朴素贝叶斯分类算法,实现了档案资源分类的应用。根据文件数据的特征,选择与文件文本的文本匹配的属性,并在文件文本特征属性选择中使用TFIDF算法。实验结果表明,该分类模型适用于档案文本资源的分类,实现了档案的自动分类功能。与传统的朴素贝叶斯分类方法相比,本文提出的分类模型对档案的分类效率为1%-2%,是一种更为有效的档案分类模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号