首页> 外文期刊>Journal of computational science >Automatic text classification algorithm based on Gauss improved convolutional neural network
【24h】

Automatic text classification algorithm based on Gauss improved convolutional neural network

机译:基于高斯改进卷积神经网络的文本自动分类算法

获取原文
获取原文并翻译 | 示例
           

摘要

The traditional KNN query is a kind of algorithm with good stability and accuracy performance. However, when the sample size is too large, the computational efficiency of the algorithm is affected greatly. Therefore, a kind of parallel MKNN text classification algorithm based on clustering center text series has been proposed. Firstly, the effective dimensionality reduction of similarity calculation amount of the algorithm is realized based on the clustering center, and the original large-scale document samples are replaced with a relatively small number of clustering sample centers to realize improvement of the KNN query process. Secondly, MapReduce parallel framework is used to meet real-time demand of large-scale text classification and calculation combined with features of text classification, and to effectively overcome slow speed of the KNN query process and ensure accuracy of text classification as higher as possible. Finally, the classification speed of proposed algorithm can be effectively improved under the premise of ensuring sufficient accuracy through comparison in experiment of text classification accuracy and algorithmic efficiency with the similar single-threaded algorithm. (C) 2017 Elsevier B.V. All rights reserved.
机译:传统的KNN查询是一种具有良好稳定性和准确性的算法。但是,当样本量太大时,算法的计算效率会受到很大影响。因此,提出了一种基于聚类中心文本序列的并行MKNN文本分类算法。首先,基于聚类中心,实现了算法相似度计算量的有效降维,并用较少数量的聚类样本中心代替了原来的大规模文档样本,以实现对KNN查询过程的改进。其次,利用MapReduce并行框架结合文本分类的特点,满足大规模文本分类和计算的实时需求,有效克服了KNN查询过程的速度慢,保证了文本分类的准确性。最后,通过与相似的单线程算法进行文本分类精度和算法效率的实验比较,在保证足够精度的前提下,可以有效提高算法的分类速度。 (C)2017 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号