首页> 外文期刊>Journal of High Speed Networks >Incremental text categorization based on hybrid optimization-based deep belief neural network
【24h】

Incremental text categorization based on hybrid optimization-based deep belief neural network

机译:基于混合优化的深度信仰神经网络的增量文本分类

获取原文
获取原文并翻译 | 示例
       

摘要

One of the effective text categorization methods for learning the large-scale data and the accumulated data is incremental learning. The major challenge in the incremental learning is improving the accuracy as the text document consists of numerous terms. In this research, a incremental text categorization method is developed using the proposed Spider Grasshopper Crow Optimization Algorithm based Deep Belief Neural network (SGrC-based DBN) for providing optimal text categorization results. The proposed text categorization method has four processes, such as are pre-processing, feature extraction, feature selection, text categorization, and incremental learning. Initially, the database is pre-processed and fed into vector space model for the extraction of features. Once the features are extracted, the feature selection is carried out based on mutual information. Then, the text categorization is performed using the proposed SGrC-based DBN method, which is developed by the integration of the spider monkey optimization (SMO) with the Grasshopper Crow Optimization Algorithm (GCOA) algorithm. Finally, the incremental text categorization is performed based on the hybrid weight bounding model that includes the SGrC and Range degree and particularly, the optimal weights of the Range degree model is selected based on SGrC. The experimental result of the proposed text categorization method is performed by considering the data from the Reuter database and 20 Newsgroups database. The comparative analysis of the text categorization method is based on the performance metrics, such as precision, recall and accuracy. The proposed SGrC algorithm obtained a maximum accuracy of 0.9626, maximum precision of 0.9681 and maximum recall of 0.9600, respectively when compared with the existing incremental text categorization methods.
机译:用于学习大规模数据和累积数据的有效文本分类方法之一是增量学习。随着文本文档由众多术语组成,增量学习中的主要挑战正在提高准确性。在本研究中,使用基于深信神经网络(基于SGRC的DBN)的建议的蜘蛛蚱蜢乌鸦优化算法来开发增量文本分类方法,用于提供最佳文本分类结果。所提出的文本分类方法有四个进程,例如是预处理,特征提取,特征选择,文本分类和增量学习。最初,数据库被预处理并馈送到向量空间模型中以提取特征。一旦提取特征,就基于相互信息执行特征选择。然后,使用所提出的基于SGRC的DBN方法来执行文本分类,该方法是通过与蚱蜢乌鸦优化算法(GCOA)算法的蜘蛛猴优化(SMO)的集成而开发的。最后,基于包括SGRC和范围度的混合权重界限模型来执行增量文本分类,特别是,基于SGRC选择范围内模型的最佳权重。通过考虑来自REUTER数据库和20个新闻组数据库的数据来执行所提出的文本分类方法的实验结果。文本分类方法的比较分析基于性能指标,例如精度,召回和准确性。与现有的增量文本分类方法相比,所提出的SGRC算法最大精度为0.9626,最大精度为0.9681,最大召回为0.9600。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号