首页> 外文期刊>Neurocomputing >Towards perfect text classification with Wikipedia-based semantic Naive Bayes learning
【24h】

Towards perfect text classification with Wikipedia-based semantic Naive Bayes learning

机译:通过基于维基百科的语义朴素贝叶斯学习实现完美的文本分类

获取原文
获取原文并翻译 | 示例
           

摘要

This paper suggests a novel way of dramatically improving the Naive Bayes text classifier with our semantic tensor space model for document representation. In our work, we intend to achieve a perfect text classification with the semantic Naive Bayes learning that incorporates the semantic concept features into term feature statistics; for this, the Naive Bayes learning is semantically augmented under the tensor space model where the 'concept' space is regarded as an independent space equated with the 'term' and 'document' spaces, and it is produced with concept-level informative Wikipedia pages associated with a given document corpus. Through extensive experiments using three popular document corpora including Reuters-21578, 20Newsgroups, and OHSUMED corpora, we prove that the proposed method not only has superiority over the recent deep learning-based classification methods but also shows nearly perfect classification performance. (c) 2018 Elsevier B.V. All rights reserved.
机译:本文提出了一种新颖的方法,可以通过我们的语义张量空间模型来显着改进Naive Bayes文本分类器,以进行文档表示。在我们的工作中,我们打算通过语义朴素贝叶斯学习实现完美的文本分类,该学习将语义概念特征纳入术语特征统计中;为此,在张量空间模型下,朴素贝叶斯学习在语义上得到了增强,其中“概念”空间被视为与“术语”和“文档”空间等同的独立空间,并且它是由概念级的信息丰富的维基百科页面生成的与给定文档语料库相关联。通过使用三种流行的文档语料库(包括Reuters-21578、20Newsgroups和OHSUMED语料库)进行的广泛实验,我们证明了该方法不仅比最近的基于深度学习的分类方法优越,而且显示出近乎完美的分类性能。 (c)2018 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号