Towards perfect text classification with Wikipedia-based semantic Naive Bayes learning

Kim Han-joon; Kim Jiyun; Kim Jinseog; Lim Pureum

Abstract

This paper suggests a novel way of dramatically improving the Naive Bayes text classifier with our semantic tensor space model for document representation. In our work, we intend to achieve a perfect text classification with the semantic Naive Bayes learning that incorporates the semantic concept features into term feature statistics; for this, the Naive Bayes learning is semantically augmented under the tensor space model where the 'concept' space is regarded as an independent space equated with the 'term' and 'document' spaces, and it is produced with concept-level informative Wikipedia pages associated with a given document corpus. Through extensive experiments using three popular document corpora including Reuters-21578, 20Newsgroups, and OHSUMED corpora, we prove that the proposed method not only has superiority over the recent deep learning-based classification methods but also shows nearly perfect classification performance. (c) 2018 Elsevier B.V. All rights reserved.
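
The idea of folding concept features into Naive Bayes term statistics can be illustrated with a minimal sketch. This is not the paper's tensor space formulation: it assumes each document already comes with a list of Wikipedia concept labels, and it simply adds a second, Laplace-smoothed multinomial likelihood over those concepts to the usual term likelihood. The names `train`, `classify`, and `concept_weight`, and the toy data, are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter, defaultdict


def train(docs):
    """Collect per-class counts from (label, terms, concepts) triples."""
    model = {
        "class_docs": Counter(),                 # documents per class
        "term_counts": defaultdict(Counter),     # term frequencies per class
        "concept_counts": defaultdict(Counter),  # concept frequencies per class
        "term_vocab": set(),
        "concept_vocab": set(),
    }
    for label, terms, concepts in docs:
        model["class_docs"][label] += 1
        model["term_counts"][label].update(terms)
        model["concept_counts"][label].update(concepts)
        model["term_vocab"].update(terms)
        model["concept_vocab"].update(concepts)
    return model


def log_prob(counts, vocab_size, feature):
    """Laplace-smoothed log P(feature | class) for a multinomial model."""
    return math.log((counts[feature] + 1) / (sum(counts.values()) + vocab_size))


def classify(model, terms, concepts, concept_weight=1.0):
    """Return the class maximizing prior + term + weighted concept log-likelihood.

    concept_weight is an assumed knob, not something taken from the paper.
    """
    total_docs = sum(model["class_docs"].values())
    scores = {}
    for label, n_docs in model["class_docs"].items():
        score = math.log(n_docs / total_docs)
        score += sum(
            log_prob(model["term_counts"][label], len(model["term_vocab"]), t)
            for t in terms
        )
        score += concept_weight * sum(
            log_prob(model["concept_counts"][label], len(model["concept_vocab"]), c)
            for c in concepts
        )
        scores[label] = score
    return max(scores, key=scores.get)


# Toy corpus: the term "bank" is ambiguous, the concept label is not.
toy_docs = [
    ("finance", ["bank", "loan"], ["Banking"]),
    ("nature", ["river", "bank"], ["River"]),
]
toy_model = train(toy_docs)
print(classify(toy_model, ["bank"], ["River"]))
print(classify(toy_model, ["bank"], ["Banking"]))
```

In this toy setting the term "bank" is equally likely under both classes, so the decision rests entirely on the concept feature, which is the kind of disambiguation the abstract attributes to the concept space.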

Bibliographic record

Source: Neurocomputing, 2018, Issue 13, pp. 128-134 (7 pages)
Authors: Kim Han-joon; Kim Jiyun; Kim Jinseog; Lim Pureum
Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English

Similar literature

1. Wikipedia-Based Semantic Similarity Measurements for Noisy Short Texts Using Extended Naive Bayes [J]. Shirakawa Masumi, Nakayama Kotaro, Hara Takahiro. IEEE Transactions on Emerging Topics in Computing, 2015, Issue 2.
2. Integrating associative rule-based classification with Naive Bayes for text classification [J]. Hadi Wael, Al-Radaideh Qasem A., Alhawari Samer. Applied Soft Computing, 2018.
3. A Frame Work for Classification of Multi Class Medical Data based on Deep Learning and Naive Bayes Classification Model [J]. N. Ramesh, G. Lavanya Devi, K Srinivasa Rao. International Journal of Information Engineering and Electronic Business, 2020, Issue 1.
4. Laplace Naive Bayes classifier in the classification of text in machine learning [C]. Neli Kalcheva, Nedyalko Nikolov. International Conference on Biomedical Innovations and Applications, 2020.
5. Modern Considerations for the Use of Naive Bayes in the Supervised Classification of Genetic Sequence Data [D]. Lakin, Steven M. 2021.
6. Beating Naive Bayes at Taxonomic Classification of 16S rRNA Gene Sequences [O]. Michal Ziemski, Treepop Wisanwanichthan, Nicholas A. Bokulich. 2021.
7. A Frame Work for Classification of Multi Class Medical Data based on Deep Learning and Naive Bayes Classification Model [O]. N. Ramesh, G. Lavanya Devi, K Srinivasa Rao. 2020.
