Fragments and Text Categorization

机译：片段和文本分类

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We introduce two novel methods of text categorization in which documents are split into fragments. We conducted experiments on English, French and Czech. In all cases, the problems referred to a binary document classification. We find that both methods increase the accuracy of text categorization. For the Naive Bayes classifier this increase is significant.

机译：我们介绍了两种新的文本分类方法，其中将文档分为多个片段。我们进行了英语，法语和捷克语的实验。在所有情况下，问题都涉及二进制文档分类。我们发现这两种方法都可以提高文本分类的准确性。对于朴素贝叶斯分类器，此增加是显着的。

著录项

来源
《Proceedings of the Student Research Workshop, Interactive Posters/Demonstrations, and Tutorial Abstracts》|2004年|P.227-230|共4页
会议地点
作者
Jan Blatak; Eva Mrakova; Lubos Popelinsky;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机软件;
关键词

相似文献

外文文献
中文文献
专利

1. Contextual Text Categorization: An Improved Stemming Algorithm to Increase the Quality of Categorization in Arabic Text [J] . Gadri Said, Moussaoui Abdelouahab The international arab journal of information technology . 2017,第6期

机译：上下文文本分类：一种改进的词干算法，可提高阿拉伯文本分类的质量
2. Text Document Categorization using Enhanced Sentence Vector Space Model and Bi-Gram Text Representation Model Based on Novel Fusion Techniques [J] . Abdisa Demissie Amensisa New Media and Mass Communication . 2020,第4期

机译：基于新型融合技术的基于增强句子矢量空间模型和双革文本表示模型的文本文档分类
3. A Novel Text Representation Model to Categorize Text Documents using Convolution Neural Network [J] . M. B. Revanasiddappa, B. S. Harish International Journal of Intelligent Systems and Applications . 2019,第5期

机译：利用卷积神经网络对文本文档进行分类的新型文本表示模型
4. Fragments and Text Categorization [C] . Jan Blatak, Eva Mrakova, Lubos Popelinsky, Association for Computational Linguistics Annual Meeting . 2004

机译：片段和文本分类
5. The implementation of dynamic document organization using the integration of text clustering and text categorization. [D] . Jo, Taeho. 2006

机译：使用文本聚类和文本分类的集成来实现动态文档组织。
6. SANAD: Single-label Arabic News Articles Dataset for automatic text categorization [O] . Omar Einea, Ashraf Elnagar, Ridhwan Al Debsi 2019

机译：SANAD：用于自动文本分类的单标签阿拉伯新闻文章数据集
7. TEXT CATEGORIZATION USING ONLY FRAGMENTS OF DOCUMENTS [O] . Pilaszy Istvan, Dobrowiecki Tadeusz 100

机译：仅使用文档片段进行文本分类

Fragments and Text Categorization

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅