Text Categorization with Support Vector Machines. How to Represent Texts in Input Space?

Edda Leopold; Jorg Kindermann

首页> 外文期刊>Machine Learning >Text Categorization with Support Vector Machines. How to Represent Texts in Input Space?

【24h】

Text Categorization with Support Vector Machines. How to Represent Texts in Input Space?

机译：支持向量机的文本分类。如何在输入空间中表示文本？

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The choice of the kernel function is crucial to most applications for support vector machines. In this paper, however, we show that in the case of text classification, tem-frequency transformations have a larger impact on the performance of SVM than the kernel itself. We discuss the role of importance-weights (e. g. Document frequency and redundancy), which is not yet fully understood in the light of model complexity and calculation cost, and we show that time consuming lemmatization or stemming can be avoided even when classifying a highly inflectional language like German.

机译：对于支持向量机的大多数应用程序，内核功能的选择至关重要。但是，在本文中，我们表明在文本分类的情况下，与内核本身相比，时频转换对SVM的性能影响更大。我们讨论了重要性权重（例如文档频率和冗余度）的作用，鉴于模型的复杂性和计算成本，这还没有完全被理解，并且我们展示了即使对高度曲折的分类，也可以避免费时的词形化或词干提取。像德语一样的语言。

著录项

来源
《Machine Learning》 |2002年第3期|p.423-444|共22页
作者
Edda Leopold; Jorg Kindermann;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
support vector machines; text classification; lemmatization; stemming;

机译：支持向量机;文字分类词形化发芽;

相似文献

外文文献
中文文献
专利

1. Text Document Categorization using Enhanced Sentence Vector Space Model and Bi-Gram Text Representation Model Based on Novel Fusion Techniques [J] . Abdisa Demissie Amensisa New Media and Mass Communication . 2020,第4期

机译：基于新型融合技术的基于增强句子矢量空间模型和双革文本表示模型的文本文档分类
2. Afaan Oromo News Text Categorization using Decision Tree Classifier and Support Vector Machine: A Machine Learning Approach [J] . Kamal Mohammed Jimalo, Ramesh Babu P, Yaregal Assabie International Journal of Computer Trends and Technology . 2017,第1期

机译：使用决策树分类器和支持向量机的Afaan Oromo新闻文本分类：一种机器学习方法
3. Early detection of gradual concept drifts by text categorization and Support Vector Machine techniques: The TRIO algorithm [J] . M. Marseguerra Reliability Engineering & System Safety . 2014,第sepa期

机译：通过文本分类和支持向量机技术对渐变概念漂移进行早期检测：TRIO算法
4. A survey on text document categorization using enhanced sentence vector space model and bi-gram text representation model based on novel fusion techniques [C] . Abdisa Demissie Amensisa, Seema Patil, Poorva Agrawal 2018 2nd International Conference on Inventive Systems and Control . 2018

机译：基于新型融合技术的增强句向量空间模型和二元语法文本表示模型对文本文档分类的研究
5. An examination of KSS for feature selection for text categorization using support vector machines. [D] . Basu, Atreya. 2005

机译：使用支持向量机检查用于文本分类的特征选择的KSS。
6. Use of a support vector machine for categorizing free-text notes: assessment of accuracy across two institutions [O] . Adam Wright, Allison B McCoy, Stanislav Henkin, 2013

机译：使用支持向量机对自由文本注释进行分类：评估两个机构的准确性
7. Text categorization with support vector machines. How to represent texts in input space? [O] . Leopold, E., Kindermann, J. 2002

机译：使用支持向量机进行文本分类。如何在输入空间中表示文本？

Text Categorization with Support Vector Machines. How to Represent Texts in Input Space?

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅