An Investigation on Linear SVM and its Variants for Text Categorization

机译：线性SVM及其文本分类变体的研究

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Linear Support Vector Machines (SVMs) have been used successfully to classify text documents into set of concepts. With the increasing number of linear SVM formulations and decomposition algorithms publicly available, this paper performs a study on their efficiency and efficacy for text categorization tasks. Eight publicly available implementations are investigated in terms of Break Even Point (BEP), F1 measure, ROC plots, learning speed and sensitivity to penalty parameter, based on the experimental results on two benchmark text corpuses. The results show that out of the eight implementations, SVMlin and Proximal SVM perform better in terms of consistent performance and reduced training time. However being an extremely simple algorithm with training time independent of the penalty parameter and the category for which training is being done, Proximal SVM is appealing. We further investigated fuzzy proximal SVM on both the text corpuses; it showed improved generalization over proximal SVM.

机译：线性支持向量机（SVM）已成功使用，将文本文档分类为一组概念。随着越来越多的线性SVM制剂和分解算法公开可用，本文对文本分类任务的效率和功效进行了研究。基于两个基准文本语料库的实验结果，根据休息点（BEP），F1测量，ROC地块，学习速度和敏感性来调查八个公开的实施。结果表明，在八个实现中，SVMLIN和近端SVM在一致的性能和减少的训练时间方面表现更好。然而，具有独立于惩罚参数的训练时间和正在进行培训的类别的极其简单的算法，近端SVM是吸引人的。我们进一步调查了文本语料中的模糊近端SVM;它显示出改善了近端SVM的泛化。

著录项

来源
《International Conference on Machine Learning and Computing》|2010年||共5页
会议地点
作者
Kumar M. Arun; Gopal M.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP181-53;
关键词
Fuzzy Proximal SVM; Proximal SVM; Support vector machines; Text categorization;

机译：模糊近端SVM;近端SVM;支持矢量机;文本分类;

相似文献

外文文献
中文文献
专利

1. Chi Square Feature Extraction Based Svms Arabic Language Text Categorization System | Science Publications [J] . Abdelwadood M.A. MESLEH Journal of computer sciences . 2007,第6期

机译：基于卡方特征提取的Svms阿拉伯语文本分类系统科学出版物
2. Automated Arabic Text Categorization Using SVM and NB [J] . Saleh Alsaleem International Arab Journal of e-Technology . 2011,第2期

机译：使用SVM和NB自动进行阿拉伯文本分类
3. Ranking and Selecting Terms for Text Categorization via SVM Discriminate Boundary [J] . Tien-Fang Kuo, Yasutoshi Yajima International journal of entelligent systems . 2010,第2期

机译：通过SVM区分边界对文本分类进行排名和选择术语
4. An Investigation on Linear SVM and its Variants for Text Categorization [C] . Kumar M. Arun, Gopal M. . 2010

机译：线性SVM及其变体的文本分类研究
5. Tracking changes: A proposal for a linguistically sensitive schema for categorizing textual variation of Hebrew bible texts in light of variant scribal practices among the Judaean Desert psalms witnesses. [D] . Sigrist, David J. 2015

机译：跟踪变化：提议一种语言敏感的模式，根据犹太沙漠圣诗目击者的不同抄写手法，对希伯来圣经文本的文本变化进行分类。
6. A Linear-RBF Multikernel SVM to Classify Big Text Corpora [O] . R. Romero, E. L. Iglesias, L. Borrajo -1

机译：用于对大文本语料库进行分类的线性RBF多核SVM
7. Effective Shrinkage of Large Multi-class Linear SVM Models for Text Categorization [O] . Jianxiong Dong, C. Y. Suen 2012

机译：用于文本分类的大型多类线性SVM模型的有效收缩

An Investigation on Linear SVM and its Variants for Text Categorization

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅