首页> 外文会议>International Seminar on Application for Technology of Information and Communication >Performance Improvement Of Support Vector Machine (SVM) With Information Gain On Categorization Of Indonesian News Documents
【24h】

Performance Improvement Of Support Vector Machine (SVM) With Information Gain On Categorization Of Indonesian News Documents

机译:支持向量机(SVM)的性能改进与印度尼西亚新闻文件分类的信息增益

获取原文

摘要

More news articles which are unstoppable increasing, causing problems with grouping news according to appropriate kind of label. Therefore it is necessary to deal with the problem of grouping news by it's category like business news, political news, and sports news. The categorization of news document belong to text classification domain, a Machine Learning topic as an approach that addressed this problem. Various algorithms have been used in previous studies such as Bayesian techniques, k-Nearest Neighborhood, Neural Networks, and Support Vector Machine (SVM). This study provides an understanding of the SVM method for news categorization on Indonesian news dataset that contain several types of news category. Problems in text classification is the number of features that affecting classification performance with SVM. Use of Information Gain as feature selection improve accuracy than without any feature selection. Our model give satisfying result with 98,057% accuracy of Indonesia news classification. Improvement 2,9 points from 95,11% by SVM technique without feature selection.
机译:更多新闻文章,不可阻挡的增加,根据适当的标签造成分组新闻的问题。因此,有必要通过商业新闻,政治新闻和体育新闻等类别来处理分组新闻的问题。新闻文档的分类属于文本分类域,机器学习主题作为解决此问题的方法。在以前的研究中使用了各种算法,例如贝叶斯技术,K-CORMBIRY社区,神经网络和支持向量机(SVM)。本研究对INDonesian新闻数据集进行了新闻分类的SVM方法,提供了包含几种新闻类别的新闻分类。文本分类中的问题是使用SVM影响分类性能的功能数。使用信息增益作为特征选择提高了比没有任何特征选择的准确性。我们的模型提供了令人满意的结果,以98,057%的印度尼西亚新闻分类精度。通过SVM技术改进2,9点,无需特征选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号