首页> 外文会议>European Conference on Artificial Intelligence;Conference on Prestigious Applications of Intelligent Systems >Supervised Graph-Based Term Weighting Scheme for Effective Text Classification
【24h】

Supervised Graph-Based Term Weighting Scheme for Effective Text Classification

机译:基于图形的基于图表的术语加权方案,有效文本分类

获取原文

摘要

Due to the increase in electronic documents, automatic text classification has gained a lot of importance as manual classification of documents is time-consuming. Machine learning is the main approach for automatic text classification, where texts are represented, terms are weighted on the basis of the chosen representation and a classification model is built. Vector space model is the dominant text representation largely due to its simplicity. Graphs are becoming an alternative text representation that have the ability to capture important information in text such as term order, term co-occurrence and term relationships that are not considered by the vector space model. Substantially better text classification performance has been demonstrated for term weighting schemes which use a graph representation. In this paper, we introduce a graph-based term weighting scheme, tw-srw, which is an effective supervised term weighting method that considers the co-occurrence information in text for increasing text classification accuracy. Experimental results show that it outperforms the state-of-the-art unsupervised term weighting schemes.
机译:由于电子文件的增加,随着文件的手动分类是耗时的,自动文本分类已经增加了很多重要性。机器学习是自动文本分类的主要方法,其中表示文本,术语基于所选表示,构建了分类模型。矢量空间模型是主要的文本表示,主要是由于其简单性。图形正在成为替代文本表示,其具有捕获文本中的重要信息,例如术语顺序,术语共同发生和不被矢量空间模型考虑的术语关系。已经对使用图形表示的术语加权方案进行了基本更好的文本分类性能。在本文中,我们介绍了一种基于图形的术语加权方案,TW-SRW,这是一种有效的监督术语加权方法,其考虑文本中的共同发生信息以增加文本分类准确性。实验结果表明,它优于最先进的无监督术语加权方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号