Signal Processing and Communications Applications Conference

Two new feature extraction methods for text classification: TESDF and SADF

Abstract

In this study, two new document weighting methods are proposed, based on the term frequency-inverse document frequency (TF-IDF) scheme commonly used in text mining. In addition, the insignificance of verbs in text classification is put forward and tested as a new pre-processing step. Compared with other methods, the proposed methods gave better results. It was also observed that omitting the verbs from the texts left the performance rate almost unchanged while reducing the size of the processed data.
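The abstract does not define TESDF or SADF, so the following is only a minimal sketch of the baseline it builds on: plain TF-IDF weighting, preceded by a verb-dropping pre-processing step. The function names, the toy corpus, and the hand-written verb set (standing in for a real part-of-speech tagger) are all assumptions made for illustration and do not come from the paper.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document (plain TF-IDF)."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            # Relative term frequency times log inverse document frequency.
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

def drop_verbs(tokens, verbs):
    """Remove tokens found in a verb set (a stand-in for a real POS tagger)."""
    return [t for t in tokens if t not in verbs]

# Hypothetical toy corpus and verb list, for illustration only.
VERBS = {"wins", "plays", "rises", "falls"}
corpus = [
    "team wins match".split(),
    "team plays match today".split(),
    "market rises today".split(),
]
filtered = [drop_verbs(doc, VERBS) for doc in corpus]
weights = tf_idf(filtered)
```

Here `filtered` holds fewer tokens than `corpus`, which mirrors the reduction in processed data size that the abstract reports; whether classification accuracy is preserved would have to be checked on a real dataset.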