【24h】

An improved TF-IDF approach for text classification

机译:改进的TF-IDF文本分类方法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a new improved term frequency/inverse document frequency (TF-IDF) approach which uses confidence, support and characteristic words to enhance the recall and precision of text classification. Synonyms defined by a lexicon are processed in the improved TF-IDF approach. We detailedly discuss and analyze the relationship among confidence, recall and precision. The experiments based on science and technology gave promising results that the new TF-IDF approach improves the precision and recall of text classification compared with the conventional TF-IDF approach.
机译:本文提出了一种新的改进的术语频率/文档反向频率(TF-IDF)方法,该方法使用置信度,支持度和特征词来提高文本分类的回忆性和准确性。在改进的TF-IDF方法中处理由词典定义的同义词。我们详细讨论和分析置信度,召回率和准确性之间的关系。基于科学和技术的实验给出了令人鼓舞的结果,即与传统的TF-IDF方法相比,新的TF-IDF方法提高了文本分类的精度和召回率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号