首页> 外文会议>International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems(IEA/AIE 2006); 20060627-30; Annecy(FR) >New Methods for Text Categorization Based on a New, Feature Selection Method and a New Similarity Measure Between Documents
【24h】

New Methods for Text Categorization Based on a New, Feature Selection Method and a New Similarity Measure Between Documents

机译:基于新的特征选择方法和新的文档间相似度度量的文本分类新方法

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we present a new feature selection method based on document frequencies and statistical values. We also present a new similarity measure to calculate the degree of similarity between documents. Based on the proposed feature selection method and the proposed similarity measure between documents, we present three methods for dealing with the Reuters-21578 top 10 categories text categorization. The proposed methods get higher performance for dealing with the Reuters-21578 top 10 categories text categorization than that of the method presented in [4].
机译:在本文中,我们提出了一种基于文档频率和统计值的新特征选择方法。我们还提出了一种新的相似性度量来计算文档之间的相似度。基于建议的特征选择方法和建议的文档间相似度度量,我们提出了三种用于处理Reuters-21578十大类别文本分类的方法。所提出的方法在处理Reuters-21578十大类别文本分类方面比[4]中提出的方法具有更高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号