...
首页> 外文期刊>Engineering Technology and Applied Science Research >A Novel Summarization-based Approach for Feature Reduction Enhancing Text Classification Accuracy
【24h】

A Novel Summarization-based Approach for Feature Reduction Enhancing Text Classification Accuracy

机译:一种新的基于摘要的减少特征的方法,可提高文本分类的准确性

获取原文

摘要

Automatic summarization is the process of shortening one (in single document summarization) or multiple documents (in multi-document summarization). In this paper, a new feature selection method for the nearest neighbor classifier by summarizing the original training documents based on sentence importance measure is proposed. Our approach for single document summarization uses two measures for sentence similarity: the frequency of the terms in one sentence and the similarity of that sentence to other sentences. All sentences were ranked accordingly and the sentences with top ranks (with a threshold constraint) were selected for summarization. The summary of every document in the corpus is taken into a new document used for the summarization evaluation process.
机译:自动汇总是缩短一个文档(在单个文档中)或多个文档(在多文档中)的过程。提出了一种新的基于句子重要性度量的原始训练文档摘要,用于最近邻分类器的特征选择方法。我们对单个文档进行汇总的方法使用两种度量来实现句子相似性:一个句子中术语的频率以及该句子与其他句子的相似性。对所有句子进行相应排名,并选择具有最高排名(具有阈值约束)的句子进行汇总。语料库中每个文档的摘要都会纳入一个用于摘要评估过程的新文档中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号