首页> 外文会议>Intelligent System and Knowledge Engineering, 2008 3rd International Conference on >A feature extraction method using base phrase and keyword in Chinese text
【24h】

A feature extraction method using base phrase and keyword in Chinese text

机译:基于中文短语和关键词的特征提取方法

获取原文

摘要

The feature extraction is the most key technology of text categorization. The word is used as the feature in the traditional text classification, and its effect for the text classification is evidence. The feature extraction method using base phrase and keyword changes the feature extraction of Chinese text from syntax and semantic further. In the first, analyzing the feature of baseNP and basedVP, and then make some words into baseNP and baseVP which accord to the rules of phrase, give WSD to other words in the finally. The paper proposes a stepwise feature extraction from word to phrase. The experiment results show that this method can perform much better than traditional feature extraction method, it can improve the text classification precision and recall.
机译:特征提取是文本分类的最关键技术。在传统的文本分类中,该词被用作特征,它对文本分类的作用是有力的证据。使用基本短语和关键字的特征提取方法进一步改变了中文文本从语法和语义上的特征提取。首先,分析了baseNP和baseVP的特点,然后将一些符合词组规则的词变成baseNP和baseVP,最后将WSD赋予其他词。本文提出了从单词到短语的逐步特征提取。实验结果表明,该方法性能优于传统特征提取方法,可以提高文本分类的准确性和查全率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号