【24h】

Rule-based Word Clustering for Text Classification

机译:基于规则的词聚类用于文本分类

获取原文
获取原文并翻译 | 示例

摘要

This paper introduces a rule-based, context-dependent word clustering method, with the rules derived from various domain databases and the word text orthographic properties. Besides significant dimensionality reduction, our experiments show that such rule-based word clustering improves by 8% the overall accuracy of extracting bibliographic fields from references, and by 18.32% on average the class-specific performance on the line classification of document headers.
机译:本文介绍了一种基于规则的,上下文相关的词聚类方法,该方法具有从各种域数据库和词文本正字法属性派生的规则。除了大幅减少维度外,我们的实验还表明,这种基于规则的词聚类可将参考书目字段的总体准确度提高8%,平均可提高18.32%的文档标题行分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号