首页> 外文期刊>Computer Science & Information Technology >Semantic Document Classification based on Strategies of Semantic Similarity Computation and Correlation Analysis
【24h】

Semantic Document Classification based on Strategies of Semantic Similarity Computation and Correlation Analysis

机译:基于语义相似度计算和相关分析策略的语义文档分类

获取原文
       

摘要

Document (text) classification is a common method in e-business, facilitating users in the taskssuch as document collection, analysis, categorization and storage. Semantic analysis can helpto improve the performance of document classification. Though having been considered whendesigning previous methods for automatic document classification, more focus should be givento semantics with the increase number of content-rich electronic documents, forum posts orblogs online, which can reduce human workload by a great margin. This paper proposes anovel semantic document classification approach aiming to resolve two types of semanticproblems: (1) polysemy problem, by using a novel semantic similarity computing strategy (SSC)and (2) synonym problem, by proposing a novel strong correlation analysis method (SCM).Experiments show that our strategies can help to improve the performance of the baselinemethods.
机译:文档(文本)分类是电子商务中的一种常用方法,可帮助用户完成诸如文档收集,分析,分类和存储之类的任务。语义分析可以帮助提高文档分类的性能。尽管在设计自动文档分类的先前方法时已经考虑过,但是随着内容丰富的电子文档,论坛帖子或在线博客数量的增加,应该更加关注语义,这可以大大减少人员的工作量。本文提出了anovel语义文档分类方法,旨在解决两种类型的语义问题:(1)多义问题,使用一种新颖的语义相似度计算策略(SSC)和(2)同义词问题,并提出一种新颖的强相关分析方法(SCM) )。实验表明,我们的策略可以帮助改善基准方法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号