首页> 外文会议>Advanced data mining and applications >Semantic Based Text Classification of Patent Documents to a User-Defined Taxonomy
【24h】

Semantic Based Text Classification of Patent Documents to a User-Defined Taxonomy

机译:专利文件到用户定义分类法的基于语义的文本分类

获取原文
获取原文并翻译 | 示例

摘要

We present a generic approach for semantic based classification of text documents to pre-defined categories. The proposed technique is applied to the domain of patent analytics for the purpose of classifying a collection of patent documents to one or many nodes in a user-defined taxonomy. The proposed approach is a multi-step process consisting of noun extraction, word sense disambiguation, semantic relat-edness computation between pair of words using WordNet and confidence score computation. The proposed algorithm resulted in good accuracy on experimental dataset and can be easily adapted and customized to other domains other the patent landscape analysis domain discussed in this paper.
机译:我们为文本文档的基于语义的分类提供一种通用方法,以将其分类为预定义的类别。所提出的技术被应用于专利分析领域,目的是将专利文件的集合分类到用户定义分类法中的一个或多个节点。所提出的方法是一个多步骤的过程,包括名词提取,词义消歧,使用WordNet的词对之间的语义相关性计算和置信度得分计算。提出的算法在实验数据集上具有良好的准确性,并且可以容易地适应和定制到本文讨论的专利态势分析领域以外的其他领域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号