首页> 外文会议>2011 Eighth Web Information Systems and Applications Conference >A Rules and Statistical Learning Based Method for Chinese Patent Information Extraction
【24h】

A Rules and Statistical Learning Based Method for Chinese Patent Information Extraction

机译:基于规则和统计学习的中文专利信息提取方法

获取原文
获取原文并翻译 | 示例

摘要

Patent documents, as a kind of open scientific literature protected by law, the abstracts of which often highly summarize the main information. Information extraction work and analysis of the abstracts can contribute to better protection of intellectual property rights and promotion of enterprise technological innovation. This paper focus on patent abstracts and view information extraction of patent documents as a short text categorization problem, a method based on the combination of rules and statistical learning is used to annotate and extract the information of patent features, composition and usage. Experiments show that our method can not only extract the above three types of information in the patent abstracts, but also has higher accuracy when compared to the rules based method or SVM, which is an efficient and commonly used statistical learning classification algorithm.
机译:专利文件,作为一种受法律保护的开放科学文献,其摘要通常高度概括主要信息。信息提取工作和摘要分析可有助于更好地保护知识产权并促进企业技术创新。本文着眼于专利摘要,将专利文献的信息抽取视为一种短文本分类问题,采用基于规则和统计学习相结合的方法对专利特征,组成和使用情况信息进行注释和抽取。实验表明,与基于规则的方法或支持向量机(SVM)相比,我们的方法不仅可以提取专利摘要中的上述三种信息,而且具有较高的准确性,后者是一种高效且常用的统计学习分类算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号