【24h】

Modeling technological topic changes in patent claims

机译:模拟专利声明中的技术主题变更

获取原文
获取原文并翻译 | 示例

摘要

Patent claims usually embody the most essential terms and the core technological scope to define the protection of an invention, which makes them the ideal resource for patent content and topic change analysis. However, manually conducting content analysis on massive technical terms is very time consuming and laborious. Even with the help of traditional text mining techniques, it is still difficult to model topic changes over time, because single keywords alone are usually too general or ambiguous to represent a concept. Moreover, term frequency which used to define a topic cannot separate polysemous words that are actually describing a different theme. To address this issue, this research proposes a topic change identification approach based on Latent Dirichlet Allocation to model and analyze topic changes with minimal human intervention. After textual data cleaning, underlying semantic topics hidden in large archives of patent claims are revealed automatically. Concepts are defined by probability distributions over words instead of term frequency, so that polysemy is allowed. A case study using patents published in the United States Patent and Trademark Office (USPTO) from 2009 to 2013 with Australia as their assignee country is presented to demonstrate the validity of the proposed topic change identification approach. The experimental result shows that the proposed approach can be used as an automatic tool to provide machine-identified topic changes for more efficient and effective R&D management assistance.
机译:专利权利要求通常包含最基本的术语和定义发明保护的核心技术范围,这使它们成为进行专利内容和主题变更分析的理想资源。但是,以大量技术术语手动执行内容分析非常耗时且费力。即使借助传统的文本挖掘技术,仍然很难对主题随时间的变化进行建模,因为单独的单个关键字通常过于笼统或模棱两可,无法代表一个概念。而且,用于定义主题的术语频率不能分离实际上描述不同主题的多义词。为了解决这个问题,本研究提出了一种基于潜在狄利克雷分配的主题变化识别方法,以最少的人工干预对主题变化进行建模和分析。清除文本数据后,将自动显示隐藏在大型专利权利要求档案中的潜在语义主题。概念是通过单词上的概率分布而不是词频来定义的,因此允许多义性。案例研究使用了2009年至2013年在美国专利商标局(USPTO)中公开的专利,澳大利亚为受让人国,以证明所提出的主题变更识别方法的有效性。实验结果表明,所提出的方法可以用作自动工具,提供机器识别的主题更改,从而更有效地进行R&D管理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号