首页> 外文期刊>International journal of web information systems >Web-based methodology for extracting technology words in Chinese process patents
【24h】

Web-based methodology for extracting technology words in Chinese process patents

机译:基于Web的方法从中国工艺专利中提取技术单词的方法

获取原文
获取原文并翻译 | 示例
           

摘要

Purpose - The purpose of constructing the technology/function matrix is to analyze the patents in the target domain. The extraction of technology words is an important part of the construction of technology/ function matrix. This algorithm is used to solve the problem of low efficiency of traditional Chinese process patents technology words extraction. Design/methodology/approach - The authors propose a Chinese process patents technology words extraction method based on the improved term frequency-inverse document frequency (TF-IDF) algorithm to help technicians obtain the technology words in the target domain. According to the characteristics of Chinese process patents technology words, the TF value of candidate technology words is divided into four parts, and the corpus of IDF value calculation of candidate technology words is selected. Findings - Through the test of Chinese process patents in the domain of path planning, this study shows that the method is feasible and practical. It can help users quickly and accurately obtain the technology words of Chinese process patents in the target domain Practical implications - With the increasing number of patents on the network-based patent information platform, patent analysis of massive Chinese process patents has become a research focus. The method proposed in this paper can facilitate users to extract technology words from massive Chinese process patents for patent analysis. Originality/value - This paper aims to improve the efficiency of Chinese process patents technology words extraction. The authors hope that the proposed method can reduce the labor and time cost of Chinese process patents technology words extraction.
机译:目的 - 构建技术/功能矩阵的目的是分析目标域中的专利。技术词语的提取是技术/功能矩阵构造的重要组成部分。该算法用于解决传统中国工艺专利技术单词提取效率低的问题。设计/方法/方法 - 作者提出了一种基于改进术语频率 - 逆文档频率(TF-IDF)算法的中国工艺专利技术单词提取方法,帮助技术人员获得目标域中的技术单词。根据中国工艺专利技术单词的特点,候选技术词的TF值分为四个部分,选择了IDF值计算的候选技术词语的语音。调查结果 - 通过对路径规划领域的中国工艺专利的考验,本研究表明该方法是可行和实用的。它可以帮助用户快速准确地获得中国工艺专利的技术词在目标领域的实际意义 - 随着越来越多的专利对网络的专利信息平台,大规模中国工艺专利的专利分析已成为研究重点。本文提出的方法可以促进用户从大规模中国工艺专利中提取技术单词进行专利分析。原创性/价值 - 本文旨在提高中国工艺专利技术单词提取的效率。作者希望提出的方法可以减少中国工艺专利技术单词提取的劳动力和时间成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号