首页> 外国专利> KNOWLEDGE EXTRACTING DEVICE, KNOWLEDGE EXTRACTING METHOD AND COMPUTER PROGRAM

KNOWLEDGE EXTRACTING DEVICE, KNOWLEDGE EXTRACTING METHOD AND COMPUTER PROGRAM

机译:知识提取设备,知识提取方法和计算机程序

摘要

PPROBLEM TO BE SOLVED: To provide a knowledge extracting method capable of further highly accurately extracting knowledge at a high speed, by using text data and structured data. PSOLUTION: This knowledge extracting device has a text dividing part 122 dividing the text data into words, a word pair making part 126 making a word pair by combining the words of the text data, a co-occurrence score calculating part 128 calculating a co-occurrence score of indicating an appearance degree of both composing words composing the word pair in a plurality of text data, a priority determining part 130 determining priority of the word pair based on the co-occurrence score, a text-time series data corresponding part 150 acquiring time series data corresponding to the composing word of the word pair, and a correlation coefficient calculating part 140 calculating a correlation coefficient of the word pair by using the time series data associated with the composing word of the word pair according to the determined priority. PCOPYRIGHT: (C)2009,JPO&INPIT
机译:

要解决的问题:提供一种知识提取方法,该方法能够通过使用文本数据和结构化数据进一步高精度地高速提取知识。

解决方案:该知识提取设备具有:文本划分部122,其将文本数据划分为单词;单词对生成部126,其通过组合文本数据的单词而构成单词对;同现分数计算部128,其计算表示在多个文本数据中组成单词对的两个组成单词的出现程度的共现分数,优先级确定部分130基于共现分数确定单词对的优先级,文本时间序列数据对应部分150获取与单词对的组成词相对应的时间序列数据,以及相关系数计算部分140通过使用与单词对的组成词相关联的时间序列数据,来计算单词对的相关系数。确定的优先级。

版权:(C)2009,日本特许厅&INPIT

著录项

  • 公开/公告号JP2008234618A

    专利类型

  • 公开/公告日2008-10-02

    原文格式PDF

  • 申请/专利权人 OKI ELECTRIC IND CO LTD;

    申请/专利号JP20070249739

  • 发明设计人 SHUDO KAZUHIKO;MATSUDAIRA MASAKI;

    申请日2007-09-26

  • 分类号G06F17/30;G06F17/27;

  • 国家 JP

  • 入库时间 2022-08-21 20:24:14

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号