Information extraction device, information extraction method and information extraction program

Abstract

To provide an information extraction device, an information extraction method, and an information extraction program capable of accurately extracting documents having similar contents.

SOLUTION: An information extraction device 1 includes: an input unit 11 configured to receive an input of a key document serving as a search key; a first feature quantity calculation unit 12 configured to calculate a feature quantity based on a word included in the document; a first similarity calculation unit 13 configured to calculate, for the feature quantity of the key document, a first similarity with respect to the feature quantity of each of a plurality of search target documents accumulated in the past; an output unit 14 configured to output a search result based on the first similarity; and an evaluation unit 15 configured to receive an evaluation value for the search result and store the evaluation value in association with a combination of a word group included in the key document and a word group included in the search result. The first feature quantity calculation unit 12 calculates, based on the evaluation value, a bias value related to a distance between a word included in the key document and another word and incorporates the calculated bias value in the feature quantity. The first similarity calculation unit 13 adjusts the first similarity on the basis of the bias value.

SELECTED DRAWING: Figure 2
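The feedback loop described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the feature quantity, the similarity, or the bias value are actually computed, so everything below is an assumption made for illustration: bag-of-words counts stand in for the feature quantity, cosine similarity stands in for the first similarity, and the bias is the mean of stored evaluation values over matching word pairs, added directly to the similarity rather than folded into the feature quantity. The names `FeedbackSearch`, `features`, `cosine`, and `weight` are hypothetical and do not come from the patent.

```python
import math
from collections import Counter, defaultdict


def features(text):
    """Bag-of-words counts; an assumed stand-in for the 'feature quantity'."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class FeedbackSearch:
    """Hypothetical search loop: rank, collect an evaluation, re-rank with a bias."""

    def __init__(self, documents):
        self.documents = documents  # doc_id -> text (the accumulated search targets)
        # (key word, result word) -> list of evaluation values received so far
        self.pair_scores = defaultdict(list)

    def bias(self, key_words, doc_words):
        """Mean stored evaluation over word pairs seen in earlier feedback (assumed rule)."""
        values = [
            v
            for k in key_words
            for d in doc_words
            for v in self.pair_scores.get((k, d), [])
        ]
        return sum(values) / len(values) if values else 0.0

    def search(self, key_text, weight=0.5):
        """Return (doc_id, adjusted similarity) pairs, best match first."""
        key = features(key_text)
        scored = []
        for doc_id, text in self.documents.items():
            doc = features(text)
            scored.append((doc_id, cosine(key, doc) + weight * self.bias(key, doc)))
        return sorted(scored, key=lambda item: item[1], reverse=True)

    def evaluate(self, key_text, doc_id, value):
        """Store an evaluation value against every (key word, result word) pair."""
        key = features(key_text)
        doc = features(self.documents[doc_id])
        for k in key:
            for d in doc:
                self.pair_scores[(k, d)].append(value)


# Usage: a document sharing no words with the key gains score after positive feedback.
docs = {
    "a": "network outage in tokyo",
    "b": "billing error for customer",
    "c": "router failure osaka",
}
engine = FeedbackSearch(docs)
first_ranking = engine.search("network outage")
engine.evaluate("network outage", "c", 1.0)
second_ranking = engine.search("network outage")
```

In this sketch the stored evaluations let a document with no shared vocabulary rise in later searches, which is the general effect the abstract attributes to the bias value. The patented device ties the bias to a distance between words inside the feature quantity, a detail this simplification does not reproduce.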
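Bibliographic details

  • Publication number: JP6879983B2

  • Patent type: (not given)

  • Publication date: 2021-06-02

  • Original format: PDF

  • Applicant/patentee: KDDI株式会社

  • Application number: JP20180169685

  • Inventors: 渡邊 英; 岡田 圭司

  • Filing date: 2018-09-11

  • Classification: G06F16/24; G06F16/2455; G06F16/33; G06F16/903

  • Country: JP

  • Database entry time: 2024-06-14 21:38:30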
