
Model learning method, information extraction method, model learning device, information extraction apparatus, model learning program, information extraction program, and recording medium on which these programs are recorded


Abstract

PROBLEM TO BE SOLVED: To provide a model learning method and an information extraction method that reduce the influence of recognition errors.

SOLUTION: For a discriminative model, a terminal device generates model information during model learning from two kinds of training data, both annotated with extraction target information: recognized word string training data, which contain recognition errors, and reference word string training data, which contain no errors. The recognition confidence feature is binarized, and the model information is generated using both the recognized word strings and the reference word strings. At extraction time, each word recognized from the input data is given a recognition confidence feature indicating whether it was recognized correctly: if the recognition confidence exceeds a certain threshold, the word is treated as "correctly" recognized; otherwise it is treated as "incorrectly" recognized. The generated model information is then used to assign extraction target information to the recognized word string, and words are extracted based on the assigned extraction target information.

COPYRIGHT: (C)2008, JPO&INPIT
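The thresholding step and the pooling of the two kinds of training data can be illustrated with a minimal Python sketch. This is not the patented implementation: the `Token` type, the feature names, the label scheme ("O" for non-target words), and the choice to give reference words a confidence of 1.0 are all assumptions made for illustration, and the discriminative model itself is left out.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Token:
    """Hypothetical token type; none of these field names come from the abstract."""
    word: str
    confidence: float  # recognizer confidence; assumed 1.0 for reference text
    label: str = "O"   # extraction target label; "O" is assumed to mean "not a target"


def binarize(confidence: float, threshold: float) -> str:
    """Binarize the confidence: strictly above the threshold counts as correct."""
    return "correct" if confidence > threshold else "incorrect"


def features(sentence: List[Token], i: int, threshold: float) -> Dict[str, str]:
    """Build an illustrative feature dict for the i-th token of a sentence."""
    tok = sentence[i]
    prev_word = sentence[i - 1].word if i > 0 else "<s>"
    return {
        "word": tok.word,
        "prev_word": prev_word,
        "recognition": binarize(tok.confidence, threshold),
    }


def build_training_set(
    recognized: List[List[Token]],
    reference: List[List[Token]],
    threshold: float,
) -> List[Tuple[Dict[str, str], str]]:
    """Pool error-containing recognized sentences and error-free reference sentences."""
    examples = []
    for sentence in recognized + reference:
        for i, tok in enumerate(sentence):
            examples.append((features(sentence, i, threshold), tok.label))
    return examples


def extract(sentence: List[Token], labels: List[str]) -> List[str]:
    """Return the words whose assigned label marks them as extraction targets."""
    return [tok.word for tok, lab in zip(sentence, labels) if lab != "O"]
```

The resulting `(features, label)` pairs could be passed to any discriminative sequence or token classifier; at extraction time the same `features` function would be applied to the recognizer output before the model assigns labels.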
