首页> 外国专利> Efficient corpus search and annotation management for a question answering system

Efficient corpus search and annotation management for a question answering system

机译:有效的语料库搜索和注释管理,了解应答系统

摘要

A computer converts a question received in a natural language format into a string of text elements. The computer searches a corpus comprising unstructured passages with the string of the text elements as search terms to identify a selection of unstructured passages from the corpus relevant to the text elements. The computer annotates the selection of relevant unstructured passages with one or more annotations according to at least one natural language annotation type to generate an annotated selection knowledge base. The computer modifies the string of text elements by annotating at least one of the text elements according to the at least one natural language annotation type. The computer searches the annotated selection knowledge base using the modified string of text elements to generate a selection of ranked passages. The computer identifies an answer to the question based on the selection of ranked passages.
机译:计算机将以自然语言格式收到的问题转换为一串文本元素。计算机搜索包括与文本元素的字符串的语料库,作为搜索术语,以从与文本元素相关的语料库中识别非结构化段的选择。计算机注释与根据至少一种自然语言注释类型的一个或多个注释选择相关的非结构化段的选择,以生成注释选择知识库。通过根据至少一个自然语言注释类型注释至少一个文本元素来修改文本元素的字符串。计算机使用修改后的文本元素搜索注释的选择知识库以生成一系列排名段。计算机根据排列段落的选择识别问题的答案。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号