首页> 外国专利> A METHOD AND SYSTEM FOR DESCRIBING AND IDENTIFYING CONCEPTS IN NATURAL LANGUAGE TEXT FOR INFORMATION RETRIEVAL AND PROCESSING

A METHOD AND SYSTEM FOR DESCRIBING AND IDENTIFYING CONCEPTS IN NATURAL LANGUAGE TEXT FOR INFORMATION RETRIEVAL AND PROCESSING

机译:用于信息检索和处理的自然语言文本中描述和识别概念的方法和系统

摘要

A method for information retrieval that matches occurrences of concepts in natural language text documents against descriptions of concepts in user queries. Said method, implemented in a computer system, includes a preferred version of the method that comprises (1) annotating natural language text in documents and other text-forms with linguistic information and Concepts and Concept Rules expressed in a Concept Specification Language (CSL) for a particular domain, (2) pruning and optimizing synonyms for a particular domain, (3) defining and learning said CSL Concepts and Concept Rules, (4) checking user-defined descriptions of Concepts represented in CSL (including user queries), and (5) retrieval by matching said user-defined descriptions (and queries) against said annotated text. CSL is a language for expressing linguistically-based patterns. Said patterns can represent the linguistic manifestations of concepts in text. Said concepts may derive from the sublanguages used by experts to analyze specialized domains including, but not limited to, insurance claims, police incident reports, medical reports, and aviation incident reports.
机译:一种将自然语言文本文档中的概念出现与用户查询中的概念描述进行匹配的信息检索方法。在计算机系统中实现的所述方法包括该方法的优选版本,该优选版本包括:(1)使用语言信息以及以概念规范语言(CSL)表示的概念和概念规则来注释文档和其他文本形式中的自然语言文本。特定域,(2)修剪和优化特定域的同义词,(3)定义和学习所述CSL概念和概念规则,(4)检查用户定义的对CSL中表示的概念的描述(包括用户查询),以及( 5)通过将所述用户定义的描述(和查询)与所述带注释的文本进行匹配来进行检索。 CSL是用于表达基于语言的模式的语言。所述模式可以代表文本中概念的语言表达。所述概念可以源自专家用来分析专门领域的子语言,所述专门领域包括但不限于保险索赔,警察事件报告,医疗报告和航空事件报告。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号