首页>
外国专利>
Systems and methods for structural indexing of natural language text
Systems and methods for structural indexing of natural language text
展开▼
机译:自然语言文本的结构化索引系统和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
A structural natural language index is created by segmenting documents within a repository into text portions and extracting named entity, co-reference, lexical entries, structural-semantic relationships, speaker attribution and meronymic derived features. A constituent structure is determined that contains the constituent elements and ordering information sufficient to reconstruct the text portion. A functional structure of the text portions is determined. A set of characterizing predicative triples are formed from the functional structure by applying linearization transfer rules. The constituent structure, the characterizing predicative triples and the derived features are combined to form a canonical form of the text portion. Each canonical form is added to the structural natural language index. A retrieved question is classified to determine question type and a corresponding canonical form for the question is generated. The entries in the structural natural language index are searched for entries matching the canonical form of the question and relevant to the question type. The characterizing predicative triples are used in conjunction with a generation grammar to create an answer. If the generation fails, some or all of the constituent structure of the matching entry is returned as the answer.
展开▼