METHOD OF COMPUTERIZED SEMANTIC INDEXING OF NATURAL LANGUAGE TEXT, METHOD OF COMPUTERIZED SEMANTIC INDEXING OF COLLECTION OF NATURAL LANGUAGE TEXTS, AND MACHINE-READABLE MEDIA
Abstract
The present invention relates to the field of information technology, namely to methods of computerized semantic indexing of natural language texts. The invention extends the set of methods for indexing natural language texts by applying computerized linguistic analysis to them and using the results to build indices. These indices support semantic navigation through documents and document collections, as well as precise and fast search for facts and documents relevant to the user's information needs, particularly for texts in highly inflected languages. The method of computerized semantic indexing of a natural language text comprises the steps of: segmenting the text in electronic form into tokens; identifying stable phrases; forming sentences; identifying, by referring to linguistic and heuristic rules formed in a database in a predetermined linguistic environment, the semantically meaningful objects (named entities) and the semantically meaningful relations between them (named relations); for every named relation, forming a set of triples, in which a single triple of the first type corresponds to the relation established by the named relation between two named entities, each triple of the second type corresponds to the value of a particular attribute of one of those entities, and each triple of the third type corresponds to the value of a particular attribute of the named relation itself; over the set of formed triples, indexing separately all named entities linked by named relations, all pairs of the kind "named entity - named relation", and all triples of the kind "named entity - named relation - named entity", while taking into account the attributes of the respective named entities and/or named relations; and storing in the database the formed triples and the obtained indices together with a reference to the source text from which those triples were formed.
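
The triple-forming and indexing steps can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names, the `form_triples` and `build_indices` functions, the in-memory dictionaries standing in for the database, and the sample data are all hypothetical. It also assumes that named entities and relations have already been extracted, and it does not model how attributes influence the indices.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Triple:
    kind: int       # 1, 2, or 3: the three triple types named in the abstract
    subject: str
    predicate: str
    obj: str


@dataclass
class NamedEntity:
    name: str
    attributes: dict = field(default_factory=dict)


@dataclass
class NamedRelation:
    name: str
    source: NamedEntity
    target: NamedEntity
    attributes: dict = field(default_factory=dict)


def form_triples(rel):
    """Form the set of triples for one named relation."""
    # Type 1: the single triple linking the two named entities.
    triples = [Triple(1, rel.source.name, rel.name, rel.target.name)]
    # Type 2: one triple per attribute value of either entity.
    for entity in (rel.source, rel.target):
        for attr, value in entity.attributes.items():
            triples.append(Triple(2, entity.name, attr, str(value)))
    # Type 3: one triple per attribute value of the relation itself.
    for attr, value in rel.attributes.items():
        triples.append(Triple(3, rel.name, attr, str(value)))
    return triples


def build_indices(relations, doc_ref):
    """Index entities, entity-relation pairs, and entity-relation-entity
    triples, keeping a reference to the source text for each entry."""
    entity_index = defaultdict(set)
    pair_index = defaultdict(set)
    triple_index = defaultdict(set)
    stored = []  # (triple, source reference) records, standing in for the database
    for rel in relations:
        stored.extend((t, doc_ref) for t in form_triples(rel))
        for entity in (rel.source, rel.target):
            entity_index[entity.name].add(doc_ref)
            pair_index[(entity.name, rel.name)].add(doc_ref)
        triple_index[(rel.source.name, rel.name, rel.target.name)].add(doc_ref)
    return entity_index, pair_index, triple_index, stored


# Hypothetical sample data: one relation between two entities.
acme = NamedEntity("Acme Corp", {"type": "organization"})
alice = NamedEntity("Alice Smith", {"type": "person"})
employs = NamedRelation("employs", acme, alice, {"since": "2020"})

entities, pairs, triples, stored = build_indices([employs], "doc-1")
```

Keeping three separate indices mirrors the three lookup granularities the abstract lists: a query can start from an entity alone, from an entity together with a relation, or from a complete entity-relation-entity fact, and each lookup returns the references to the texts in which it occurs.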