METHOD, SYSTEM AND COMPUTER PROGRAM FOR GENERATING A QUERY REPRESENTATION OF A DOCUMENT, AND QUERYING A DOCUMENT RETRIEVAL SYSTEM USING SAID QUERY REPRESENTATION
Abstract
In a method and system for generating a query representation of an electronic query document, the query document is processed by a computer processor. The processor is configured to identify words and sentences in the query document, generate for each word its corresponding part-of-speech (POS) category, identify each sequence of words having a predetermined sequence of POS categories, and store the identified sequences of words as the query representation of the query document.

In a method and system for querying a document retrieval system, the document retrieval system is queried with a plurality of the stored identified sequences of words, and target documents are retrieved from the document retrieval system. The target documents have at least one sequence of words in common with the query document.

In a method and system for clustering similar documents in a set of electronic documents, one document of the set is designated as a query document. The query document is processed to store identified sequences of words as its query representation. Each remaining document of the set is queried with a plurality of the stored identified sequences of words. A similarity value is determined for each such query, and the documents in the set are clustered based on the similarity values.
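The steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the toy lexicon `TOY_LEXICON`, the pattern list `POS_PATTERNS`, the overlap-based `similarity` measure, the `threshold` value, and all function names are hypothetical choices made for illustration, since the abstract does not specify the tagger, the predetermined POS sequences, or how the similarity value is computed.

```python
import re
from typing import Dict, List, Set, Tuple

# ASSUMPTION: a tiny hand-written lexicon stands in for a real POS tagger.
TOY_LEXICON: Dict[str, str] = {
    "the": "DET", "a": "DET",
    "fast": "ADJ", "neural": "ADJ", "large": "ADJ",
    "retrieval": "NOUN", "system": "NOUN", "network": "NOUN",
    "document": "NOUN", "index": "NOUN", "query": "NOUN",
    "ranks": "VERB", "builds": "VERB", "uses": "VERB",
}

# ASSUMPTION: example "predetermined" POS sequences; the abstract names none.
POS_PATTERNS: List[Tuple[str, ...]] = [
    ("ADJ", "NOUN"),
    ("NOUN", "NOUN"),
    ("ADJ", "NOUN", "NOUN"),
]

WordSeq = Tuple[str, ...]


def split_sentences(text: str) -> List[str]:
    """Split on sentence-ending punctuation and drop empty pieces."""
    return [s for s in re.split(r"[.!?]+", text) if s.strip()]


def tag(sentence: str) -> List[Tuple[str, str]]:
    """Pair each lowercase word with a POS category ('X' if unknown)."""
    words = re.findall(r"[a-z]+", sentence.lower())
    return [(w, TOY_LEXICON.get(w, "X")) for w in words]


def query_representation(text: str) -> Set[WordSeq]:
    """Collect every word sequence whose POS tags match a pattern."""
    found: Set[WordSeq] = set()
    for sentence in split_sentences(text):
        tagged = tag(sentence)
        for pattern in POS_PATTERNS:
            n = len(pattern)
            # Slide a window of the pattern's length over the sentence.
            for i in range(len(tagged) - n + 1):
                window = tagged[i:i + n]
                if tuple(pos for _, pos in window) == pattern:
                    found.add(tuple(word for word, _ in window))
    return found


def similarity(query_rep: Set[WordSeq], target_text: str) -> float:
    """ASSUMED measure: share of query sequences also found in the target."""
    if not query_rep:
        return 0.0
    shared = query_rep & query_representation(target_text)
    return len(shared) / len(query_rep)


def cluster(docs: Dict[str, str], query_id: str,
            threshold: float = 0.5) -> List[str]:
    """Return ids of the remaining documents at or above the threshold."""
    query_rep = query_representation(docs[query_id])
    return [doc_id for doc_id, text in docs.items()
            if doc_id != query_id
            and similarity(query_rep, text) >= threshold]
```

In this sketch, sequences are matched within a single sentence only, and a target document counts as a hit when it shares at least one stored sequence with the query document. A production system would presumably replace the lexicon with a trained tagger and send the stored sequences to a retrieval index as phrase queries rather than re-tagging every target document.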