首页>
外国专利>
Automated category discovery for a terminological knowledge base
Automated category discovery for a terminological knowledge base
展开▼
机译:术语知识库的自动类别发现
展开▼
页面导航
摘要
著录项
相似文献
摘要
A terminological system automatically generates sub-categories from categories of a knowledge base. The knowledge base includes a plurality of hierarchically arranged categories, as well as terms associated with the categories. A subset of the categories of the knowledge base are designated “dimensional categories.” The system also stores a corpus of documents, including themes and corresponding theme weights for each document. A target category is selected to generate sub-categories. A set of themes from the corpus of documents are selected for each term. Dimensional category vectors, one for each term, are generated by associating the set of themes for a term to a dimensional category in the knowledge base. The dimensional category vectors for each term are analyzed to determine if one or more clusters of terminological groups exist to generate new sub-categories. A content processing system, which generates themes and theme weights, is also disclosed.
展开▼