Large scale unsupervised hierarchical document categorization using ontological guidance
Abstract
A classification method includes constructing queries from category descriptors representing categories of a taxonomy of hierarchically organized categories. The query constructed for a category c includes a query component based on descriptors of the category c and at least one query component based on descriptors of an ancestor or descendant category of the category c. A document database is queried using the constructed queries to retrieve pseudo-relevant documents. Language models for the categories of the taxonomy are extracted from the pseudo-relevant documents by inferring a hierarchical topic model representing the taxonomy. An input document is classified by optimizing mixture weights of a weighted combination of categories of the hierarchical topic model with respect to the input document.
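The two ends of the pipeline described above, query construction and mixture-weight optimization, can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and does not come from the patent: the toy `TAXONOMY`, the `ancestor_weight` of 0.5, the use of ancestor descriptors only (the abstract also allows descendants), and the choice of EM over fixed unigram language models as the optimizer. The retrieval step and the hierarchical topic model inference are omitted, so the language models are supplied by hand.

```python
from collections import Counter

# Hypothetical toy taxonomy: category -> (parent, descriptor terms).
TAXONOMY = {
    "science": (None, ["science"]),
    "physics": ("science", ["physics", "quantum"]),
    "biology": ("science", ["biology", "cell"]),
}


def ancestors(cat):
    """Return the ancestors of cat, nearest first."""
    chain = []
    parent = TAXONOMY[cat][0]
    while parent is not None:
        chain.append(parent)
        parent = TAXONOMY[parent][0]
    return chain


def build_query(cat, ancestor_weight=0.5):
    """Build a weighted term query for cat.

    One component comes from the category's own descriptors (weight 1.0);
    a second, down-weighted component comes from its ancestors' descriptors.
    The weighting scheme is an assumption, not the patent's.
    """
    query = Counter()
    for term in TAXONOMY[cat][1]:
        query[term] += 1.0
    for anc in ancestors(cat):
        for term in TAXONOMY[anc][1]:
            query[term] += ancestor_weight
    return dict(query)


def fit_mixture_weights(doc_tokens, language_models, iterations=50, floor=1e-9):
    """Estimate mixture weights over fixed per-category unigram models with EM.

    language_models maps category -> {term: probability}; unseen terms get
    the small probability `floor`. Assumes doc_tokens is non-empty.
    """
    cats = list(language_models)
    weights = {c: 1.0 / len(cats) for c in cats}
    for _ in range(iterations):
        totals = {c: 0.0 for c in cats}
        for tok in doc_tokens:
            # E-step: share of this token attributed to each category.
            joint = {c: weights[c] * language_models[c].get(tok, floor) for c in cats}
            norm = sum(joint.values())
            for c in cats:
                totals[c] += joint[c] / norm
        # M-step: new weight is the average share across tokens.
        weights = {c: totals[c] / len(doc_tokens) for c in cats}
    return weights


# Hand-written language models standing in for the inferred topic model.
MODELS = {
    "science": {"science": 0.8, "physics": 0.1, "biology": 0.1},
    "physics": {"physics": 0.5, "quantum": 0.4, "science": 0.1},
    "biology": {"biology": 0.5, "cell": 0.4, "science": 0.1},
}

query = build_query("physics")
weights = fit_mixture_weights(["quantum", "physics", "quantum", "science"], MODELS)
predicted = max(weights, key=weights.get)
```

In this sketch the input document is assigned to the category with the largest fitted weight; a fuller implementation would take the language models from the hierarchical topic model inferred over the pseudo-relevant documents rather than from a hand-written table.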