首页>
外国专利>
EFFICIENTLY BUILDING COMPACT MODELS FOR LARGE TAXONOMY TEXT CLASSIFICATION
EFFICIENTLY BUILDING COMPACT MODELS FOR LARGE TAXONOMY TEXT CLASSIFICATION
展开▼
机译:高效地建立紧凑模型,用于大分类法文本分类
展开▼
页面导航
摘要
著录项
相似文献
摘要
A taxonomy model is determined with a reduced number of weights. For example, the taxonomy model is a tangible representation of a hierarchy of nodes that represents a hierarchy of classes that, when labeled with a representation of a combination of weights, is usable to classify documents having known features but unknown class. For each node of the taxonomy, the training example documents are processed to determine the features for which there are a sufficient number of training example documents having a class label corresponding to at least one of the leaf nodes of a subtree having that node as a root node. For each node of the taxonomy, a sparse weight vector is determined for that node, including setting zero weights, for that node, those features determined to not appear at least a minimum number of times in a given set of leaf nodes in the sub-tree with that node as a root node. The sparse weight vectors can be learned by solving an optimization problem using a maximum entropy classifier, or a large margin classifier with a sequential dual method (SDM) with margin or slack resealing. The determined sparse weight vectors are tangibly embodied in a computer-readable medium in association with the tangible representation of the nodes of the taxonomy.
展开▼