Text classification using concept kernel

Abstract

Texts may be classified by mapping them to a concept space and dividing that space according to substantive classes. A concept space containing a diverse set of concepts is defined. One example of a concept space is the set of online encyclopedia articles, each of which is an example of a concept. A text is scored for relevance against each concept, and the scores are collected into a vector that represents the text's position in concept space. For any given substantive class of texts, the concept space may be divided into a region containing texts that are members of the class and a region containing texts that are not. The dividing boundary may be determined by training a classifier on a set of labeled examples of texts that fall inside and outside the class.
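The pipeline described in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the three toy concepts in `CONCEPTS` stand in for encyclopedia articles, the relevance score is cosine similarity over word counts, and the classifier is a simple perceptron. The abstract does not name a scoring function or a classifier type.

```python
import math
from collections import Counter

# Hypothetical stand-ins for encyclopedia articles; each entry is one concept.
CONCEPTS = {
    "sports": "football match team player goal score league",
    "finance": "bank market stock price money investor trade",
    "cooking": "recipe oven flour sugar bake salt pan",
}


def bag_of_words(text):
    """Count lowercase whitespace-separated tokens."""
    return Counter(text.lower().split())


def relevance(text, concept_text):
    """Cosine similarity of word counts (an assumed scoring function)."""
    a, b = bag_of_words(text), bag_of_words(concept_text)
    dot = sum(a[w] * b[w] for w in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def to_concept_vector(text):
    """Score the text against every concept: its position in concept space."""
    return [relevance(text, concept) for concept in CONCEPTS.values()]


def train_perceptron(vectors, labels, epochs=20, lr=1.0):
    """Learn a linear boundary in concept space; labels are +1 or -1."""
    weights = [0.0] * len(vectors[0])
    bias = 0.0
    for _ in range(epochs):
        for x, y in zip(vectors, labels):
            activation = sum(w * xi for w, xi in zip(weights, x)) + bias
            if y * activation <= 0:  # misclassified or on the boundary
                weights = [w + lr * y * xi for w, xi in zip(weights, x)]
                bias += lr * y
    return weights, bias


def predict(weights, bias, text):
    """Return +1 if the text falls on the member side of the boundary."""
    x = to_concept_vector(text)
    activation = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if activation > 0 else -1


# Labeled examples for a hypothetical class "about sports".
TRAIN_TEXTS = [
    "the team scored a goal in the match",
    "player joins football league",
    "stock price fell as investor sold",
    "bake the flour and sugar in the oven",
]
TRAIN_LABELS = [1, 1, -1, -1]

weights, bias = train_perceptron(
    [to_concept_vector(t) for t in TRAIN_TEXTS], TRAIN_LABELS
)
print(predict(weights, bias, "the league match ended with a late goal"))
print(predict(weights, bias, "the bank raised money on the stock market"))
```

In this sketch the classifier never sees the raw words of a text, only its scores against the concepts, so adding concepts adds dimensions to the space without changing the training code.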
