首页> 外国专利> METHOD AND APPARATUS FOR SELECTING IMPORTANT FEATURE OF MULTI-LABEL DATA APPARATUS FOR SELECTING IMPORTANT WORD OF MULTI-CATEGORY DOCUMENT

METHOD AND APPARATUS FOR SELECTING IMPORTANT FEATURE OF MULTI-LABEL DATA APPARATUS FOR SELECTING IMPORTANT WORD OF MULTI-CATEGORY DOCUMENT

机译:用于选择多类别文档的重要单词的多标签数据设备的重要特征的方法和装置

摘要

The present invention relates to a method and a device to sort key characteristics of multilabel data, and a device to sort key words of a multicategory document. The method includes: a step in which an importance score calculating part calculates an importance score for each characteristic based on a mutual information quantity between all key labels selected from every label group in accordance with preset standards and each of all characteristics about multilabel data; a step in which an approximate importance score calculating part calculates an approximate importance score for each of the characteristics based on an approximate mutual information quantity between all nonimportant labels selected in accordance with preset standards and each of all the characteristics; and a step in which a total importance score calculating part calculates a total importance score for each of the characteristics by adding up the importance and approximate importance scores for each of all the characteristics.
机译:本发明涉及对多标签数据的关键特征进行分类的方法和设备,以及对多分类文档的关键词进行分类的设备。该方法包括:步骤,重要性分数计算部分基于根据预设标准从每个标签组中选择的所有关键标签与关于多标签数据的所有特征中的每个特征之间的互信息量,为每个特征计算重要性分数;步骤,在步骤中,近似重要性得分计算部分基于根据预设标准选择的所有非重要标签与所有特征中的每个之间的近似互信息量,为每个特征计算近似重要性得分;总重要性得分计算部分通过将所有特征的重要性和近似重要性得分相加来计算每个特征的总重要性得分。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号