首页> 外文会议>International Joint Conference on Neural Networks >Dealing with highly imbalanced textual data gathered into similar classes
【24h】

Dealing with highly imbalanced textual data gathered into similar classes

机译:处理收集到相似类中的高度不平衡的文本数据

获取原文

摘要

This paper deals with a new feature selection and feature contrasting approach for classification of highly imbalanced textual data with a high degree of similarity between associated classes. An example of such classification context is illustrated by the task of classifying bibliographic references into a patent classification scheme. This task represents one of the domains of investigation of the QUAERO project, with the final goal of helping experts to evaluate upcoming patents through the use of related research.
机译:本文研究了一种新的特征选择和特征对比方法,用于对高度不平衡的文本数据进行分类,并在相关类之间具有高度相似性。通过将书目参考文献分类为专利分类方案的任务来说明这种分类上下文的一个示例。该任务代表了QUAERO项目的研究领域之一,其最终目标是帮助专家通过使用相关研究评估即将到来的专利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号