首页> 外文会议>IEEE International Symposium on Signal Processing and Information Technology >Comparison of score normalization methods applied to multi-label classification
【24h】

Comparison of score normalization methods applied to multi-label classification

机译:分数标准化方法的比较应用于多标签分类

获取原文

摘要

Our paper deals with the multi-label text classification of the newspaper articles, where the classifier must decide if a document does or does not belong to each topic from the predefined topic set. A generative classifier is used to tackle this task and the problem with finding a threshold for the positive classification is mainly addressed. This threshold can vary for each document depending on the content of the document (words used, length of the document, etc.). An extensive comparison of the score normalization methods, primary proposed in the speaker identification/verification task, for robustly finding the threshold defining the boundary between the “correct” and the “incorrect” topics of a document is presented. Score normalization methods (based on World Model and Unconstrained Cohort Normalization) applied to the topic identification task has shown an improvement of results in our former experiments, therefore in this paper an in-depth experiments with more score normalization techniques applied to the multi-label classification were performed. Thorough analysis of the effects of the various parameters setting is presented.
机译:我们的论文处理了报纸文章的多标签文本分类,其中分类器必须决定文档是否有或不属于预定义主题集中的每个主题。生成分类器用于解决此任务,并且主要解决了查找正面分类的阈值的问题。每个文档可以根据文档的内容(文档的长度等)而异。呈现扬声器识别/验证任务中提出的分数标准化方法的广泛比较,用于稳健地查找定义“正确”和“不正确”主题之间的边界的阈值。评分标准化方法(基于世界模型和无约束群体规范化)应用于主题识别任务表明,我们以前的实验中的结果提高了结果,因此在本文中,一个深入的实验,具有更多分数标准化技术,适用于多标签进行分类。介绍了对各种参数设置的效果的彻底分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号