...
首页> 外文期刊>Statistics and computing >Semi-parametric analysis of multi-rater data
【24h】

Semi-parametric analysis of multi-rater data

机译:多评估者数据的半参数分析

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Datasets that are subjectively labeled by a number of experts are becoming more common in tasks such as biological text annotation where class definitions are necessarily somewhat subjective. Standard classification and regression models are not suited to multiple labels and typically a pre-processing step (normally assigning the majority class) is performed. We propose Bayesian models for classification and ordinal regression that naturally incorporate multiple expert opinions in defining predictive distributions. The models make use of Gaussian process priors, resulting in great flexibility and particular suitability to text based problems where the number of covariates can be far greater than the number of data instances. We show that using all labels rather than just the majority improves performance on a recent biological dataset.
机译:由许多专家主观标记的数据集在诸如生物文本注释之类的任务中变得越来越普遍,在这些任务中类定义必然有些主观。标准分类和回归模型不适用于多个标签,通常需要执行预处理步骤(通常分配多数类)。我们提出了用于分类和序数回归的贝叶斯模型,该模型自然包含了多个专家意见来定义预测分布。这些模型利用了高斯过程先验,因此具有很大的灵活性,并且特别适合于基于文本的问题,因为协变量的数量可能远大于数据实例的数量。我们表明,使用所有标签而不是仅使用大多数标签可以改善最近的生物学数据集的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号