【24h】

Event induced bias in label fusion

机译:事件引起的标签融合偏倚

获取原文
获取原文并翻译 | 示例

摘要

In a two class label scenario, classification systems may be used to assess whether or not an element of interest belongs to the "target" or "non-target" class. The performance of the system is summarized visually as a tradeoff between the proportions of elements correctly labeled as "target" plotted against the proportion of elements incorrectly labeled as "target." These proportions are empirical estimates of the true and false positive rates, and their trade-off plot is known as a receiver operating characteristic (ROC) curve. Classification performance can be increased, however, if the information provided by multiple systems can be fused together to create a new, combined system. This research focuses on label-fusion as a common method to increase classification performance and quantifying the bias that occurs when misspecifying the partitioning of the underlying event set. This partitioning will be defined in terms of what be called within and across label fusion. When incorrect assumptions are made about the partitioning of the event set, bias will occur and both the ROC curve and its optimal parameters will be incorrectly quantified. In this work, we analyze the effects of individual classification system performance, correlation, and target environment on the magnitude of this performance bias. This work will then inspire the development of formulas to adjust optimal performance measures to appropriately reflect the fused system performance according to event set partitioning. As such, bias may be appropriately adjusted without redesigning the fused system, allowing greater use of currently fused systems across multiple platforms and environments.
机译:在两类标签的情况下,分类系统可用于评估感兴趣的元素是否属于“目标”或“非目标”类。该系统的性能从视觉上总结为正确标记为“目标”的元素比例与错误标记为“目标”的元素比例之间的折衷。这些比例是对真实和假阳性率的经验估计,它们的折衷图被称为接收器工作特性(ROC)曲线。但是,如果可以将多个系统提供的信息融合在一起以创建新的组合系统,则可以提高分类性能。这项研究将标签融合作为提高分类性能和量化错误指定基础事件集分区时出现的偏差的常用方法。将根据标签融合内和融合之间的定义来定义此分区。如果对事件集的划分做出错误的假设,则会出现偏差,并且ROC曲线及其最佳参数都将被错误地量化。在这项工作中,我们分析了各个分类系统性能,相关性和目标环境对这种性能偏差的影响。然后,这项工作将启发公式的制定,以调整最佳性能指标,以根据事件集划分适当地反映融合的系统性能。这样,可以在不重新设计融合系统的情况下适当地调整偏差,从而允许跨多个平台和环境更多地使用当前融合系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号