首页> 外文会议> >Evaluating machine learning algorithms for applications with humans in the loop
【24h】

Evaluating machine learning algorithms for applications with humans in the loop

机译:评估人员参与的应用程序的机器学习算法

获取原文

摘要

Applications employing data classification such as smart lighting that involve human factors such as perception lead to non-deterministic input-output relationships where more than one output may be acceptable for a given input. For these so called non-deterministic multiple output classification (nDMOC) problems, the relationship between the input and output may change over time making it difficult for the machine learning (ML) algorithms in a batch setting to make predictions for a given context. In this paper, we describe the nature of nDMOC problems and discuss the Relevance Score (RS) that is suitable in this context as a performance metric. RS determines the extent by which a predicted output is relevant to the user's context and behaviors, taking into account the inconsistencies that come with human (perception) factors. We tailor the RS metric so that it can be used to evaluate ML algorithms in an online setting at run-time. We assess the performance of a number of ML algorithms, using a smart lighting dataset with non-deterministic one-to-many input-output relationships. The results indicate that using RS instead of classification accuracy (CA) is suitable to analyze the performance of conventional ML algorithms applied to the category of nDMOC problems. Instance-based online ML gives the best RS performance. An interesting finding is that the RS keeps increasing with increasing number of samples, even after the CA performance converges.
机译:采用诸如智能照明之类的数据分类的应用涉及诸如感知之类的人为因素,导致了不确定的输入-输出关系,对于给定的输入,不止一个输出是可以接受的。对于这些所谓的非确定性多输出分类(nDMOC)问题,输入和输出之间的关系可能会随着时间而变化,从而使批处理设置中的机器学习(ML)算法难以针对给定的上下文进行预测。在本文中,我们描述了nDMOC问题的性质,并讨论了在这种情况下适合用作性能指标的相关性得分(RS)。 RS考虑到人为因素(感知因素)带来的不一致性,确定了预测输出与用户的上下文和行为相关的程度。我们对RS度量进行了定制,以便可以在运行时在线设置中将其用于评估ML算法。我们使用具有不确定的一对多输入输出关系的智能照明数据集来评估许多ML算法的性能。结果表明,使用RS代替分类精度(CA)适合分析应用于nDMOC问题类别的常规ML算法的性能。基于实例的在线ML可提供最佳的RS性能。一个有趣的发现是,即使在CA性能收敛之后,RS也会随着样本数量的增加而不断增加。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号