首页> 外文期刊>MIS quarterly >IS AI GROUND TRUTH REALLY TRUE? THE DANGERS OF TRAINING AND EVALUATING AI TOOLS BASED ON EXPERTS’ KNOW-WHAT
【24h】

IS AI GROUND TRUTH REALLY TRUE? THE DANGERS OF TRAINING AND EVALUATING AI TOOLS BASED ON EXPERTS’ KNOW-WHAT

机译:是真的真的真的吗? 基于专家了解的培训和评估AI工具的危险

获取原文
获取原文并翻译 | 示例
       

摘要

Organizational decision-makers need to evaluate AI tools in light of increasing claims that such tools outperform human experts. Yet, measuring the quality of knowledge work is challenging, raising the question of how to evaluate AI performance in such contexts. We investigate this question through a field study of a major U.S. hospital, observing how managers evaluated five different machine-learning (ML) based AI tools. Each tool reported high performance according to standard AI accuracy measures, which were based on ground truth labels provided by qualified experts. Trying these tools out in practice, however, revealed that none of them met expectations. Searching for explanations, managers began confronting the high uncertainty of experts' know-what knowledge captured in ground truth labels used to train and validate ML models. In practice, experts address this uncertainty by drawing on rich know-how practices, which were not incorporated into these ML-based tools. Discovering the disconnect between AI's know-what and experts' know-how enabled managers to better understand the risks and benefits of each tool. This study shows dangers of treating ground truth labels used in ML models objectively when the underlying knowledge is uncertain. We outline implications of our study for developing, training, and evaluating AI for knowledge work.
机译:组织决策者需要根据增加的索赔来评估AI工具,因为这些工具赢得了人类专家的表现。然而,衡量知识工作质量有挑战性,提出了如何在这种情况下评估AI性能的问题。我们通过对美国专业医院的实地研究调查了这个问题,观察了经理如何评估了五种不同的机器学习(ML)的AI工具。每个工具根据标准AI精度措施报告了高性能,基于合格专家提供的地面真理标签。然而,在实践中尝试这些工具透露,他们都没有满足期望。在寻找解释,管理者开始面对专家的高度不确定性,专业知识 - 在地上真理标签中捕获的知识,用于培训和验证ML模型。在实践中,专家通过绘制丰富的专业知识实践来解决这种不确定性,这些专业知识实践未被纳入这些基于ML的工具。发现AI的诀窍和专家的诀窍与专家的诀窍与能够更好地了解每个工具的风险和优势之间的断开连接。本研究表明,当潜在的知识不确定时,客观地处理ML模型中使用的地面真理标签的危险。我们概述了我们对发展,培训和评估知识工作的AI的研究的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号