Conference: AAAI Symposium on Simulating Human Agents
Issues in the conduct of comparisons among human performance models and live human performance in complex simulated task environments

Abstract

The Air Force Research Laboratory has sponsored the Agent-based Modeling and Behavior Representation (AMBR) Project to advance the state of the art in human performance modeling in general and the state of practice in cost-effective computer-generated forces more specifically (Pew & Mavor, 1998). To advance human performance modeling, they have created an opportunity for multiple developers to create different models of the same human operator activity and to compare the results both from model to model and with human participants performing the same task. As you will learn in a later talk, in the course of this project these model comparisons will be conducted three times, each time using different, hopefully more demanding, human modeling requirements. The first comparison is now complete, and the results were reported at the International Ergonomics Society Meeting in July 2000. The project is important because it is rare to have the opportunity to validate human performance models, and even rarer to be able to compare and contrast the results of multiple model developers who use different model architectures and draw their models from different theoretical perspectives. BBN has been assigned the responsibility of serving as moderator for these comparisons. In that capacity, for the first round, we developed a simulated, but much simplified, air traffic control environment and task scenarios that emphasized multiple task management, task priority setting, and attention management. Using DOMAR (Freeman, 1997), an agent-based simulation environment, we can interchangeably "plug in" either a human operator model or a live human participant. We used this capability to collect human performance data on scenarios identical to those the model developers were required to exercise. Together with Michael Young, the project initiator, we developed the specifications to which the model developers were to respond.
We collected the human performance data supplied to the developers for developing their models, as well as the data that were used for model evaluation. Together with the developers, we conducted the experimental comparisons among models and analyzed and reported the results. Four independent development teams participated. In the course of completing these assignments we have had to address a number of challenging issues. This paper describes a sample of these issues and what we have done to resolve them. I make no claim that our solutions are the best ones. In some cases they were driven by practical considerations alone. I hope the paper will stimulate others to think about how these issues should be resolved.
