首页> 外文期刊>Datenbank-Spektrum >Comparing Wizard of Oz & Observational Studies for Conversational IR Evaluation
【24h】

Comparing Wizard of Oz & Observational Studies for Conversational IR Evaluation

机译:比较OZ&观测研究向导对会话IR评估

获取原文
获取原文并翻译 | 示例

摘要

Abstract Systematic and repeatable measurement of information systems via test collections, the Cranfield model, has been the mainstay of Information Retrieval since the 1960s. However, this may not be appropriate for newer, more interactive systems, such as Conversational Search agents. Such systems rely on Machine Learning technologies, which are not yet sufficiently advanced to permit true human-like dialogues, and so research can be enabled by simulation via human agents.In this work we compare dialogues obtained from two studies with the same context, assistance in the kitchen, but with different experimental setups, allowing us to learn about and evaluate conversational IR systems. We discover that users adapt their behaviour when they think they are interacting with a system and that human-like conversations in one of the studies were unpredictable to an extent we did not expect. Our results have implications for the development of new studies in this area and, ultimately, the design of future conversational agents.
机译:摘要通过测试集合,Cranfield模型的系统和可重复测量信息系统,是自20世纪60年代以来的信息检索的主要机制。但是,这可能不适合较新的更多交互式系统,例如会话搜索代理。这种系统依赖于机器学习技术,尚未充分地推进以允许真正的人类类似的对话,因此可以通过人体代理进行模拟来实现研究。在这项工作中,我们比较了从同一背景下获得的两项研究获得的对话,援助在厨房里,但用不同的实验设置,允许我们了解和评估会话IR系统。我们发现用户认为他们认为他们与系统交互时,他们的行为适应他们的行为,其中一个研究中的人类对话在我们没有期望的范围内是不可预测的。我们的结果对该地区的新研究发展有影响,最终是未来的会话代理人的设计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号