...
首页> 外文期刊>Language Resources and Evaluation >A task-performance evaluation of referring expressions in situated collaborative task dialogues
【24h】

A task-performance evaluation of referring expressions in situated collaborative task dialogues

机译:协作式协作对话中引用表达的任务绩效评估

获取原文
获取原文并翻译 | 示例
           

摘要

Appropriate evaluation of referring expressions is critical for the design of systems that can effectively collaborate with humans. A widely used method is to simply evaluate the degree to which an algorithm can reproduce the same expressions as those in previously collected corpora. Several researchers, however, have noted the need of a task-performance evaluation measuring the effectiveness of a referring expression in the achievement of a given task goal. This is particularly important in collaborative situated dialogues. Using referring expressions used by six pairs of Japanese speakers collaboratively solving Tangram puzzles, we conducted a task-performance evaluation of referring expressions with 36 human evaluators. Particularly we focused on the evaluation of demonstrative pronouns generated by a machine learning-based algorithm. Comparing the results of this task-performance evaluation with the results of a previously conducted corpus-matching evaluation (Spanger et al. in Lang Resour Eval, 2010b), we confirmed the limitation of a corpus-matching evaluation and discuss the need for a task-performance evaluation.
机译:对引用表达式的适当评估对于设计可以与人类有效协作的系统至关重要。广泛使用的方法是简单地评估算法可以复制与先前收集的语料库中的表达式相同的表达式的程度。但是,一些研究人员指出,需要进行任务绩效评估,以评估参考表达在实现既定任务目标中的有效性。这在协作式对话中​​尤为重要。我们使用六对日语说话者共同解决七巧板谜题所使用的参照表达,与36位人类评估者进行了参照表达的任务绩效评估。尤其是,我们重点研究了基于机器学习的算法所产生的指示代词的评估。将这项任务绩效评估的结果与先前进行的语料匹配评估的结果进行比较(Spanger等人,在Lang Resour Eval,2010b),我们确认了语料匹配评估的局限性,并讨论了一项任务的必要性绩效评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号