IEEE Workshop on Automatic Speech Recognition and Understanding

A METHOD FOR EVALUATING AND COMPARING USER SIMULATIONS: THE CRAMER-VON MISES DIVERGENCE


Abstract

Although user simulations are increasingly employed in the development and assessment of spoken dialog systems, there is no accepted method for evaluating user simulations. In this paper, we propose a novel quality measure for user simulations. We view a user simulation as a predictor of the performance of a dialog system, where per-dialog performance is measured with a domain-specific scoring function. The quality of the user simulation is measured as the divergence between the distribution of scores in real dialogs and simulated dialogs, and we argue that the Cramer-von Mises divergence is well-suited to this task. The technique is demonstrated on a corpus of real calls, and we present a table of critical values for practitioners to interpret the statistical significance of comparisons between user simulations.
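The core idea above can be illustrated with a small numerical sketch: treat the real and simulated per-dialog scores as two samples, build their empirical CDFs, and measure the divergence between them. The function below is a minimal illustrative variant, computing a root-mean-square difference between the two empirical CDFs evaluated at the real scores; the paper's exact normalization constant may differ, and the function names are mine, not from the paper.

```python
import numpy as np

def empirical_cdf(sample, grid):
    """Fraction of sample values <= each grid point."""
    sample = np.sort(np.asarray(sample, dtype=float))
    return np.searchsorted(sample, grid, side="right") / len(sample)

def cramer_von_mises_divergence(real_scores, sim_scores):
    """Divergence between the empirical CDFs of real and simulated
    per-dialog scores, evaluated at the observed real scores.
    (Illustrative normalization, not necessarily the paper's.)"""
    grid = np.sort(np.asarray(real_scores, dtype=float))
    f = empirical_cdf(real_scores, grid)   # CDF of real dialog scores
    g = empirical_cdf(sim_scores, grid)    # CDF of simulated dialog scores
    return float(np.sqrt(np.mean((f - g) ** 2)))

# Identical score distributions give zero divergence:
print(cramer_von_mises_divergence([1, 2, 3], [1, 2, 3]))  # 0.0
```

A smaller divergence means the user simulation better predicts the dialog system's score distribution on real calls, which is the quality criterion the paper proposes.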