首页> 外文会议>Annual meeting of the Special Interest Group on Discourse and Dialogue >Evaluation of Crowdsourced User Input Data for Spoken Dialog Systems
【24h】

Evaluation of Crowdsourced User Input Data for Spoken Dialog Systems

机译:口语对话系统的众包用户输入数据评估

获取原文

摘要

Using the Internet for the collection of data is quite common these days. This process is called crowdsourcing and enables the collection of large amounts of data at reasonable costs. While being an inexpensive method, this data typically is of lower quality. Filtering data sets is therefore required. The occurring errors can be classified into different groups. There are technical issues and human errors. For speech recording, technical issues could be a noisy background. Human errors arise when the task is misunderstood. We employ several techniques for recognizing errors and eliminating faulty data sets in user input data for a Spoken Dialog System (SDS). Furthermore, we compare three different kinds of questionnaires (QNRs) for a given set of seven tasks. We analyze the characteristics of the resulting data sets and give a recommendation which type of QNR might be the most suitable one for a given purpose.
机译:如今,使用Internet收集数据非常普遍。此过程称为众包,可以以合理的成本收集大量数据。尽管这是一种廉价的方法,但此数据通常质量较低。因此,需要过滤数据集。发生的错误可以分为不同的组。存在技术问题和人为错误。对于语音记录,技术问题可能是嘈杂的背景。当任务被误解时,会出现人为错误。我们采用多种技术来识别语音对话系统(SDS)的错误并消除用户输入数据中的错误数据集。此外,对于给定的七个任务,我们比较了三种不同类型的问卷(QNR)。我们分析了所得数据集的特征,并提出了哪种QNR可能是最适合给定目的的QNR的建议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号