首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >Using Complexity-Identical Human- and Machine-Directed Utterances to Investigate Addressee Detection for Spoken Dialogue Systems
【2h】

Using Complexity-Identical Human- and Machine-Directed Utterances to Investigate Addressee Detection for Spoken Dialogue Systems

机译:使用复杂度相同的人机对话来调查口语对话系统的收件人检测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Human-machine addressee detection (H-M AD) is a modern paralinguistics and dialogue challenge that arises in multiparty conversations between several people and a spoken dialogue system (SDS) since the users may also talk to each other and even to themselves while interacting with the system. The SDS is supposed to determine whether it is being addressed or not. All existing studies on acoustic H-M AD were conducted on corpora designed in such a way that a human addressee and a machine played different dialogue roles. This peculiarity influences speakers’ behaviour and increases vocal differences between human- and machine-directed utterances. In the present study, we consider the Restaurant Booking Corpus (RBC) that consists of complexity-identical human- and machine-directed phone calls and allows us to eliminate most of the factors influencing speakers’ behaviour implicitly. The only remaining factor is the speakers’ explicit awareness of their interlocutor (technical system or human being). Although complexity-identical H-M AD is essentially more challenging than the classical one, we managed to achieve significant improvements using data augmentation (unweighted average recall (UAR) = 0.628) over native listeners (UAR = 0.596) and a baseline classifier presented by the RBC developers (UAR = 0.539).
机译:人机收件人的检测(HM AD)是现代的语言学和对话挑战,它发生在几个人与口头对话系统(SDS)之间的多方对话中,因为用户在与系统交互时也可能彼此交谈,甚至与自己对话。 SDS应该确定是否正在处理它。现有的有关声学H-M AD的所有研究都是在语料库上进行的,语料库的设计使人类收件人和机器扮演不同的对话角色。这种特殊性会影响说话者的行为,并增加人与机器说话之间的语音差异。在本研究中,我们考虑了餐厅预订语料库(RBC),该语料库由复杂性相同的人机和机器呼叫组成,可以消除隐含影响说话者行为的大多数因素。剩下的唯一因素是演讲者对对话者(技术系统或人)的明确了解。尽管与复杂度相同的HM AD在本质上比经典AD更具挑战性,但我们设法通过数据增强(未加权平均召回率(UAR)= 0.628)优于本地监听器(UAR = 0.596)和RBC提出的基线分类器实现了重大改进开发人员(UAR = 0.539)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号