首页> 外文会议>IEEE Automatic Speech Recognition and Understanding Workshop >Towards Controlling False Alarm - Miss Trade-Off in Perceptual Speaker Comparison via Non-Neutral Listening Task Framing
【24h】

Towards Controlling False Alarm - Miss Trade-Off in Perceptual Speaker Comparison via Non-Neutral Listening Task Framing

机译:走向控制虚警-通过非中性听力任务框架在演讲者比较中的权衡取舍

获取原文

摘要

Speaker comparison by listening is a valuable resource, for instance, in human voice discrimination studies, and voice conversion (VC) systems evaluations. Usually, listeners are provided with application-neutral guidelines that encourage retaining overall high speaker discrimination accuracy. Nonetheless, listeners are subject to misses (declaring same-speaker trial as different-speaker) and false alarms (vice versa) with possibly non-symmetric outcomes. In automatic speaker verification (ASV) applications, the consequences of a miss and a false alarm are rarely equal, and decision making policy is adjusted towards a given application with a desired miss/false alarm trade-off. We study whether listener decisions could similarly be controlled to provoke more accept (or reject) decisions, by framing the voice comparison task in different ways. Our neutral, forensic, user-convenient bank and secure bank scenarios are played by disjoint panels (through Amazon's Mechanical Turk), all judging the same speaker trials originated from RedDots and 2018 Voice Conversion Challenge (VCC 2018) data. Our results indicate that listener decisions can be influenced by modifying the task framing. As a subjective task, the challenge is how to drive the panel decisions to the desired direction (to reduce miss or false alarm rate). Our preliminary results suggest potential for novel, application-directed speaker discrimination designs.
机译:例如,在人类语音识别研究和语音转换(VC)系统评估中,通过收听说话者进行比较是一种宝贵的资源。通常,为听众提供与应用程序无关的指南,这些指南鼓励保持总体上较高的说话者辨别准确性。但是,听众会遭受失误(将同一讲话者的试用声明为不同讲话者)和错误警报(反之亦然),结果可能不对称。在自动扬声器验证(ASV)应用程序中,未命中和错误警报的后果很少相等,并且针对给定的应用程序调整了决策策略,并具有所需的未命中/错误警报权衡。我们研究通过以不同方式构建语音比较任务,是否可以类似地控制听众的决策以激发更多的接受(或拒绝)决策。我们的中立,取证,用户方便的银行和安全银行场景是由不相关的面板(通过Amazon的Mechanical Turk)播放的,所有这些都基于来自RedDots和2018语音转换挑战(VCC 2018)数据的同一扬声器测试。我们的结果表明,可以通过修改任务框架来影响侦听器的决策。作为一项主观任务,面临的挑战是如何将专家小组的决定推向期望的方向(以减少未命中率或误报率)。我们的初步结果表明,有可能进行新颖的,针对应用的说话人辨别设计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号