首页> 外文会议>IEEE Automatic Speech Recognition and Understanding Workshop >Towards Controlling False Alarm - Miss Trade-Off in Perceptual Speaker Comparison via Non-Neutral Listening Task Framing

【24h】

Towards Controlling False Alarm - Miss Trade-Off in Perceptual Speaker Comparison via Non-Neutral Listening Task Framing

机译：走向控制虚警-通过非中性听力任务框架在演讲者比较中的权衡取舍

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speaker comparison by listening is a valuable resource, for instance, in human voice discrimination studies, and voice conversion (VC) systems evaluations. Usually, listeners are provided with application-neutral guidelines that encourage retaining overall high speaker discrimination accuracy. Nonetheless, listeners are subject to misses (declaring same-speaker trial as different-speaker) and false alarms (vice versa) with possibly non-symmetric outcomes. In automatic speaker verification (ASV) applications, the consequences of a miss and a false alarm are rarely equal, and decision making policy is adjusted towards a given application with a desired miss/false alarm trade-off. We study whether listener decisions could similarly be controlled to provoke more accept (or reject) decisions, by framing the voice comparison task in different ways. Our neutral, forensic, user-convenient bank and secure bank scenarios are played by disjoint panels (through Amazon's Mechanical Turk), all judging the same speaker trials originated from RedDots and 2018 Voice Conversion Challenge (VCC 2018) data. Our results indicate that listener decisions can be influenced by modifying the task framing. As a subjective task, the challenge is how to drive the panel decisions to the desired direction (to reduce miss or false alarm rate). Our preliminary results suggest potential for novel, application-directed speaker discrimination designs.

机译：例如，在人类语音识别研究和语音转换（VC）系统评估中，通过收听说话者进行比较是一种宝贵的资源。通常，为听众提供与应用程序无关的指南，这些指南鼓励保持总体上较高的说话者辨别准确性。但是，听众会遭受失误（将同一讲话者的试用声明为不同讲话者）和错误警报（反之亦然），结果可能不对称。在自动扬声器验证（ASV）应用程序中，未命中和错误警报的后果很少相等，并且针对给定的应用程序调整了决策策略，并具有所需的未命中/错误警报权衡。我们研究通过以不同方式构建语音比较任务，是否可以类似地控制听众的决策以激发更多的接受（或拒绝）决策。我们的中立，取证，用户方便的银行和安全银行场景是由不相关的面板（通过Amazon的Mechanical Turk）播放的，所有这些都基于来自RedDots和2018语音转换挑战（VCC 2018）数据的同一扬声器测试。我们的结果表明，可以通过修改任务框架来影响侦听器的决策。作为一项主观任务，面临的挑战是如何将专家小组的决定推向期望的方向（以减少未命中率或误报率）。我们的初步结果表明，有可能进行新颖的，针对应用的说话人辨别设计。

著录项

来源
《IEEE Automatic Speech Recognition and Understanding Workshop 》|2019年|749-756|共8页
会议地点
作者
Rosa González Hautamäki; Tomi H. Kinnunen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Task analysis; Forensics; Decision making; Speaker recognition; Cost function; NIST; Guidelines;

机译：任务分析;取证;决策;说话人识别;成本函数; NIST;指南;

相似文献

外文文献
中文文献
专利

1. Listening to different speakers: On the time-course of perceptual compensation for vocal-tract characteristics [J] . SjerpsM.J., MittererH., McQueenJ.M. Neuropsychologia . 2011 ,第14期

机译：聆听不同的说话者：关于声道特征的知觉补偿的时程
2. Voice biometrics security: Extrapolating false alarm rate via hierarchical Bayesian modeling of speaker verification scores [J] . Alexey Sholokhov, Tomi Kinnunen, Ville Vestman, Computer speech and language . 2020 ,第Mara期

机译：语音生物识别技术的安全性：通过说话人验证分数的分级贝叶斯建模推断错误警报率
3. Effects of linguistic contents on perceptual speaker identification:Comparison of familiar and unknown speaker identifications~1 [J] . Kanae Amino, Takayuki Arai Acoustical science and technology . 2009 ,第2期

机译：语言内容对言语识别的影响：比较熟悉和未知的言语识别〜1
4. Towards Controlling False Alarm - Miss Trade-Off in Perceptual Speaker Comparison via Non-Neutral Listening Task Framing [C] . Rosa González Hautam?ki, Tomi H. Kinnunen IEEE Automatic Speech Recognition and Understanding Workshop . 2019

机译：在控制虚假警报 - 通过非中立聆听任务框架进行感知扬声器比较的权衡
5. An interpretive frame model of memory: Effects of social identity activation on recognition, recall, and the false alarms effect. [D] . Mercurio, Kathryn Ruth. 2011

机译：记忆的解释框架模型：社会身份激活对识别，回忆和虚假警报的影响。
6. Functional Difference between Sustained and Transient Modulations of Cognitive Control in the Simon Task: Evidence from False Alarm Responses on No-Go Trials [O] . Kunihiro Hasegawa, Shin’ya Takahashi -1

机译：Simon任务中认知控制的持续调制和暂时调制之间的功能差异：来自对不进行试验的错误警报响应的证据
7. Functional difference between sustained and transient modulations of cognitive control in the simon task: evidence from false alarm responses on no-go trials. [O] . Kunihiro Hasegawa, Shin'ya Takahashi 2013

机译：simon任务中认知控制的持续和瞬时调制之间的功能差异：来自禁止试验的虚警响应的证据。
8. Performance Comparison of Cell Averaging and 'Greatest-of' Constant False Alarm Rate (CFAR) Methods [R] . Lawrence, N. B. 1981

机译：细胞平均和'最大'恒定误报率（CFaR）方法的性能比较

Towards Controlling False Alarm - Miss Trade-Off in Perceptual Speaker Comparison via Non-Neutral Listening Task Framing

摘要

著录项

相似文献

相关主题

期刊订阅