Playing a Part: Speaker Verification at the movies

机译：扮演部分：在电影中发言人验证

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The goal of this work is to investigate the performance of popular speaker recognition models on speech segments from movies, where often actors intentionally disguise their voice to play a character. We make the following three contributions: (i) We collect a novel, challenging speaker recognition dataset called VoxMovies, with speech for 856 identities from almost 4000 movie clips. VoxMovies contains utterances with varying emotion, accents and background noise, and therefore comprises an entirely different domain to the interview-style, emotionally calm utterances in current speaker recognition datasets such as VoxCeleb; (ii) We provide a number of domain adaptation evaluation sets, and benchmark the performance of state-of-the-art speaker recognition models on these evaluation pairs. We demonstrate that both speaker verification and identification performance drops steeply on this new data, showing the challenge in transferring models across domains; and finally (iii) We show that simple domain adaptation paradigms improve performance, but there is still large room for improvement.

机译：这项工作的目标是调查流行扬声器识别模型在电影中的语音段中的表现，在那里通常演员故意伪装他们的声音扮演一个角色。我们提出以下三个贡献：（i）我们收集一个小说，挑战的扬声器识别数据集，称为VoxMovies，具有来自近4000个电影剪辑的856个身份的演讲。 VoxMovies含有不同情绪，口音和背景噪声的话语，因此包括一个完全不同的域，在voxceleb等当前扬声器识别数据集中的情感平静的话语; （ii）我们提供了许多域适应评估集，并在这些评估对上基准测试最先进的扬声器识别模型。我们展示了扬声器验证和识别性能急剧下降在这一新数据上，呈现出跨领域转移模型的挑战;最后（iii）我们展示了简单的域适应范例提高了性能，但仍然有大的改进空间。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2021年|6174-6178|共5页
会议地点
作者
Andrew Brown; Jaesung Huh; Arsha Nagrani; Joon Son Chung; Andrew Zisserman;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Adaptation models; Conferences; Speech recognition; Signal processing; Motion pictures; Data models; Speaker recognition;

机译：适应模型;会议;语音识别;信号处理;电影;数据模型;扬声器识别;

相似文献

外文文献
中文文献
专利

1. A New Replay Attack Against Automatic Speaker Verification Systems [J] . Yoon Sung-Hyun, Koh Min-Sung, Park Jae-Han, Quality Control, Transactions . 2020,第期

机译：对自动扬声器验证系统进行新的重播攻击
2. Deep generative variational autoencoding for replay spoof detection in automatic speaker verification [J] . Bhusan Chettri, Tomi Kinnunen, Emmanouil Benetos Computer speech and language . 2020,第Sepa期

机译：自动扬声器验证中重放欺骗检测的深度生成变分自动化
3. On the study of replay and voice conversion attacks to text-dependent speaker verification [J] . Wu Zhizheng, Li Haizhou Multimedia Tools and Applications . 2016,第9期

机译：关于重播和语音转换攻击对依赖文本的说话人验证的研究
4. RedDots replayed: A new replay spoofing attack corpus for text-dependent speaker verification research [C] . Tomi Kinnunen, Md Sahidullah, Mauro Falcone, IEEE International Conference on Acoustics, Speech and Signal Processing . 2017

机译：重播了RedDots：一种新的重播欺骗攻击语料库，用于依赖文本的说话者验证研究
5. Title of Theory Paper: Crossover Teen Drama Movies: The Changing Characterization of Teenage Girls in Contemporary American Movies Title of Production: Inside-out screenplay. [D] . Shepeleva, Ekaterina. 2015

机译：理论论文的标题：跨界青少年戏剧电影：当代美国电影中少女的变化特征制作名称：由内而外的剧本。
6. Short-time speaker verification with different speaking style utterances [O] . Hongwei Mao, Yan Shi, Yue Liu, 2020

机译：短时间发言者验证不同的说话风格的话语
7. RedDots Replayed:A New Replay Spoofing Attack Corpus for Text-dependent Speaker Verification Research [O] . Kinnunen, Tomi, Sahidullah, Md, Falcone, Mauro, 2017

机译：RedDots重播：一个新的重播欺骗攻击语料库，用于文本相关的说话人验证研究
8. Tests Results Advanced Development Models of BISS Identity Verification Equipment. Volume II. Automatic Speaker Verification. [R] . foodman,martin j. 1978

机译：测试结果BIss身份验证设备的高级开发模型。第二卷。自动扬声器验证。

Playing a Part: Speaker Verification at the movies

摘要

著录项

相似文献

相关主题

期刊订阅