Postprocessing Synthetic Speech With a Complex Cepstrum Vocoder for Spoofing Phase-Based Synthetic Speech Detectors

Cenk Demiroglu; Osman Buyuk; Ali Khodabakhsh; Ranniery Maia

Abstract

State-of-the-art speaker verification systems are vulnerable to spoofing attacks. To address the issue, high-performance synthetic speech detectors (SSDs) for existing spoofing methods have been proposed. Phase-based SSDs that exploit the fact that most of the parametric speech coders use minimum-phase filters are particularly successful when synthetic speech is generated with a parametric vocoder. Here, we propose a new attack strategy to spoof phase-based SSDs with the objective of increasing the security of voice verification systems by enabling the development of more generalized SSDs. As opposed to other parametric vocoders, the complex cepstrum approach uses mixed-phase filters, which makes it an ideal candidate for spoofing the phase-based SSDs. We propose using a complex cepstrum vocoder as a postprocessor to existing techniques to spoof the speaker verification system as well as the phase-based SSDs. Once synthetic speech is generated with a speech synthesis or a voice conversion technique, for each synthetic speech frame, a natural frame is selected from a training database using a spectral distance measure. Then, complex cepstrum parameters of the natural frame are used for resynthesizing the synthetic frame. In the proposed method, complex cepstrum-based resynthesis is used as a postprocessor. Hence, it can be used in tandem with any synthetic speech generator. Experimental results showed that the approach is successful at spoofing four phase-based SSDs across nine parametric attack algorithms. Moreover, performance at spoofing the speaker verification system did not substantially degrade compared to the case when no postprocessor is employed.
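
Two ideas from the abstract can be illustrated with a minimal Python sketch: a minimum-phase signal has a causal complex cepstrum while a mixed-phase signal has a two-sided one, and a natural frame is picked for each synthetic frame by nearest-neighbour search. This is not the authors' implementation. The function names, the toy signals, the principal-value logarithm (no phase unwrapping), and the squared Euclidean distance standing in for the unspecified spectral distance measure are all assumptions made for illustration.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform, adequate for short toy frames."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(x[m] * cmath.exp(sign * 2j * math.pi * k * m / n)
                  for m in range(n))
        out.append(acc / n if inverse else acc)
    return out


def complex_cepstrum(frame):
    """Complex cepstrum: inverse DFT of the complex logarithm of the spectrum.

    Simplification (assumption): the principal value of the logarithm is used,
    which is only valid while the phase stays inside (-pi, pi). Real speech
    frames would need phase unwrapping and linear-phase removal.
    """
    log_spec = [cmath.log(v) for v in dft(frame)]
    return [c.real for c in dft(log_spec, inverse=True)]


def anticausal_energy(cep):
    """Energy at negative quefrencies (upper half of the circular buffer)."""
    n = len(cep)
    return sum(c * c for c in cep[n // 2 + 1:])


def select_natural_frames(synthetic, natural):
    """Index of the closest natural feature vector for each synthetic one.

    Squared Euclidean distance is a stand-in for the spectral distance
    measure, which the abstract does not specify.
    """
    def dist(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))
    return [min(range(len(natural)), key=lambda j: dist(s, natural[j]))
            for s in synthetic]


N = 64
# 1 - 0.5 z^-1: single zero inside the unit circle, hence minimum phase.
min_phase = [1.0, -0.5] + [0.0] * (N - 2)
# (1 - 0.5 z^-1)(1 - 0.4 z): one zero inside and one outside the unit circle.
# The sample at time -1 is stored at the last index of the circular buffer.
mixed_phase = [1.2, -0.5] + [0.0] * (N - 3) + [-0.4]

print("anticausal energy, minimum phase:",
      anticausal_energy(complex_cepstrum(min_phase)))
print("anticausal energy, mixed phase:  ",
      anticausal_energy(complex_cepstrum(mixed_phase)))

# Toy feature vectors (hypothetical values) for the frame-selection step.
synthetic_feats = [[0.0, 1.0], [5.0, 5.0]]
natural_feats = [[4.0, 6.0], [0.1, 0.9], [9.0, 9.0]]
print("selected natural frames:",
      select_natural_frames(synthetic_feats, natural_feats))
```

For the minimum-phase frame the energy at negative quefrencies is numerically negligible, whereas the mixed-phase frame keeps a clearly nonzero anticausal part. A full system would follow the selection step by resynthesizing each synthetic frame with the complex cepstrum parameters of the selected natural frame, which this sketch does not attempt.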

Bibliographic details

Source
Selected Topics in Signal Processing, IEEE Journal of | 2017, Issue 4 | pp. 671-683 | 13 pages
Authors
Cenk Demiroglu; Osman Buyuk; Ali Khodabakhsh; Ranniery Maia
Author affiliations

Department of Electrical and Computer Engineering, Özyeğin University, Istanbul, Turkey;

Department of Electronics and Telecommunications Engineering, Kocaeli University, Kocaeli, Turkey;

Department of Electrical and Computer Engineering, Özyeğin University, Istanbul, Turkey;

Cambridge Research Laboratory, Toshiba Research Europe Limited, Cambridge, U.K.;

Indexing information
Original format: PDF
Language: English
Keywords
Speech; Cepstrum; Vocoders; Feature extraction; Speech synthesis; Signal processing algorithms; Detectors;
Date added to database: 2022-08-18 01:16:26