Single channel speech separation in modulation frequency domain based on a novel pitch range estimation method

Azar Mahmoodzadeh; Hamid Reza Abutalebi; Hamid Soltanian-Zadeh; Hamid Sheikhzadeh

Abstract

Computational Auditory Scene Analysis (CASA) has been a focus of recent literature on speech separation from monaural mixtures. The performance of current CASA systems on voiced speech separation depends strictly on the robustness of the algorithm used for pitch frequency estimation. We propose a new system that estimates the pitch (frequency) range of a target utterance and separates the voiced portions of the target speech. The algorithm first estimates the pitch range of the target speech in each frame of data in the modulation frequency domain, and then uses the estimated pitch range to segregate the target speech. The pitch range estimation method is based on an onset and offset algorithm. Speech separation is performed by filtering the mixture signal with a mask extracted from the modulation spectrogram. A systematic evaluation shows that the proposed system extracts the majority of the target speech signal with minimal interference and outperforms previous systems in both pitch extraction and voiced speech separation.
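To make the terms "modulation spectrogram" and "mask" concrete, the following is a minimal Python sketch of one common way to build a modulation spectrogram (a second Fourier transform along the time axis of each short-time spectral envelope) and a binary mask over a given pitch range. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the transform parameters, and the onset and offset algorithm used for pitch range estimation is not reproduced here. The function names `modulation_spectrogram` and `pitch_range_mask`, the window length, and the hop size are all hypothetical choices; the hop is kept small so that the frame rate (`fs / hop`) is high enough to resolve typical pitch frequencies.

```python
import numpy as np


def modulation_spectrogram(x, fs, win=64, hop=8):
    """Return (mod, mod_freqs) for signal x sampled at fs Hz.

    mod has shape (n_acoustic_bins, n_modulation_bins).
    win and hop are illustrative defaults, not values from the paper.
    """
    window = np.hanning(win)
    n_frames = 1 + (len(x) - win) // hop
    # Short-time Fourier transform: one windowed frame per row.
    frames = np.stack([x[i * hop:i * hop + win] * window for i in range(n_frames)])
    envelopes = np.abs(np.fft.rfft(frames, axis=1))      # (n_frames, n_acoustic_bins)
    envelopes -= envelopes.mean(axis=0, keepdims=True)   # drop each envelope's DC term
    # Second transform along time gives the modulation frequency axis.
    mod = np.abs(np.fft.rfft(envelopes, axis=0)).T
    mod_freqs = np.fft.rfftfreq(n_frames, d=hop / fs)
    return mod, mod_freqs


def pitch_range_mask(mod_freqs, f_lo, f_hi):
    """Boolean mask keeping modulation bins inside the pitch range [f_lo, f_hi] Hz."""
    return (mod_freqs >= f_lo) & (mod_freqs <= f_hi)
```

For a synthetic harmonic tone with a 100 Hz fundamental sampled at 8 kHz, the modulation energy summed over acoustic bins should concentrate near 100 Hz, and `pitch_range_mask(mod_freqs, 80, 120)` would select the corresponding modulation bins. In the paper, the range passed to such a mask would come from the per-frame pitch range estimate rather than from fixed bounds.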

Bibliographic details

Source: EURASIP Journal on Advances in Signal Processing, 2012, Issue 1
Authors: Azar Mahmoodzadeh; Hamid Reza Abutalebi; Hamid Soltanian-Zadeh; Hamid Sheikhzadeh
Original format: PDF
CLC classification: Communications
Keywords: acoustic frequency; modulation frequency; onset and offset algorithm; pitch range estimation; speech separation

Similar literature

1. Impact of phase estimation on single-channel speech separation based on time-frequency masking [J]. Mayer Florian, Williamson Donald S., Mowlaee Pejman. The Journal of the Acoustical Society of America. 2017, Issue 6.
2. Single-channel speech separation using empirical mode decomposition and multi pitch information with estimation of number of speakers [J]. M. K. Prasanna Kumar, R. Kumaraswamy. International Journal of Speech Technology. 2017, Issue 1.
3. Source–Filter-Based Single-Channel Speech Separation Using Pitch Information [J]. Stark M., Wohlmayr M., Pernkopf F. IEEE Transactions on Audio, Speech, and Language Processing. 2011, Issue 2.
4. Single channel speech separation with a frame-based pitch range estimation method in modulation frequency [C]. Mahmoodzadeh A., Abutalebi H.R., Soltanian-Zadeh H. 2010 5th International Symposium on Telecommunications. 2010.
5. New time-frequency domain pitch estimation methods for speech signals under low levels of SNR [D]. Shahnaz, Celia. 2009.
6. Impact of phase estimation on single-channel speech separation based on time-frequency masking [O]. Florian Mayer, Donald S. Williamson, Pejman Mowlaee.
7. Single channel speech separation in modulation frequency domain based on a novel pitch range estimation method [O]. Azar Mahmoodzadeh, Hamid Reza Abutalebi, Hamid Soltanian-Zadeh. 2012.
