Speech and music pitch trajectory classification using recurrent neural networks for monaural speech segregation

Kim Han-Gyu; Jang Gil-Jin; Oh Yung-Hwan; Choi Ho-Jin

首页> 外文期刊>Journal of supercomputing >Speech and music pitch trajectory classification using recurrent neural networks for monaural speech segregation

【24h】

Speech and music pitch trajectory classification using recurrent neural networks for monaural speech segregation

机译：使用反复性神经网络进行语音和音乐音调轨迹分类，用于单一语音隔离

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we propose speech/music pitch classification based on recurrent neural network (RNN) for monaural speech segregation from music interferences. The speech segregation methods in this paper exploit sub-band masking to construct segregation masks modulated by the estimated speech pitch. However, for speech signals mixed with music, speech pitch estimation becomes unreliable, as speech and music have similar harmonic structures. In order to remove the music interference effectively, we propose an RNN-based speech/music pitch classification. Our proposed method models the temporal trajectories of speech and music pitch values and determines an unknown continuous pitch sequence as belonging to either speech or music. Among various types of RNNs, we chose simple recurrent network, long short-term memory (LSTM), and bidirectional LSTM for pitch classification. The experimental results show that our proposed method significantly outperforms the baseline methods for speech-music mixtures without loss of segregation performance for speech-noise mixtures.

机译：在本文中，我们提出了基于经常性神经网络（RNN）的语音/音乐间距分类，用于从音乐干扰的单一语音隔离。本文中的语音分离方法利用子带掩模来构建由估计的语音间距调制的分离掩模。然而，对于与音乐混合的语音信号，语音间距估计变得不可靠，因为语音和音乐具有相似的谐波结构。为了有效地去除音乐干扰，我们提出了基于RNN的语音/音乐间距分类。我们所提出的方法模拟语音和音乐间距值的时间轨迹，并确定属于语音或音乐的未知连续间距序列。在各种类型的RNN中，我们选择简单的复发网络，长短期存储器（LSTM）和Bidirectional LSTM进行间距分类。实验结果表明，我们提出的方法显着优于语音音乐混合物的基线方法，而不会损失语音噪声混合物的分离性能。

著录项

来源
《Journal of supercomputing》 |2020年第10期|8193-8213|共21页
作者
Kim Han-Gyu; Jang Gil-Jin; Oh Yung-Hwan; Choi Ho-Jin;
展开▼
作者单位

Naver Corp Clova Speech Seongnam Si Gyeonggi Do South Korea;

Kyungpook Natl Univ Sch Elect Engn Daegu South Korea;

Korea Adv Inst Sci & Technol Sch Comp Daejeon South Korea;

Korea Adv Inst Sci & Technol Sch Comp Daejeon South Korea;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Speech segregation; Speech pitch estimation; Pitch classification; Recurrent neural network; Long short-term memory; Bidirectional long short-term memory;

机译：语音分离;语音音调估计;间距分类;复发性神经网络;长短期内存;双向短期内存;

相似文献

外文文献
中文文献
专利

1. Speech Segregation based on Pitch Track Correction and Music-Speech Classification [J] . KIM H.-G., JANG G.-J., PARK J.-S., Advances in Electrical and Computer Engineering . 2012,第2期

机译：基于音高校正和音乐语音分类的语音分离
2. Particle Filtering Based Pitch Sequence Correction for Monaural Speech Segregation [J] . Han-Gyu Kim, Gil-Jin Jang, Jeong-Sik Park, International journal of imaging systems and technology . 2013,第1期

机译：基于粒子滤波的单声道语音分离基音序列校正
3. Pitch-based monaural segregation of reverberant speech [J] . Roman N, Wang DL The Journal of the Acoustical Society of America . 2006,第1期

机译：基于基音的混响语音单声道隔离
4. Monaural Speech Segregation Based on Pitch Track Correction Using An Ensemble Kalman Filter [C] . Han-Gyu Kim, Gil-Jin Jang, Jeong-Sik Park, Conference of the International Speech Communication Association . 2013

机译：基于沥青轨道校正的单声道语音分离，使用Ensemble Kalman滤波器
5. Classification and recognition of speech under perceptual stress using neural networks and N-D HMMs. [D] . Womack, Brian David. 1996

机译：使用神经网络和N-D HMM在感知压力下对语音进行分类和识别。
6. A hybrid technique for speech segregation and classification using a sophisticated deep neural network [O] . Khurram Ashfaq Qazi, Tabassam Nawaz, Zahid Mehmood, -1

机译：使用复杂的深度神经网络进行语音分离和分类的混合技术
7. Speech segregation based on pitch track correction and music-speech classification [O] . Kim, Han-Gyu, Jang, Gil-Jin, Park, Jeong-Sik, 2014

机译：基于音高校正和音乐语音分类的语音分离

Speech and music pitch trajectory classification using recurrent neural networks for monaural speech segregation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅