首页> 外文会议>AES international conference >Singing Voice Separation from Stereo Recordings using Spatial Clues and Robust FO Estimation

【24h】

Singing Voice Separation from Stereo Recordings using Spatial Clues and Robust FO Estimation

机译：使用空间线索和稳健的FO估计从立体声录音中分离声音

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Separation of singing voice from music accompaniment is a topic of great utility in many application of Music Information Retrieval. In the context of stereophonic music mixtures, many algorithms face this problem making use of the spatial diversity of the sound sources to localize and isolate the singing voice. Although these spatial approaches can obtain acceptable results, the separated signal usually is affected by a high level of distortions and artifacts. In this paper, we propose a method for improving the isolation of the singing voice in stereo recordings based on incorporating the fundamental frequency (FO) information to the separation process. First, the singing voice is pre-separated from the input mixture using a state-of-the-art stereo source separation method, the MuLeTs algorithm. Then, the FO of this pre-separated signal is obtained using a robust pitch estimator based on the computation of the difference function and Hidden Markov Models, obtaining a smooth pitch contour with voiced/unvoiced decisions. A binary mask is finally constructed from FO to isolate the singing voice from the original mix. The method has been tested on studio music recordings, obtaining good separation results.

机译：从音乐伴奏中分离歌唱声音是在音乐信息检索的许多应用中非常有用的主题。在立体声音乐混合的情况下，许多算法都利用声源的空间分集来定位和隔离歌声，从而面临这个问题。尽管这些空间方法可以获得令人满意的结果，但是分离的信号通常会受到高水平的失真和伪影的影响。在本文中，我们提出了一种方法，该方法将基本频率（FO）信息纳入分离过程，从而改善了立体声录音中歌声的隔离度。首先，使用最先进的立体声源分离方法（MuLeTs算法）将歌声从输入混音中预先分离出来。然后，基于差函数的计算和隐马尔可夫模型，使用鲁棒的音高估计器获得该预分离信号的FO，从而获得具有浊音/清音决策的平滑音高轮廓。最终，由FO构建了一个二进制蒙版，以将演唱声音与原始混音隔离开来。该方法已经在录音室音乐录音中进行了测试，获得了良好的分离效果。

著录项

来源
《AES international conference 》|2011年|p.239-246|共8页
会议地点
作者
Pablo Cabanas-Molero; Damian Martinez-Munoz; Maximo Cobos; Jose J. Lopez;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类声学工程 ;
关键词

相似文献

外文文献
中文文献
专利

1. Separation of Singing Voice From Music Accompaniment for Monaural Recordings [J] . Yipeng Li, DeLiang Wang IEEE transactions on audio, speech and language processing . 2007 ,第4期

机译：单声道录音中歌声与音乐伴奏的分离
2. Combining F0 and non-negative constraint robust principal component analysis for singing voice separation [J] . Signal processing . 2020 ,第May期

机译：结合F0和非负约束鲁棒主成分分析进行歌声分离
3. Blind monaural singing voice separation using rank-1 constraint robust principal component analysis and vocal activity detection [J] . Li Feng, Akagi Masato Neurocomputing . 2019 ,第JULa20期

机译：基于秩1约束鲁棒主成分分析和声音活动检测的盲单声道歌声分离
4. Singing Voice Separation from Stereo Recordings using Spatial Clues and Robust F0 Estimation [C] . Pablo Cabanas-Molero, Damian Martinez-Munoz, Maximo Cobos, Audio Engineering Society International Conference . 2011

机译：使用空间线索和强大的F0估算从立体声录制中唱歌语音分离
5. Characterization of the singing voice from polyphonic recordings. [D] . Smit, Christine. 2011

机译：复音录音中歌声的特征。
6. Automatic speech and singing classification in ambulatory recordings for normal and disordered voices [O] . Andrew J. Ortiz, Laura E. Toles, Katherine L. Marks, -1

机译：自动录制语音和唱歌分类以记录正常和无序的声音
7. Singing-voice separation from monaural recordings using robust principal component analysis [O] . Po-sen Huang, Scott Deeann Chen, Paris Smaragdis, 2012

机译：使用健壮的主成分分析从单声道录音中分离语音

Singing Voice Separation from Stereo Recordings using Spatial Clues and Robust FO Estimation

摘要

著录项

相似文献

相关主题

期刊订阅