首页> 外文会议>AES international conference >Singing Voice Separation from Stereo Recordings using Spatial Clues and Robust FO Estimation
【24h】

Singing Voice Separation from Stereo Recordings using Spatial Clues and Robust FO Estimation

机译:使用空间线索和稳健的FO估计从立体声录音中分离声音

获取原文

摘要

Separation of singing voice from music accompaniment is a topic of great utility in many application of Music Information Retrieval. In the context of stereophonic music mixtures, many algorithms face this problem making use of the spatial diversity of the sound sources to localize and isolate the singing voice. Although these spatial approaches can obtain acceptable results, the separated signal usually is affected by a high level of distortions and artifacts. In this paper, we propose a method for improving the isolation of the singing voice in stereo recordings based on incorporating the fundamental frequency (FO) information to the separation process. First, the singing voice is pre-separated from the input mixture using a state-of-the-art stereo source separation method, the MuLeTs algorithm. Then, the FO of this pre-separated signal is obtained using a robust pitch estimator based on the computation of the difference function and Hidden Markov Models, obtaining a smooth pitch contour with voiced/unvoiced decisions. A binary mask is finally constructed from FO to isolate the singing voice from the original mix. The method has been tested on studio music recordings, obtaining good separation results.
机译:从音乐伴奏中分离歌唱声音是在音乐信息检索的许多应用中非常有用的主题。在立体声音乐混合的情况下,许多算法都利用声源的空间分集来定位和隔离歌声,从而面临这个问题。尽管这些空间方法可以获得令人满意的结果,但是分离的信号通常会受到高水平的失真和伪影的影响。在本文中,我们提出了一种方法,该方法将基本频率(FO)信息纳入分离过程,从而改善了立体声录音中歌声的隔离度。首先,使用最先进的立体声源分离方法(MuLeTs算法)将歌声从输入混音中预先分离出来。然后,基于差函数的计算和隐马尔可夫模型,使用鲁棒的音高估计器获得该预分离信号的FO,从而获得具有浊音/清音决策的平滑音高轮廓。最终,由FO构建了一个二进制蒙版,以将演唱声音与原始混音隔离开来。该方法已经在录音室音乐录音中进行了测试,获得了良好的分离效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号