Conference of the International Speech Communication Association

Spectro-temporal Modulation Based Singing Detection Combined with Pitch-based Grouping for Singing Voice Separation



Abstract

This paper proposes a singing-voice detector based on spectro-temporal modulation, cascaded with a Viterbi-based pitch-tracking algorithm, for separating the singing voice from monaural recordings. To detect the singing voice, the spectro-temporal modulation energy related to voice harmonics is extracted using a spectro-temporal modulation analysis framework developed for the Fourier spectrogram. The singing voice is separated from the background music by using a binary mask to group the estimated harmonics of the singing voice. The proposed system is evaluated on the MIR-1K dataset and is shown to outperform three other binary-mask-based systems in the vocal/music separation task.
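The abstract does not say how the binary mask is built from the estimated harmonics, so the following is only a minimal sketch of one plausible reading: given a per-frame pitch track (0 where no singing was detected), mark the spectrogram bins that lie close to an integer multiple of the pitch. The function name `harmonic_binary_mask`, the `half_width_hz` tolerance, and the convention of using 0 for non-vocal frames are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def harmonic_binary_mask(f0_hz, n_bins, sample_rate, n_fft, half_width_hz=40.0):
    """Return a boolean (n_bins, n_frames) mask that is True near harmonics of f0.

    f0_hz: per-frame pitch estimate in Hz; a value <= 0 marks a frame with no
           detected singing (assumed convention, not from the paper).
    half_width_hz: hypothetical tolerance around each harmonic.
    """
    f0_hz = np.asarray(f0_hz, dtype=float)
    bin_freqs = np.arange(n_bins) * sample_rate / n_fft  # centre frequency of each bin
    mask = np.zeros((n_bins, len(f0_hz)), dtype=bool)
    for t, f0 in enumerate(f0_hz):
        if f0 <= 0:
            continue  # non-vocal frame: every bin is left to the accompaniment
        # Index of the nearest harmonic for each bin (at least the fundamental).
        k = np.maximum(np.round(bin_freqs / f0), 1)
        # Keep the bin if it is within the tolerance of that harmonic.
        mask[:, t] = np.abs(bin_freqs - k * f0) <= half_width_hz
    return mask

def apply_mask(spectrogram, mask):
    """Split a (complex or magnitude) spectrogram into vocal and accompaniment parts."""
    vocal = np.where(mask, spectrogram, 0)
    accompaniment = np.where(mask, 0, spectrogram)
    return vocal, accompaniment
```

Because the mask is binary, every time-frequency bin is assigned wholly to one source, so the two returned parts always sum back to the input spectrogram.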
