首页> 外文会议>2010 IEEE Symposium on Industrial Electronics Applications >Music and vocal separation using multiband modulation based features
【24h】

Music and vocal separation using multiband modulation based features

机译:使用基于多频带调制的功能进行音乐和人声分离

获取原文

摘要

The potential use of non-linear speech features has not been investigated for music analysis although other commonly used speech features like Mel Frequency Ceptral Coefficients (MFCC) and pitch have been used extensively. In this paper, we assume an audio signal to be a sum of modulated sinusoidal and then use the energy separation algorithm to decompose the audio into amplitude and frequency modulation components using the non-linear Teager-Kaiser energy operator. We first identify the distribution of these non-linear features for music only and voice only segments in the audio signal in different Mel spaced frequency bands and show that they have the ability to discriminate voice and music from an audio signal. The proposed method is based on Kullback-Leibler divergence measure and is evaluated using a set of Indian classical songs from three different artists. Experimental results show that the discrimination ability is evident in certain low and mid frequency bands (100–1500 Hz).
机译:非线性语音特征的潜在用途尚未用于音乐分析,尽管已广泛使用了其他常用的语音特征,例如梅尔频率中心系数(MFCC)和音高。在本文中,我们假设音频信号是调制正弦波的总和,然后使用能量分离算法使用非线性Teager-Kaiser能量算子将音频分解为振幅和频率调制分量。我们首先确定这些非线性特征在音频信号中不同Mel间隔频带中仅音乐和仅语音段的分布,并表明它们具有区分音频信号中的语音和音乐的能力。所提出的方法基于Kullback-Leibler发散度度量,并使用一组来自三位不同艺术家的印度古典歌曲进行评估。实验结果表明,在某些低频段和中频段(100-1500 Hz)中,辨别能力很明显。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号