首页> 外文会议>IEEE Symposium on Industrial Electronics and Applications >Music and vocal separation using multiband modulation based features
【24h】

Music and vocal separation using multiband modulation based features

机译:基于多频带调制功能的音乐和声音分离

获取原文
获取外文期刊封面目录资料

摘要

The potential use of non-linear speech features has not been investigated for music analysis although other commonly used speech features like Mel Frequency Ceptral Coefficients (MFCC) and pitch have been used extensively. In this paper, we assume an audio signal to be a sum of modulated sinusoidal and then use the energy separation algorithm to decompose the audio into amplitude and frequency modulation components using the non-linear Teager-Kaiser energy operator. We first identify the distribution of these non-linear features for music only and voice only segments in the audio signal in different Mel spaced frequency bands and show that they have the ability to discriminate voice and music from an audio signal. The proposed method is based on Kullback-Leibler divergence measure and is evaluated using a set of Indian classical songs from three different artists. Experimental results show that the discrimination ability is evident in certain low and mid frequency bands (100–1500 Hz).
机译:虽然其他常用的语音特征如麦隆频率焦点系数(MFCC)和间距相同,但尚未研究潜在的非线性语音特征的潜在使用进行音乐分析。在本文中,我们假设音频信号是调制正弦的总和,然后使用能量分离算法使用非线性Teager-kaiser能量操作员将音频分解为幅度和频率调制分量。我们首先识别这些非线性特征的分布仅用于音乐,并且在不同熔点间隔频带中的音频信号中的语音仅段,并表明它们具有区分语音和音乐从音频信号辨别语音和音乐。该方法基于Kullback-Leibler分歧测量,并使用来自三个不同艺术家的一组印度古典歌曲进行评估。实验结果表明,在某些低和中频带(100-1500Hz)中辨别能力是显而易见的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号