Music and vocal separation using multiband modulation based features

Abstract

The potential use of non-linear speech features has not been investigated for music analysis, although other commonly used speech features such as Mel Frequency Cepstral Coefficients (MFCC) and pitch have been used extensively. In this paper, we assume an audio signal to be a sum of modulated sinusoids and then use the energy separation algorithm to decompose the audio into amplitude and frequency modulation components using the non-linear Teager-Kaiser energy operator. We first identify the distribution of these non-linear features for music-only and voice-only segments of the audio signal in different Mel-spaced frequency bands and show that they can discriminate voice from music in an audio signal. The proposed method is based on the Kullback-Leibler divergence measure and is evaluated using a set of Indian classical songs from three different artists. Experimental results show that the discrimination ability is evident in certain low and mid frequency bands (100–1500 Hz).
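The pipeline named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which variant of the energy separation algorithm was used, so the widely known DESA-2 form is assumed here, and the function names (`tkeo`, `desa2`, `kl_divergence`), the smoothing constant, and the histogram inputs are all hypothetical. The Mel-spaced band-pass filterbank is omitted; in practice each function would be applied per band.

```python
import math

def tkeo(x):
    """Discrete Teager-Kaiser energy: psi[n] = x[n]^2 - x[n-1]*x[n+1] (interior samples only)."""
    return [x[n] ** 2 - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]

def desa2(x, eps=1e-12):
    """DESA-2 estimates (assumed variant) of instantaneous amplitude and
    frequency in rad/sample, aligned with samples x[2] .. x[len(x)-3]."""
    # Symmetric difference y[n] = x[n+1] - x[n-1], defined for n = 1 .. len(x)-2.
    y = [x[n + 1] - x[n - 1] for n in range(1, len(x) - 1)]
    psi_x = tkeo(x)  # psi_x[i] belongs to sample n = i + 1
    psi_y = tkeo(y)  # psi_y[j] belongs to sample n = j + 2
    amp, freq = [], []
    for j, py in enumerate(psi_y):
        px = psi_x[j + 1]  # energy of x at the same sample n = j + 2
        if px <= eps or py <= eps:
            # Energy too small or negative: no reliable estimate at this sample.
            amp.append(0.0)
            freq.append(0.0)
            continue
        c = max(-1.0, min(1.0, 1.0 - py / (2.0 * px)))  # clamp for acos
        freq.append(0.5 * math.acos(c))
        amp.append(2.0 * px / math.sqrt(py))
    return amp, freq

def kl_divergence(p_counts, q_counts, alpha=1e-6):
    """KL divergence D(P||Q) in nats between two histograms over the same bins.
    alpha is an assumed additive smoothing term that avoids log(0)."""
    p_total = sum(p_counts) + alpha * len(p_counts)
    q_total = sum(q_counts) + alpha * len(q_counts)
    d = 0.0
    for p, q in zip(p_counts, q_counts):
        pp = (p + alpha) / p_total
        qq = (q + alpha) / q_total
        d += pp * math.log(pp / qq)
    return d
```

For a pure tone such as `x = [0.5 * math.cos(0.3 * n) for n in range(200)]`, `desa2(x)` should return amplitudes close to 0.5 and frequencies close to 0.3 rad/sample. Histograms of these estimates over voice-only and music-only segments could then be compared band by band with `kl_divergence`, a larger value suggesting a more discriminative band.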

Bibliographic record

Source
2010 IEEE Symposium on Industrial Electronics and Applications | 2010 | pp. 733-737 | 5 pages
Original format: PDF
CLC classification: Basic theory
Keywords
Music Voice Separation; Music discrimination; modulation features;

Similar literature

1. Monaural Musical Sound Separation Based on Pitch and Common Amplitude Modulation [J]. Li Y., Woodruff J., Wang D. IEEE Transactions on Audio, Speech, and Language Processing. 2009, no. 7
2. Automatic Music Mood Classification Based on Timbre and Modulation Features [J]. Ren Jia-Min, Wu Ming-Ju, Jang Jyh-Shing Roger. IEEE Transactions on Affective Computing. 2015, no. 3
3. Automatic Music Genre Classification Based on Modulation Spectral Analysis of Spectral and Cepstral Features [J]. Lee C.-H., Shih J.-L., Yu K.-M. IEEE Transactions on Multimedia. 2009, no. 4
4. Music and vocal separation using multiband modulation based features [C]. {missing} IEEE Symposium on Industrial Electronics and Applications. 2010
5. Exploitation of Phase and Vocal Excitation Modulation Features for Robust Speaker Recognition [D]. Wang, Ning. 2011
6. Attentional spreading to task-irrelevant object features: experimental support and a 3-step model of attention for object-based selection and feature-based processing modulation [O]. Detlef Wegener, Fingal Orlando Galashan, Maike Kathrin Aurich. 2014
7. Music and Vocal Separation Using Multi-Band Modulation Based Features [O]. Meghna P, G Sita. 2016
