首页> 外文OA文献 >A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval
【2h】

A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval

机译:伴奏声音的鲁棒性歌唱建模及其在歌手识别和基于音色相似性的音乐信息检索中的应用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

This paper describes a method of modeling the characteristics of a singing voice from polyphonic musical audio signals including sounds of various musical instruments. Because singing voices play an important role in musical pieces with vocals, such representation is useful for music information retrieval systems. The main problem in modeling the characteristics of a singing voice is the negative influences caused by accompaniment sounds. To solve this problem, we developed two methods, accompaniment sound reduction and reliable frame selection . The former makes it possible to calculate feature vectors that represent a spectral envelope of a singing voice after reducing accompaniment sounds. It first extracts the harmonic components of the predominant melody from sound mixtures and then resynthesizes the melody by using a sinusoidal model driven by these components. The latter method then estimates the reliability of frame of the obtained melody (i.e., the influence of accompaniment sound) by using two Gaussian mixture models (GMMs) for vocal and nonvocal frames to select the reliable vocal portions of musical pieces. Finally, each song is represented by its GMM consisting of the reliable frames. This new representation of the singing voice is demonstrated to improve the performance of an automatic singer identification system and to achieve an MIR system based on vocal timbre similarity.
机译:本文介绍了一种根据包含多种乐器声音的复音音乐音频信号来模拟歌唱声音特征的方法。因为唱歌声在带有人声的音乐作品中起着重要的作用,所以这种表示对于音乐信息检索系统很有用。对歌唱声音的特征进行建模的主要问题是由伴奏声音引起的负面影响。为了解决这个问题,我们开发了两种方法,伴奏声音降低和可靠的帧选择。前者使得在减少伴奏声音之后计算代表唱歌声音的频谱包络的​​特征向量成为可能。它首先从混音中提取主要旋律的谐波成分,然后使用由这些成分驱动的正弦模型重新合成旋律。然后,后一种方法通过使用用于声音和非声音帧的两个高斯混合模型(GMM)选择乐曲的可靠声音部分,来估计所获得的旋律的帧的可靠性(即,伴奏声音的影响)。最后,每首歌曲都由包含可靠帧的GMM表示。歌唱声音的这种新表示方式被证明可以改善自动歌手识别系统的性能,并实现基于人声音色相似性的MIR系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号