...
首页> 外文期刊>Consumer Electronics, IEEE Transactions on >Robust query-by-singing/humming system against background noise environments
【24h】

Robust query-by-singing/humming system against background noise environments

机译:针对背景噪声环境的强大的按歌/哼唱查询系统

获取原文
获取原文并翻译 | 示例

摘要

Under background noise environments, the performance of the Query-by-Singing/Humming (QbSH) system is considerably degraded. Since human pitch information is used as a feature vector for the QbSH system, a noise robust pitchestimation algorithm is inevitable. Thus, a novel pitch-estimation method is proposed by integrating temporal-autocorrelation and spectral-salience methods. As a pre-processing block, spectral smoothing is applied to enhance the stationarity of the noisy input signal. To calculate the similarity between the MIDI database and input humming signal, the dynamic time warping (DTW) algorithm is used. Jang¿s corpus and AURORA2 database are selected as humming and background noise signals, respectively. Compared with the standard pitch estimation algorithm in the ITU-T G.729 speech codec, the proposed pitch estimation method improves the average accuracy by 11.7% for the 0 dB signal-to-noise ratio (SNR) noise case. It also improves top-20 ratio and mean reciprocal rank (MRR) of the proposed QbSH system, on average, by 7.4% and 0.13, respectively.
机译:在背景噪声环境下,按唱歌/哼唱查询(QbSH)系统的性能会大大降低。由于人体音调信息被用作QbSH系统的特征向量,因此不可避免地要采用噪声鲁棒的音调估计算法。因此,通过结合时间自相关和频谱显着性方法,提出了一种新颖的基音估计方法。作为预处理模块,应用频谱平滑来增强噪声输入信号的平稳性。为了计算MIDI数据库和输入的嗡嗡声之间的相似度,使用了动态时间规整(DTW)算法。 Jang的语料库和AURORA2数据库分别被选作嗡嗡声和背景噪声信号。与ITU-T G.729语音编解码器中的标准音高估计算法相比,对于0 dB信噪比(SNR)噪声情况,所提出的音高估计方法将平均准确度提高了11.7%。它还使拟议的QbSH系统的前20名比率和平均倒数排名(MRR)平均分别提高7.4%和0.13。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号