Robust query-by-singing/humming system against background noise environments

Kichul Kim; Kang Ryoung Park; Sung-Joo Park; Soek-Pil Lee; Moo Young Kim

首页> 外文期刊>Consumer Electronics, IEEE Transactions on >Robust query-by-singing/humming system against background noise environments

【24h】

Robust query-by-singing/humming system against background noise environments

机译：针对背景噪声环境的强大的按歌/哼唱查询系统

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Under background noise environments, the performance of the Query-by-Singing/Humming (QbSH) system is considerably degraded. Since human pitch information is used as a feature vector for the QbSH system, a noise robust pitchestimation algorithm is inevitable. Thus, a novel pitch-estimation method is proposed by integrating temporal-autocorrelation and spectral-salience methods. As a pre-processing block, spectral smoothing is applied to enhance the stationarity of the noisy input signal. To calculate the similarity between the MIDI database and input humming signal, the dynamic time warping (DTW) algorithm is used. JangÃÂ¿s corpus and AURORA2 database are selected as humming and background noise signals, respectively. Compared with the standard pitch estimation algorithm in the ITU-T G.729 speech codec, the proposed pitch estimation method improves the average accuracy by 11.7% for the 0 dB signal-to-noise ratio (SNR) noise case. It also improves top-20 ratio and mean reciprocal rank (MRR) of the proposed QbSH system, on average, by 7.4% and 0.13, respectively.

机译：在背景噪声环境下，按唱歌/哼唱查询（QbSH）系统的性能会大大降低。由于人体音调信息被用作QbSH系统的特征向量，因此不可避免地要采用噪声鲁棒的音调估计算法。因此，通过结合时间自相关和频谱显着性方法，提出了一种新颖的基音估计方法。作为预处理模块，应用频谱平滑来增强噪声输入信号的平稳性。为了计算MIDI数据库和输入的嗡嗡声之间的相似度，使用了动态时间规整（DTW）算法。 Jang的语料库和AURORA2数据库分别被选作嗡嗡声和背景噪声信号。与ITU-T G.729语音编解码器中的标准音高估计算法相比，对于0 dB信噪比（SNR）噪声情况，所提出的音高估计方法将平均准确度提高了11.7％。它还使拟议的QbSH系统的前20名比率和平均倒数排名（MRR）平均分别提高7.4％和0.13。

著录项

来源
《Consumer Electronics, IEEE Transactions on 》 |2011年第2期| p.720-725| 共6页
作者
Kichul Kim; Kang Ryoung Park; Sung-Joo Park; Soek-Pil Lee; Moo Young Kim;
展开▼
作者单位

Human Computer Interaction Laboratory, Department of Information and Communication Engineering, Sejong University, Seoul, Korea;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Query-by-Singing/Humming; background noise; dynamic time warping; pitch estimation;

机译：哼唱查询;背景噪声;动态时间扭曲;音高估计;

相似文献

外文文献
中文文献
专利

1. An FFT-based fast melody comparison method for query-by-singing/humming systems [J] . Wei-Ho Tsai, Yu-Ming Tu, Cin-Hao Ma Pattern recognition letters . 2012 ,第16期

机译：一种基于FFT的单音/单音查询系统的快速旋律比较方法
2. Fast Query-by-Singing/Humming System That Combines Linear Scaling and Quantized Dynamic Time Warping Algorithm [J] . Gi PyoNam, Kang RyoungPark International Journal of Distributed Sensor Networks . 2015 ,第3期

机译：线性定标和量化动态时间规整算法相结合的快速按唱歌/哼唱查询系统
3. Multi-Classifier Based on a Query-by-Singing/Humming System [J] . Gi Pyo Nam, Kang Ryoung Park, Sergei Odintsov Symmetry . 2015 ,第2期

机译：基于唱歌/哼声查询的多分类器
4. Implementation of a practical query-by-singing/humming (QbSH) system and its commercial applications [C] . Song Chai-Jong, Park Hochong, Yang Chang-Mo, IEEE International Conference on Consumer Electronics . 2013

机译：实用的单音哼唱查询系统的实现及其商业应用
5. Robust background subtraction for moving cameras and their applications in ego-vision systems. [D] . Sajid, Hasan. 2016

机译：用于移动摄像机及其在自我视觉系统中的应用的稳健背景扣除。
6. LTD windows of the STDP learning rule and synaptic connections having a large transmission delay enable robust sequence learning amid background noise [O] . Hatsuo Hayashi, Jun Igarashi 2009

机译：STDP学习规则的LTD窗口和具有较大传输延迟的突触连接可在背景噪声中实现强大的序列学习
7. Multi-Classifier Based on a Query-by-Singing/Humming System [O] . Gi Pyo Nam, Kang Ryoung Park 2015

机译：基于逐个查询/哼唱系统的多分类器
8. Cancellation Technique for Reducing Background Noise Within Turbulent Flow Environments Characterized by Pipes and Annuli [R] . Horne, M. P., Hendricks, E. W., Handler, R. A. 1988

机译：在管道和环形特征的湍流环境中降低背景噪声的消除技术

Robust query-by-singing/humming system against background noise environments

摘要

著录项

相似文献

相关主题

期刊订阅