【24h】

Retrieving sounds by vocal imitation recognition

机译:通过声乐模仿识别检索声音

获取原文

摘要

Vocal imitation is widely used in human communication. In this paper, we propose an approach to automatically recognize the concept of a vocal imitation, and then retrieve sounds of this concept. Because different acoustic aspects (e.g., pitch, loudness, timbre) are emphasized in imitating different sounds, a key challenge in vocal imitation recognition is to extract appropriate features. Hand-crafted features may not work well for a large variety of imitations. Instead, we use a stacked auto-encoder to automatically learn features from a set of vocal imitations in an unsupervised way. Then, a multi-class SVM is trained for sound concepts of interest using their training imitations. Given a new vocal imitation of a sound concept of interest, our system can recognize its underlying concept and return it with a high rank among all concepts. Experiments show that our system significantly outperforms an MFCC-based comparison system in both classification and retrieval.
机译:声乐模仿广泛用于人类交流。在本文中,我们提出了一种自动识别声乐模仿的概念的方法,然后检索这个概念的声音。因为在模仿不同的声音时强调了不同的声学方面(例如,响度,响度,TIMBRE),所以声乐模仿识别中的一个关键挑战是提取适当的特征。手工制作的功能可能无法适用于各种各样的模仿。相反,我们使用堆叠的自动编码器以无监视的方式自动学习来自一组声学模仿的功能。然后,使用他们的训练模仿,为乐趣的景观感染多级SVM。鉴于新的声音模仿声音概念的兴趣概念,我们的系统可以识别其潜在的概念,并在所有概念中以高级返回它。实验表明,我们的系统在分类​​和检索中显着优于基于MFCC的比较系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号