首页> 外文会议>International Conference on Multimedia and Expo >MULTIDIMENSIONAL HUMMING TRANSCRIPTION USING A STATISTICAL APPROACH FOR QUERY BY HUMMING SYSTEMS
【24h】

MULTIDIMENSIONAL HUMMING TRANSCRIPTION USING A STATISTICAL APPROACH FOR QUERY BY HUMMING SYSTEMS

机译:利用嗡嗡声使用查询统计方法的多维嗡嗡声转录

获取原文

摘要

A new statistical pattern recognition approach applied to human humming transcription is proposed in this research. A music note has two important attributes, i.e. pitch and duration. The proposed algorithm generates multidimensional humming transcriptions, which contain both pitch and duration information. Query by humming provides a natural means for content-based retrieval from music databases, and this research provides a robust front-end for such an application. The segment of a note in the humming waveform is modeled by a hidden Markov model (HMM) while the pitch of the note is modeled by a pitch model using a Gaussian mixture model. Preliminary real-time recognition experiments are carried out with models trained by data obtained from eight human objects, and an overall correct recognition rate of around 80% is demonstrated.
机译:在本研究中提出了一种应用于人哼唱转录的新统计模式识别方法。音乐注意有两个重要的属性,即音高和持续时间。该算法产生多维蜂窝录像,其包含音高和持续时间信息。通过Humming查询提供了一种自然的方法,可以从音乐数据库中获取基于内容的检索,并且该研究为这种应用提供了一种强大的前端。蜂扣波形中的音符的段由隐马尔可夫模型(HMM)建模,而使用高斯混合模型的音调的音调由音高模型建模。初步实时识别实验与由八个人物获得的数据训练的模型进行,总体正确识别率约为80%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号