首页> 外文会议>International Conference on Communication and Computer Engineering >Application of SHAZAM-Based Audio Fingerprinting for Multilingual Indian Song Retrieval
【24h】

Application of SHAZAM-Based Audio Fingerprinting for Multilingual Indian Song Retrieval

机译:Shazam的音频指纹识别在多语种印度歌曲检索中的应用

获取原文

摘要

Extracting film songs from a multilingual database based on a query clip is a challenging task. The challenge stems from the subtle variations in pitch and rhythm, which accompany the change in the singer's voice, style, and orchestration, change in language and even a change in gender. The fingerprinting algorithm must be designed to capture the base tune in the composition and not the adaptations (or variations which include lyrical modifications and changes in the singer's voice). The SHAZAM system was developed for capturing cover audio pieces from millions of Western songs stored in the database, with the objective of tapping into the melodic construct of the song (devoid of other forms of embellishments). When applied to the Indian database the system was found less effective, due to subtle changes in both rhythm and melody mainly due to the semiclassical nature of Indian film songs. The retrieval accuracy was found to be 85 %. Potential reasons for the failure of this SHAZAM system have been discussed with examples.
机译:根据查询剪辑从多语言数据库中提取胶片歌曲是一个具有挑战性的任务。挑战源于音高和节奏的微妙变化,歌手的语音,风格和管弦乐流变,语言变化甚至是性别的变化。必须设计指纹算法以捕获构图中的基调,而不是适应(或包括歌词修改和歌手语音的变化)的适应性(或变化)。开发了Shazam系统,用于从存储在数据库中的数百万西方歌曲中捕获覆盖音频件,其目的是进入歌曲的旋律构建(缺乏其他形式的装饰)。当应用于印度数据库时,系统被发现效果较差,由于节奏和旋律的微妙变化主要是由于印度电影歌曲的半思法性质。检索准确性被发现为85%。已经讨论了这种Shazam系统失败的潜在原因。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号