首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >An audio fingerprinting system for live version identification using image processing techniques
【24h】

An audio fingerprinting system for live version identification using image processing techniques

机译:使用图像处理技术的实时版本识别音频指纹识别系统

获取原文

摘要

Suppose that you are at a music festival checking on an artist, and you would like to quickly know about the song that is being played (e.g., title, lyrics, album, etc.). If you have a smartphone, you could record a sample of the live performance and compare it against a database of existing recordings from the artist. Services such as Shazam or SoundHound will not work here, as this is not the typical framework for audio fingerprinting or query-by-humming systems, as a live performance is neither identical to its studio version (e.g., variations in instrumentation, key, tempo, etc.) nor it is a hummed or sung melody. We propose an audio fingerprinting system that can deal with live version identification by using image processing techniques. Compact fingerprints are derived using a log-frequency spectrogram and an adaptive thresholding method, and template matching is performed using the Hamming similarity and the Hough Transform.
机译:假设您在艺术家的音乐节上检查,您希望快速了解正在播放的歌曲(例如,标题,歌词,专辑等)。如果您有智能手机,您可以录制现场性能的样本,并将其与艺术家的现有录音的数据库进行比较。 Shazam或Soundhound等服务在这里不起作用,因为这不是音频指纹或查询逐个系统的典型框架,因为现场性能既不与其工作室版本相同(例如,仪器的变化,钥匙,速度等等),也不是嗡嗡声或刚刚的旋律。我们提出了一种音频指纹系统,可以通过使用图像处理技术来处理实时版本识别。使用逻辑频谱谱图和自适应阈值方法导出紧凑的指纹,使用汉明相似度和霍夫变换来执行模板匹配。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号