首页> 外文期刊>Journal on multimodal user interfaces >Synchronizing multimodal recordings using audio-to-audio alignment An application of acoustic fingerprinting to facilitate music interaction research
【24h】

Synchronizing multimodal recordings using audio-to-audio alignment An application of acoustic fingerprinting to facilitate music interaction research

机译:使用音频到音频对齐来同步多模式录音声学指纹识别在促进音乐交互研究中的应用

获取原文
获取原文并翻译 | 示例

摘要

Research on the interaction between movement and music often involves analysis of multi-track audio, video streams and sensor data. To facilitate such research a framework is presented here that allows synchronization of multimodal data. A low cost approach is proposed to synchronize streams by embedding ambient audio into each data-stream. This effectively reduces the synchronization problem to audio-to-audio alignment. As a part of the framework a robust, computationally efficient audio-to-audio alignment algorithm is presented for reliable synchronization of embedded audio streams of varying quality. The algorithm uses audio fingerprinting techniques to measure offsets. It also identifies drift and dropped samples, which makes it possible to find a synchronization solution under such circumstances as well. The framework is evaluated with synthetic signals and a case study, showing millisecond accurate synchronization.
机译:对运动与音乐之间相互作用的研究通常涉及对多轨音频,视频流和传感器数据的分析。为了促进此类研究,此处提出了一个框架,该框架允许同步多模式数据。提出了一种通过将环境音频嵌入每个数据流中来同步流的低成本方法。这有效地将同步问题减少到音频到音频对齐。作为框架的一部分,提出了一种鲁棒的,计算效率高的音频到音频对齐算法,用于可靠地同步变化质量的嵌入式音频流。该算法使用音频指纹技术来测量偏移量。它还可以识别漂移和丢失的样本,这也使得在这种情况下也可以找到同步解决方案。该框架通过综合信号和案例研究进行了评估,显示了毫秒级的精确同步。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号