Conference: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing

Integrating frame-based and segment-based dynamic time warping for unsupervised spoken term detection with spoken queries


Abstract

Rapidly increasing quantities of multimedia and spoken content demand fast and accurate retrieval approaches for convenient browsing. Spoken documents cover a wide variety of acoustic and linguistic conditions, which makes supervised training of well-matched acoustic/language models very difficult. Unsupervised methods using frame-based dynamic time warping (DTW) require no acoustic/language models but incur a high computation load. Segment-based DTW was therefore proposed to relieve the computation load, at the cost of degraded detection performance. In this paper, we refine segment-based DTW by allowing deletion of the end segments of the query to improve detection performance. The search space is also reduced by segment similarity constraints. We also propose a two-pass framework: segment-based DTW is performed in the first pass to locate hypothesized spoken term regions, and frame-based DTW is used for precise rescoring in the second pass. Pseudo relevance feedback is then used to expand the acoustic variations of the query. Compared to frame-based DTW alone, we obtain significantly higher detection performance at significantly lower computation load.
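The two-pass idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes uniform fixed-length segments represented by their mean feature vector (the abstract does not say how segments are formed), Euclidean frame distance, and a basic subsequence DTW. It omits the end-segment deletion, the segment similarity constraints, and the pseudo relevance feedback. All names (`subsequence_dtw`, `segment_means`, `two_pass_detect`, `seg_len`, `margin`) are hypothetical.

```python
import numpy as np


def subsequence_dtw(query, doc):
    """Find the best match of `query` (n, d) anywhere inside `doc` (m, d).

    Returns (cost normalized by query length, start index, end index).
    """
    # Euclidean distance between every query frame and every document frame.
    dist = np.linalg.norm(query[:, None, :] - doc[None, :, :], axis=2)
    n, m = dist.shape
    acc = np.full((n, m), np.inf)        # accumulated cost
    start = np.zeros((n, m), dtype=int)  # document index where each path began
    acc[0] = dist[0]                     # a match may begin at any document frame
    start[0] = np.arange(m)
    for i in range(1, n):
        for j in range(m):
            # Allowed predecessors: (i-1, j), (i, j-1), (i-1, j-1).
            cands = [(acc[i - 1, j], start[i - 1, j])]
            if j > 0:
                cands.append((acc[i, j - 1], start[i, j - 1]))
                cands.append((acc[i - 1, j - 1], start[i - 1, j - 1]))
            best_cost, best_start = min(cands, key=lambda c: c[0])
            acc[i, j] = dist[i, j] + best_cost
            start[i, j] = best_start
    end = int(np.argmin(acc[-1]))        # a match may end at any document frame
    return float(acc[-1, end] / n), int(start[-1, end]), end


def segment_means(frames, seg_len):
    """Assumption: uniform segments of `seg_len` frames, each reduced to its mean."""
    n_seg = int(np.ceil(len(frames) / seg_len))
    return np.stack(
        [frames[k * seg_len:(k + 1) * seg_len].mean(axis=0) for k in range(n_seg)]
    )


def two_pass_detect(query, doc, seg_len=4, margin=1):
    """First pass: coarse segment-level DTW. Second pass: frame-level rescoring."""
    # Pass 1: locate a hypothesized region on the much shorter segment sequences.
    _, seg_start, seg_end = subsequence_dtw(
        segment_means(query, seg_len), segment_means(doc, seg_len)
    )
    # Map the segment region back to frames, widened by `margin` segments.
    lo = max(0, (seg_start - margin) * seg_len)
    hi = min(len(doc), (seg_end + 1 + margin) * seg_len)
    # Pass 2: frame-based DTW restricted to the hypothesized region.
    cost, s, e = subsequence_dtw(query, doc[lo:hi])
    return cost, lo + s, lo + e
```

In this sketch the first pass runs DTW on sequences that are roughly `seg_len` times shorter on each axis, and the second pass only touches the frames inside the hypothesized region, which is where the reduction in computation comes from. Calling `two_pass_detect(query, doc)` with two `(frames, features)` arrays returns the rescored cost and the frame boundaries of the detected region.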
