首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >RICH SYSTEM COMBINATION FOR KEYWORD SPOTTING IN NOISY AND ACOUSTICALLY HETEROGENEOUS AUDIO STREAMS
【24h】

RICH SYSTEM COMBINATION FOR KEYWORD SPOTTING IN NOISY AND ACOUSTICALLY HETEROGENEOUS AUDIO STREAMS

机译:丰富的系统组合,用于嘈杂和声学的异构音频流中的关键字斑点

获取原文

摘要

We address the problem of retrieving spoken information from noisy and heterogeneous audio archives using system combination with a rich and diverse set of noise-robust modules. Audio search applications so far have focused on constrained domains or genres and not-so-noisy and heterogeneous acoustic or channel conditions. In this paper, our focus is to improve the accuracy of a keyword spotting system in highly degraded and diverse channel conditions by employing multiple recognition systems in parallel with different robust frontends and modeling choices, as well as different representations during audio indexing and search (words vs. subword units). After aligning keyword hits from different systems, we employ system combination at the score level using a logistic-regression-based classifier. Side information such as the output of an acoustic condition identification module is used to guide system combination system that is trained on a held-out dataset. Lattice-based indexing and search is used in all keyword spotting systems. We present improvements in probability-miss at a fixed probability-false-alarm by employing our proposed rich system combination approach on DARPA Robust Automatic Transcription of Speech (RATS) PhaseI evaluation data that contains highly degraded channel recordings (signal-to-noise ratio levels as low as 0 dB) and different channel characteristics.
机译:我们解决检索使用了丰富而多样的噪音,耐用型模块系统组合嘈杂的,异构的音频档案语音信息的问题。音频搜索应用至今都集中在受限的域或流派和不那么嘈杂声异质或信道条件。在本文中,我们的重点是通过用不同的鲁棒前端并行使用多个识别系统和建模过程中的音频索引的选择,以及不同的表示,以提高在高度降解的和多样的信道条件的关键词定位系统的准确度和搜索(字与子字单元)。来自不同系统对准的关键字点击之后,我们采用系统组合在比分级使用基于逻辑回归分类。侧信息,诸如声学条件识别模块的输出用于引导被在保持输出数据集训练的系统的组合系统。格为基础的索引和搜索是在所有的关键词识别系统中使用。我们采用的辞DARPA强大的自动转录(RATS),其中包含高度退化声道录音(信噪比水平PhaseI评估数据我们提出的丰富的系统相结合的办法概率错过本发明的改进在一个固定的概率假警报低至0 dB为单位)和不同的信道特性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号