...
首页> 外文期刊>Circuits, systems, and signal processing >Keyword Spotting in Continuous Speech Using Spectral and Prosodic Information Fusion
【24h】

Keyword Spotting in Continuous Speech Using Spectral and Prosodic Information Fusion

机译:频谱和韵律信息融合在连续语音中发现关键词

获取原文
获取原文并翻译 | 示例

摘要

Keyword spotting in a continuous speech is a challenging problem and has relevance in applications like audio indexing and music retrieval. In this work, the problem of keyword spotting is addressed by utilizing the complementary information present in spectral and prosodic features of the speech signal. A thorough analysis of the complementary information is performed on a large Hindi language database developed for this purpose. Phonetic and prosodic distribution analysis is performed toward this end, using canonical correlation and Student T-distance function. Motivated by these analyses, novel methods for spectral and prosodic information fusion that optimize a combined error function is proposed. The fusion methods are developed both at the feature and the model level. Improved syllable sequence prediction and keyword spotting performance are obtained using these methods when compared to conventional methods of keyword spotting. Additionally, in order to enable comparison with the state-of-the-art deep learning-based methods, a novel method for improved syllable sequence prediction using deep denoising autoencoders is proposed. The performance of the methods proposed in this work is evaluated for keyword spotting using a syllable sliding protocol over a large Hindi database. Reasonable performance improvements are noted from the experimental results on syllable sequence prediction, keyword spotting, and audio retrieval.
机译:在连续语音中发现关键词是一个具有挑战性的问题,并且与音频索引和音乐检索等应用程序相关。在这项工作中,通过利用语音信号频谱和韵律特征中存在的补充信息解决了关键词发现的问题。为此目的而开发的大型印地语数据库对补充信息进行了全面分析。为此,使用规范相关和学生T距离函数进行语音和韵律分布分析。受这些分析的启发,提出了一种优化频谱和韵律信息融合组合误差函数的新方法。融合方法是在特征和模型级别上开发的。与传统的关键词搜索方法相比,使用这些方法可以获得更好的音节序列预测和关键词搜索性能。此外,为了能够与基于深度学习的最新方法进行比较,提出了一种使用深度去噪自动编码器改进音节序列预测的新方法。在大型印地语数据库上使用音节滑动协议评估了本文中提出的方法的性能,以发现关键字。从音节序列预测,关键词识别和音频检索的实验结果中可以注意到,合理的性能改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号