首页> 外文期刊>IEEE Transactions on Speech and Audio Proceeding >Transform representation of the spectra of acoustic speech segments with applications. I. General approach and application to speech recognition
【24h】

Transform representation of the spectra of acoustic speech segments with applications. I. General approach and application to speech recognition

机译:借助应用程序来变换语音片段的频谱表示。一,一般方法及其在语音识别中的应用

获取原文
获取原文并翻译 | 示例

摘要

An approach to modeling and capturing the time-varying structure of the spectral envelope of speech is reported. Acoustic subword decomposition and the Karhunen-Loeve transform (KLT) are used to extract and efficiently represent the highly correlated structure of the spectral envelope. Integration of the KLT with acoustic subword modeling provides concise representation of both steady-state and dynamic features of the spectra in a unified framework that very effectively captures acoustic-phonetic patterns. The physiological and perceptual basis for the approach, the frame-based and acoustic-subword-based spectral representation, and applications to speaker-dependent recognition are presented. The performance of the recognition algorithm based on this approach compares favorably with that of other techniques.
机译:报告了一种建模和捕获语音频谱包络的​​时变结构的方法。声学子词分解和Karhunen-Loeve变换(KLT)用于提取并有效表示频谱包络的​​高度相关结构。 KLT与声学子词建模的集成在统一框架中提供了频谱的稳态和动态特征的简洁表示,可以非常有效地捕获声学模式。提出了该方法的生理学和知觉基础,基于帧和基于声学子词的频谱表示以及在说话者相关识别中的应用。基于这种方法的识别算法的性能可与其他技术相媲美。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号