Conference: European Signal Processing Conference

Multichannel Audio Front-End for Far-Field Automatic Speech Recognition

Abstract

Far-field automatic speech recognition (ASR) is a key enabling technology that allows untethered and natural voice interaction between users and the Amazon Echo family of products. A key component in realizing far-field ASR on these products is the suite of audio front-end (AFE) algorithms, which helps mitigate acoustic environmental challenges and thereby improves ASR performance. In this paper, we discuss the key algorithms within the AFE and provide insights into how they help mitigate the various acoustic challenges of far-field processing. We also provide insights into the audio algorithm architecture adopted for the AFE, and we discuss ongoing and future research.