
Multichannel Audio Front-End for Far-Field Automatic Speech Recognition


Abstract

Far-field automatic speech recognition (ASR) is a key enabling technology that allows untethered, natural voice interaction between users and the Amazon Echo family of products. A key component in realizing far-field ASR on these products is the suite of audio front-end (AFE) algorithms, which mitigates acoustic environmental challenges and thereby improves ASR performance. In this paper, we discuss the key algorithms within the AFE and explain how they mitigate the various acoustic challenges of far-field processing. We also describe the audio algorithm architecture adopted for the AFE and discuss ongoing and future research.

Bibliographic details

Source
European Signal Processing Conference | 2018 | pp. 1527-1531 | 5 pages
Authors
Amit Chhetri; Philip Hilmes; Trausti Kristjansson; Wai Chu; Mohamed Mansour; Xiaoxue Li; Xianxian Zhang
Original format
PDF
Keywords
Acoustics; Signal processing algorithms; Array signal processing; Engines; Measurement; Microphone arrays;


Similar literature

1. Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems [J]. Xugang Lu, Masashi Unoki, Masato Akagi. Acoustical Science and Technology. 2008, No. 6
2. Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems [J]. Masashi Unoki, Masato Akagi, Xugang Lu. Acoustical Science and Technology. 2008, No. 6
3. Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition [J]. Shimada Kazuki, Bando Yoshiaki, Mimura Masato. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2019, No. 5
4. Multichannel Audio Front-End for Far-Field Automatic Speech Recognition [C]. Amit Chhetri, Philip Hilmes, Trausti Kristjansson. European Signal Processing Conference. 2018
5. Advances in Audiovisual Speech Processing for Robust Voice Activity Detection and Automatic Speech Recognition [D]. Tao, Fei. 2018
6. Lipreading and Audiovisual Speech Recognition across the Adult Lifespan: Implications for Audiovisual Integration [O]. Nancy Tye-Murray, Brent Spehar, Joel Myerson
7. Building a Visual Front-end for Audio-Visual Automatic Speech Recognition in Vehicle Environments [O]. Robert Hursig, Jane Zhang. 2011
