首页> 外文期刊>IEEE Transactions on Consumer Electronics >Speech Enhancement Parameter Adjustment to Maximize Accuracy of Automatic Speech Recognition
【24h】

Speech Enhancement Parameter Adjustment to Maximize Accuracy of Automatic Speech Recognition

机译:语音增强参数调整,以最大限度地提高自动语音识别的准确性

获取原文
获取原文并翻译 | 示例

摘要

Consumer electronics equipped with a microphone array, such as car navigation devices and headsets commonly implement speech enhancement techniques based on the gradient method to cope with additive noise. However, while these techniques had been originally developed for voice communication and can maximize the signal-to-distortion ratio (SDR), they cannot always maximize automatic speech recognition (ASR) accuracy. For this reason, the front-end speech enhancement parameters have been adjusted by human experts to each environment and acoustic model. In this study, we developed a novel system for maximizing the accuracy of a given ASR engine by automatically adjusting the front-end speech enhancement. The proposed method allows consumers to use ASR through the consumer electronics with less stress when ambient noise varies. A genetic algorithm (GA) is used to generate parameter values of the front-end speech enhancement for particular environments. The generated values can be dynamically assigned to input speech signals by preliminarily clustering the environments based on noise features. In evaluations, parameter values determined by our method outperformed one adjusted by a human expert.
机译:消费电子设备配备有麦克风阵列,例如汽车导航设备和耳机通常基于梯度方法实现语音增强技术,以应对添加性噪声。但是,虽然这些技术最初用于语音通信,并且可以最大化信号到失真率(SDR),但它们不能总是最大化自动语音识别(ASR)精度。因此,人类专家对每个环境和声学模型进行了前端语音增强参数。在这项研究中,我们通过自动调整前端语音增强,开发了一种用于最大化给定ASR发动机的准确性的新系统。所提出的方法允许消费者在环境噪声变化时,消费者通过消费电子设备的压力较小。遗传算法(GA)用于生成特定环境前端语音增强的参数值。可以通过基于噪声特征预先聚类环境来动态地分配生成的值以输入语音信号。在评估中,我们的方法确定的参数值优于人类专家调整的参数值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号