首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >TRAINING WIDEBAND ACOUSTIC MODELS USING MIXED-BANDWIDTH TRAINING DATA VIA FEATURE BANDWIDTH EXTENSION
【24h】

TRAINING WIDEBAND ACOUSTIC MODELS USING MIXED-BANDWIDTH TRAINING DATA VIA FEATURE BANDWIDTH EXTENSION

机译:培训使用混合带宽培训数据通过功能带宽扩展训练宽带声学模型

获取原文

摘要

One serious difficulty in the deployment of wideband speech recognition systems for new tasks is the expense in both time and cost of obtaining sufficient training data. A more economical approach is to collect telephone speech and then restrict the application to operate at the telephone bandwidth. However, this generally results in sub-optimal performance. In this paper, we propose a new algorithm for training wideband acoustic models that requires only a small amount of wideband speech augmented by a larger amount of narrowband speech. The algorithm operates by first converting the narrowband features to wideband features through a process called Feature Bandwidth Extension. The bandwidth-extended features are then combined with available wideband data to train the acoustic models using a modified version of the conventional forward-backward algorithm. Experiments performed using wideband speech and telephone speech demonstrate that the proposed mixed-bandwidth training algorithm results in significant improvements in recognition accuracy over conventional training strategies when the amount of wideband data is limited.
机译:为新任务部署宽带语音识别系统的一个严重困难是获得足够训练数据的时间和成本的费用。一种更经济的方法是收集电话语音,然后限制应用程序在电话带宽中运行。但是,这通常会导致次优性能。在本文中,我们提出了一种训练宽带声学模型的新算法,该算法仅需要少量的宽带语音来增强更多的窄带语音。该算法通过首先通过称为特征带宽扩展的过程将窄带特征转换为宽带特征来操作。然后将带宽扩展功能与可用的宽带数据组合以使用传统的前后算法的修改版本训练声学模型。使用宽带语音和电话语音进行的实验表明,当宽带数据量有限时,所提出的混合带宽训练算法导致识别准确性的识别准确性的显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号