TRAINING WIDEBAND ACOUSTIC MODELS USING MIXED-BANDWIDTH TRAINING DATA VIA FEATURE BANDWIDTH EXTENSION

机译：培训使用混合带宽培训数据通过功能带宽扩展训练宽带声学模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

One serious difficulty in the deployment of wideband speech recognition systems for new tasks is the expense in both time and cost of obtaining sufficient training data. A more economical approach is to collect telephone speech and then restrict the application to operate at the telephone bandwidth. However, this generally results in sub-optimal performance. In this paper, we propose a new algorithm for training wideband acoustic models that requires only a small amount of wideband speech augmented by a larger amount of narrowband speech. The algorithm operates by first converting the narrowband features to wideband features through a process called Feature Bandwidth Extension. The bandwidth-extended features are then combined with available wideband data to train the acoustic models using a modified version of the conventional forward-backward algorithm. Experiments performed using wideband speech and telephone speech demonstrate that the proposed mixed-bandwidth training algorithm results in significant improvements in recognition accuracy over conventional training strategies when the amount of wideband data is limited.

机译：为新任务部署宽带语音识别系统的一个严重困难是获得足够训练数据的时间和成本的费用。一种更经济的方法是收集电话语音，然后限制应用程序在电话带宽中运行。但是，这通常会导致次优性能。在本文中，我们提出了一种训练宽带声学模型的新算法，该算法仅需要少量的宽带语音来增强更多的窄带语音。该算法通过首先通过称为特征带宽扩展的过程将窄带特征转换为宽带特征来操作。然后将带宽扩展功能与可用的宽带数据组合以使用传统的前后算法的修改版本训练声学模型。使用宽带语音和电话语音进行的实验表明，当宽带数据量有限时，所提出的混合带宽训练算法导致识别准确性的识别准确性的显着改进。

著录项

来源
《IEEE International Conference on Acoustics, Speech, and Signal Processing》|2005年||共4页
会议地点
作者
Michael L. Seltzer; Alex Acero;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词

相似文献

外文文献
中文文献
专利

1. Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition [J] . Michael L. Seltzer, Alex Acero IEEE transactions on audio, speech and language processing . 2007,第1期

机译：使用混合带宽训练数据训练语音识别的宽带声学模型
2. Training data selection for improving discriminative training of acoustic models [J] . Berlin Chen, Shih-Hung Liu, Fang-Hui Chu Pattern recognition letters . 2009,第13期

机译：选择训练数据以改善声学模型的判别训练
3. Building Acoustic Model Ensembles by Data Sampling With Enhanced Trainings and Features [J] . Chen X., Zhao Y. Audio, Speech, and Language Processing, IEEE Transactions on . 2013,第3期

机译：通过具有增强的培训和功能的数据采样来构建声学模型集合
4. TRAINING WIDEBAND ACOUSTIC MODELS USING MIXED-BANDWIDTH TRAINING DATA VIA FEATURE BANDWIDTH EXTENSION [C] . Michael L. Seltzer, Alex Acero IEEE International Conference on Acoustics, Speech, and Signal Processing . 2005

机译：培训使用混合带宽培训数据通过功能带宽扩展训练宽带声学模型
5. Evaluation of Synthetic Training Data and Training-Data-Augmentation Techniques for Object Detection in Ground-Penetrating Radar Data using Deep-Learning Models [D] . Ruggiero, Jean. 2021

机译：使用深度学习模型评估用于地面穿透雷达数据的对象检测的综合训练数据和训练数据增强技术
6. Improving Robustness of Deep Neural Network Acoustic Models via Speech Separation and Joint Adaptive Training [O] . Arun Narayanan, DeLiang Wang -1

机译：通过语音分离和联合自适应训练提高深度神经网络声学模型的鲁棒性
7. TRAINING WIDEBAND ACOUSTIC MODELS USING MIXED-BANDWIDTH TRAINING DATA VIA FEATURE BANDWIDTH EXTENSION [O] . 2008

机译：通过特征带宽扩展使用混合带宽训练数据训练宽带声学模型
8. Extension of Training Extension Course Cost and Training Effectiveness Analysis Data Collection [R] . Bercos, J., Eakins, R. C. 1985

机译：延伸培训推广课程成本与培训效果分析数据收集

TRAINING WIDEBAND ACOUSTIC MODELS USING MIXED-BANDWIDTH TRAINING DATA VIA FEATURE BANDWIDTH EXTENSION

摘要

著录项

相似文献

相关主题

期刊订阅