Articulatory Trajectories for Large-Vocabulary Speech Recognition.


Abstract

Studies have demonstrated that articulatory information can model speech variability effectively and can potentially help to improve speech recognition performance. Most studies involving articulatory information have focused on estimating it from speech, and few have actually used such features for speech recognition. Speech recognition studies using articulatory information have mostly been confined to digit or medium-vocabulary recognition, and efforts to incorporate it into large-vocabulary systems have been limited. We present a neural network model that estimates articulatory trajectories from speech signals; the model was trained on synthetic speech generated by Haskins Laboratories' task-dynamic model of speech production. The trained model was applied to natural speech, and the estimated articulatory trajectories were used in conjunction with standard cepstral features to train acoustic models for large-vocabulary recognition systems. Two different large-vocabulary English datasets were used in the experiments reported here. Results indicate that employing articulatory information improves speech recognition performance not only under clean conditions but also under noisy background conditions. Perceptually motivated robust features were also explored in this study, and the best performance was obtained when systems based on articulatory, standard cepstral, and perceptually motivated features were all combined.
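
The feature pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the one-hidden-layer network, the random untrained weights, and the dimensions (13 cepstral coefficients, 8 articulatory values per frame) are all assumptions chosen for illustration, and the helper names are hypothetical. It shows only the general idea of estimating a trajectory vector for each frame and appending it to the cepstral features before acoustic-model training.

```python
import math
import random


def dense(frame, weights, biases, activation=None):
    """Apply one fully connected layer to a single frame (a list of floats)."""
    out = [sum(x * w for x, w in zip(frame, unit)) + b
           for unit, b in zip(weights, biases)]
    return [activation(v) for v in out] if activation else out


def estimate_trajectories(frames, hidden_layer, output_layer):
    """Map each acoustic frame to a vector of estimated articulatory values."""
    result = []
    for frame in frames:
        hidden = dense(frame, *hidden_layer, activation=math.tanh)
        result.append(dense(hidden, *output_layer))
    return result


def combine_features(cepstra, trajectories):
    """Append the estimated trajectories to the cepstral features, frame by frame."""
    if len(cepstra) != len(trajectories):
        raise ValueError("feature streams must have the same number of frames")
    return [c + t for c, t in zip(cepstra, trajectories)]


def random_layer(n_in, n_out, rng):
    """Hypothetical stand-in for trained parameters: small random weights, zero biases."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


# Assumed sizes, not taken from the paper.
N_FRAMES, N_CEPSTRA, N_HIDDEN, N_TRAJ = 50, 13, 16, 8

rng = random.Random(0)
# Random numbers stand in for real cepstral features of an utterance.
cepstra = [[rng.gauss(0, 1) for _ in range(N_CEPSTRA)] for _ in range(N_FRAMES)]
hidden_layer = random_layer(N_CEPSTRA, N_HIDDEN, rng)
output_layer = random_layer(N_HIDDEN, N_TRAJ, rng)

trajectories = estimate_trajectories(cepstra, hidden_layer, output_layer)
combined = combine_features(cepstra, trajectories)
print(len(combined), len(combined[0]))
```

In a real system the network weights would come from training on the synthetic speech data, and the combined frames would be passed to the acoustic-model trainer in place of the cepstral features alone.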

Bibliographic record

Authors
A. Stolcke, C. Richey, H. Nam, J. Yuan, M. Liberman, V. Mitra, W. Wang
Year: 2013
Pages: 1-5
Total pages: 5
Original format: PDF
Text language: English
Chinese Library Classification: Industrial technology
Keywords
Acoustic models; Articulatory; Artificial neural networks; Audiovisual aids; Automatic speech recognition; Component report; Computer vision; Image processing; Large vocabulary; Presentation slides; Sensor fusion; Trajectories; Vocal tract variables; Workshops;

Similar literature

1. Encoding of Articulatory Kinematic Trajectories in Human Speech Sensorimotor Cortex [J]. Josh Chartier, Gopala K. Anumanchipalli, Keith Johnson, Neuron. 2018, No. 5
2. The use of articulatory movement data in speech synthesis applications: An overview: Application of articulatory movements using machine learning algorithms [J]. Junichi Yamagishi, Korin Richmond, Zhenhua Ling, Acoustical Science and Technology. 2015, No. 6
3. A study of acoustic-to-articulatory inversion of speech by analysis-by-synthesis using chain matrices and the Maeda articulatory model [J]. Sankaran Panchapagesan, Abeer Alwan, The Journal of the Acoustical Society of America. 2011, No. 4
4. Articulatory trajectories for large-vocabulary speech recognition [C]. Mitra Vikramjit, Wang Wen, Stolcke Andreas, IEEE International Conference on Acoustics, Speech and Signal Processing. 2013
5. Balancing model resolution and generalizability in large-vocabulary continuous speech recognition [D]. Luo, Xiaoqiang. 1999
6. Encoding of articulatory kinematic trajectories in human speech sensorimotor cortex [O]. Josh Chartier, Gopala K. Anumanchipalli, Keith Johnson
7. Articulatory trajectories for large-vocabulary speech recognition [O]. Vikramjit Mitra, Wen Wang, Andreas Stolcke, 2013
8. Recognizing Articulatory Gestures from Speech for Robust Speech Recognition [R]. C. Espy-Wilson, E. Saltzman, H. Nam, L. Goldstein, V. Mitra. 2012
