Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer Workshop


Abstract

We report on investigations, conducted at the 2006 Johns Hopkins Workshop, into the use of articulatory features (AFs) for observation and pronunciation models in speech recognition. In the area of observation modeling, we use the outputs of AF classifiers both directly, in an extension of hybrid HMM/neural network models, and as part of the observation vector, an extension of the "tandem" approach. In the area of pronunciation modeling, we investigate a model having multiple streams of AF states with soft synchrony constraints, for both audio-only and audio-visual recognition. The models are implemented as dynamic Bayesian networks, and tested on tasks from the small-vocabulary Switchboard (SVitchboard) corpus and the CUAVE audio-visual digits corpus. Finally, we analyze AF classification and forced alignment using a newly collected set of feature-level manual transcriptions.
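The "tandem" observation model mentioned above appends (normalized log) AF-classifier posteriors to the standard acoustic feature vector, so the HMM observes both streams jointly. The following is a minimal sketch of that idea, assuming frame-synchronous acoustic features (e.g. MFCCs) and per-frame AF posteriors are already available; the function name and normalization details are illustrative assumptions, not the workshop's exact recipe.

```python
import numpy as np

def tandem_features(acoustic, af_posteriors, eps=1e-8):
    """Build tandem observation vectors.

    acoustic:      (T, D) array of per-frame acoustic features
    af_posteriors: (T, K) array of per-frame AF-classifier posteriors
    Returns a (T, D + K) array: acoustic features with normalized
    log-posteriors appended.
    """
    # Log-compress posteriors (eps guards against log(0)).
    log_post = np.log(af_posteriors + eps)
    # Per-dimension mean/variance normalization of the appended stream.
    log_post = (log_post - log_post.mean(axis=0)) / (log_post.std(axis=0) + eps)
    return np.concatenate([acoustic, log_post], axis=1)
```

In practice, tandem systems often also decorrelate and reduce the appended stream (e.g. with PCA) before Gaussian-mixture modeling; that step is omitted here for brevity.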
