首页> 外文会议> >A semi-continuous stochastic trajectory model for phoneme-based continuous speech recognition

【24h】

A semi-continuous stochastic trajectory model for phoneme-based continuous speech recognition

机译：基于音素的连续语音识别的半连续随机轨迹模型

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We propose a model of phoneme-based speech unit, called semi-continuous stochastic trajectory model (SC-STM), which generalizes our stochastic trajectory models (STM). As STMs, the SC-STMs focus on the modeling of speech segments (called trajectories) in their parameter space, and can therefore handle segmental information, which is critical for large vocabulary continuous speech recognition. Compared to the STMs, the SC-STMs improve the resolution of the trajectory modeling, while keeping a moderate number of free parameters by sharing state probability density functions. The SC-STM can therefore maintain a good trade-off between detailed acoustic modeling and limited training data. We tested the idea on a 2010 words, speaker-dependent, continuous speech database. Preliminary results show that SC-STM gives a word accuracy close to that of STM, without using heuristic techniques that enhanced STM.

机译：我们提出了一种基于音素的语音单元模型，称为半连续随机轨迹模型（SC-STM），该模型概括了我们的随机轨迹模型（STM）。作为STM，SC-STM专注于在其参数空间中对语音片段（称为轨迹）进行建模，因此可以处理片段信息，这对于大词汇量连续语音识别至关重要。与STM相比，SC-STM改进了轨迹建模的分辨率，同时通过共享状态概率密度函数保持适量的自由参数。因此，SC-STM可以在详细的声学模型和有限的训练数据之间保持良好的权衡。我们在2010单词，与说话者相关的连续语音数据库中测试了该想法。初步结果表明，在不使用增强STM的启发式技术的情况下，SC-STM的字准确度接近STM。

著录项

来源
《》|1996年|P.471-474|共4页
会议地点
作者
Siohan; O.; Yifan Gong;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. A stochastic segment model for phoneme-based continuous speech recognition [J] . Ostendorf M., Roukos S. IEEE Transactions on Acoustics, Speech, and Signal Processing . 1989,第12期

机译：基于音素的连续语音识别的随机段模型
2. Stochastic trajectory modeling and sentence searching for continuous speech recognition [J] . Yifan Gong IEEE Transactions on Speech and Audio Proceeding . 1997,第1期

机译：随机轨迹建模和句子搜索以实现连续语音识别
3. Context-dependent Syllable Modeling of Sentence-based Semi-continuous Speech Recognition for the Tamil Language [J] . Ibralebbe Mohamed Kalith, David Asirvatham, Ismail Raisal Information Technology Journal . 2017,第3期

机译：基于句子的泰米尔语语言半连续语音识别的上下文依赖音节建模
4. A semi-continuous stochastic trajectory model for phoneme-based continuous speech recognition [C] . Siohan O., Yifan Gong, Institute of Electric and Electronic Engineer IEEE International Conference on Acoustics, Speech, and Signal Processing . 1996

机译：基于音素的连续语音识别的半连续随机轨迹模型
5. Integrate template matching and statistical modeling for continuous speech recognition. [D] . Sun, Xie. 2011

机译：集成模板匹配和统计建模，可进行连续语音识别。
6. Improved model adaptation approach for recognition of reduced-frame-rate continuous speech [O] . Lee-Min Lee, Hoang-Hiep Le, Fu-Rong Jean -1

机译：用于识别降低帧率的连续语音的改进模型自适应方法
7. Reduced Semi-continuous Models for Large Vocabulary Continuous Speech Recognition in Dutch [O] . K. Demuynck, J. Duchateau, D. Van Compernolle 1996

机译：荷兰语大词汇量连续语音识别的半连续模型
8. Efficient A* Stack Decoder Algorithm for Continuous Speech Recognition with a Stochastic Language Model. [R] . Paul, D. B. 1991

机译：用随机语言模型进行连续语音识别的高效a *堆栈译码算法。

A semi-continuous stochastic trajectory model for phoneme-based continuous speech recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅