Compound decomposition in Dutch large vocabulary speech recognition

机译：荷兰大词汇语音识别中的复合分解

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper addresses compound splitting for Dutch in the context of broadcast news transcription. Language models were created using original text versions and text versions that were decomposed using a data-driven compound splitting algorithm. Language model performances were compared in terms of out-of-vocabulary rates and word error rates in a real-world broadcast news transcription task. It was concluded that compound splitting does improve ASR performance. Best results were obtained when frequent compounds were not decomposed.

机译：本文针对广播新闻转录中荷兰语的复合拆分问题。语言模型是使用原始文本版本创建的，而文本版本是使用数据驱动的复合拆分算法分解的文本版本。在实际广播新闻转录任务中，根据语音输出率和单词错误率对语言模型的性能进行了比较。结论是，化合物分裂确实改善了ASR性能。当常用化合物不分解时，可获得最佳结果。

著录项

来源
《European Conference on Speech Communication and Technology - EUROSPEECH 2003(INTERSPEECH 2003) vol.1; 20030901-04; Geneva(CH)》|2003年|P.225-228|共4页
会议地点 Geneva(CH)
作者
Roeland Ordelman; Arjan van Hessen; Franciska de Jong;
展开▼
作者单位

Department of Computer Science, University of Twente, The Netherlands;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动信息理论;
关键词
入库时间 2022-08-26 13:48:54

相似文献

外文文献
中文文献
专利

1. An improved two-stage mixed language model approach for handling out-of-vocabulary words in large vocabulary continuous speech recognition [J] . Bert Reveil, Kris Demuynck, Jean-Pierre Martens Computer speech and language . 2014,第1期

机译：一种改进的两阶段混合语言模型方法，用于处理大词汇量连续语音识别中的词汇外单词
2. Effect of Vocabulary Extension using Word Sequence Concatenation for Large Vocabulary Continuous Speech Recognition [J] . YOSUKE WADA, NORIHIKO KOBAYASHI, YUICHIRO NAKANO 情報処理学会論文誌 . 1999,第4期

机译：单词序列级联对词汇扩展对大词汇量连续语音识别的影响
3. Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech [J] . Krerksak Likitsupin, Proadpran Punyabukkana, Chai Wutiwiwatchai, Engineering journal . 2016,第2期

机译：改进大词汇量连续语音基于片段的语音识别的声学方法
4. Compound decomposition in Dutch large vocabulary speech recognition [C] . Roeland Ordelman, Arjan van Hessen, Franciska de Jong, European Conference on Speech Communication and Technology . 2003

机译：复合分解在荷兰大词汇语音识别
5. Vocabulary Acquisition Using Mnemonics and Speech Recognition [D] . Keshoju, Apurva. 2019

机译：使用助记符和语音识别的词汇习得
6. Deep Spiking Neural Networks for Large Vocabulary Automatic Speech Recognition [O] . Jibin Wu, Emre Yılmaz, Malu Zhang, 2020

机译：大型词汇自动语音识别深尖峰神经网络
7. N-Best 2008: A Benchmark Evaluation for Large Vocabulary Speech Recognition in Dutch [O] . 2013

机译：N-Best 2008：荷兰大型词汇语音识别的基准评估
8. Vocabulary and Environment Adaptation in Vocabulary-Independent Speech Recognition. [R] . Hon, H., Lee, K. 1992

机译：词汇独立语音识别中的词汇与环境适应。

Compound decomposition in Dutch large vocabulary speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅