Advances in Acoustic Modeling for Vietnamese LVCSR

Abstract

In this paper, we present our experiments on selecting basic phonetic units for Vietnamese large vocabulary continuous speech recognition (LVCSR). Two acoustic models were compared. The first model uses only vowels, or monophthongs, as phonemes [2], while the second, proposed in this paper, also uses diphthongs and triphthongs as phonemes. Both models were trained and evaluated on a Broadcast News corpus containing 27 hours of acoustic training data and 1 hour of acoustic test data. In addition, a 146M-word collection of newspaper text was used to build the language models. Experimental results indicate significant improvements in both word accuracy and execution time. With the second acoustic model, word accuracy reaches 86.06% in the best case, and decoding runs faster than real time.
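
The abstract reports two metrics: word accuracy and execution time relative to real time. The abstract does not state how either was scored, so the following Python sketch only illustrates the commonly used definitions: word accuracy as 1 - (S + D + I) / N, computed from a word-level edit distance, and the real-time factor as decoding time divided by audio duration. The function names `word_accuracy` and `real_time_factor` are hypothetical and chosen for this illustration.

```python
def word_accuracy(reference: str, hypothesis: str) -> float:
    """Word accuracy = 1 - (S + D + I) / N, where N is the number of
    reference words and S + D + I is the word-level edit distance.
    Assumption: whitespace tokenization; the paper's scoring tool is unknown."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between ref[:i-1] and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing in hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return 1.0 - prev[-1] / len(ref)


def real_time_factor(decode_seconds: float, audio_seconds: float) -> float:
    """Decoding time divided by audio duration; below 1.0 means the
    recognizer runs faster than real time."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return decode_seconds / audio_seconds
```

For example, a hypothesis with one substituted word against a four-word reference gives `word_accuracy("a b c d", "a x c d") == 0.75`, and decoding a one-hour test set in 30 minutes gives a real-time factor of 0.5.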

Bibliographic details

Source
Asian Language Processing, 2009. IALP '09 | 2009 | pp. 280-284 | 5 pages
Conference location: Singapore (SG)
Authors
Nguyen Tuan; Vu Quan
Original format: PDF
Keywords
Vietnamese; acoustic models; speech recognition

