首页> 外文会议>Chinese Spoken Language Processing; Lecture Notes in Artificial Intelligence; 4274 >Vietnamese Automatic Speech Recognition: The FLaVoR Approach

【24h】

Vietnamese Automatic Speech Recognition: The FLaVoR Approach

机译：越南语自动语音识别：FLaVoR方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic speech recognition for languages in Southeast Asia, including Chinese, Thai and Vietnamese, typically models both acoustics and languages at the syllable level. This paper presents a new approach for recognizing those languages by exploiting information at the word level. The new approach, adapted from our FLaVoR architecture[1], consists of two layers. In the first layer, a pure acoustic-phonemic search generates a dense phoneme network enriched with meta data. In the second layer, a word decoding is performed in the composition of a series of finite state transducers (FST), combining various knowledge sources across sub-lexical, word lexical and word-based language models. Experimental results on the Vietnamese Broadcast News corpus showed that our approach is both effective and flexible.

机译：东南亚语言（包括中文，泰语和越南语）的自动语音识别通常在音节级别对声学和语言进行建模。本文提出了一种通过在单词级别上利用信息来识别那些语言的新方法。根据我们的FLaVoR体系结构[1]改编的新方法包括两层。在第一层中，纯声学音素搜索将生成一个密集的音素网络，其中富含元数据。在第二层中，在一系列有限状态换能器（FST）的组合中执行单词解码，将跨子词法，词词法和基于词的语言模型的各种知识源进行组合。越南广播新闻语料库的实验结果表明，我们的方法既有效又灵活。

著录项

来源
《Chinese Spoken Language Processing; Lecture Notes in Artificial Intelligence; 4274 》|2006年|464-474|共11页
会议地点 Singapore(SG)
作者
Quan Vu; Kris Demuynck; Dirk Van Compernolle;
展开▼
作者单位

K.U.Leuven/ESAT/PSI Kasteelpaxk Arenberg 10, B-3001 Leuven, Belgium;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类程序语言、算法语言 ;
关键词
automatic speech recognition; finite state transducer; phoneme network; vietnamese;

机译：自动语音识别；有限状态传感器音素网络；越南语;

相似文献

外文文献
中文文献
专利

1. Bridging automatic speech recognition and psycholinguistics: Extending Shortlist to an end-to-end model of human speech recognition (L) [J] . Odette Scharenborg, Louis ten Bosch, Lou Boves, The Journal of the Acoustical Society of America . 2003 ,第6期

机译：桥接自动语音识别和心理语言学：将候选清单扩展到人类语音识别的端到端模型（L）
2. A Fast Adaptation Approach for Enhanced Automatic Recognition of Children's Speech with Mismatched Acoustic Models [J] . Shahnawazuddin S., Sinha Rohit Circuits, systems, and signal processing . 2018 ,第3期

机译：利用不匹配的声学模型增强儿童语音自动识别的快速自适应方法
3. A unified approach to transfer learning of deep neural networks with applications to speaker adaptation in automatic speech recognition [J] . Huang Zhen, Siniscalchi Sabato Marco, Lee Chin-Hui Neurocomputing . 2016 ,第DECa19期

机译：深度神经网络转移学习的统一方法及其在自动语音识别中的说话人自适应中的应用
4. Vietnamese Automatic Speech Recognition: The FLaVoR Approach [C] . Quan Vu, Kris Demuynck, Dirk Van Compernolle International Symposium on Chinese Spoken Language Processing . 2006

机译：越南自动演讲识别：味道方法
5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques. [D] . Singh, Amriteshwar. 2011

机译：一种使用光学字符识别（OCR）和自动语音识别（ASR）技术的自动邮政地址识别系统的多模式融合方法。
6. Brain-inspired speech segmentation for automatic speech recognition using the speech envelope as a temporal reference [O] . Byeongwook Lee, Kwang-Hyun Cho -1

机译：以语音包络作为时间参考的自动语音识别的大脑启发式语音分割
7. Vietnamese Automatic Speech Recognition: the FLaVoR Approach [O] . Quan Vu, Kris Demuynck, Dirk Van Compernolle 2014

机译：越南语自动语音识别：FLaVoR方法

Vietnamese Automatic Speech Recognition: The FLaVoR Approach

摘要

著录项

相似文献

相关主题

期刊订阅