Efficient search strategy in large vocbulary continuous speech recognition using prosodic boundary information

机译：韵律边界信息在大词汇连续语音识别中的有效搜索策略

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Prosodic-syntactic boundary as an information source can be used to improve the performance of Large Vocabulary Continuous Speech Recognition (LVCSR) in both efficiency and accuracy. This paper presents a study of two effective methods to explit prosodic boundary information in a multi-pass decoder. In this paper, we address the effect of a language model on setting pruning beam width and how to control the Cross-word Context Dependent (CCD) models by prosodic boundary information. In the first pass decoding, dynamci beam search strategy regarding inner-word and cross-word paths is proposed to reduce search space efficiently, and then cross-word context dependent models are optimized using prosodic boundary information in the second pass decoding. The recognition experiments, which were carried out on the Japanese Newspaper Article Sentences (JNAS) 20k word task using a multi-pass decoder, demonstrated that the proposed method led to significant reduction in the search space with accuracy improvement.

机译：韵律句法边界作为信息源可用于提高大词汇量连续语音识别（LVCSR）的效率和准确性。本文提出了两种有效的方法来研究多通道解码器中的韵律边界信息。在本文中，我们解决了语言模型对设置修剪波束宽度的影响，以及如何通过韵律边界信息控制跨字上下文相关（CCD）模型。在第一遍解码中，提出了一种针对内单词和跨单词路径的动态波束搜索策略，以有效地减少搜索空间，然后在第二遍解码中使用韵律边界信息对跨单词上下文相关的模型进行优化。使用多遍解码器对日本报纸文章句子（JNAS）20k单词任务进行的识别实验表明，该方法可显着减少搜索空间，并提高准确性。

著录项

来源
《6th International Conference on Spoken Language Processing ICSLP 2000 Oct.16.-Oct.20 2000 Beijing International Convention Center,Beijing, China》|2000年|p.274-277|共4页
会议地点
作者
Shi-wook Lee; Keikichi Hirose; Nobuaki Minematsu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类世界各国文化与文化事业;
关键词

相似文献

外文文献
中文文献
专利

1. An efficient search space representation for large vocabulary continuous speech recognition [J] . Kris DemuynckpJacques Duchateau, Dirk Van Compernolle 20f Speech Communication . 2000,第1期

机译：用于大词汇量连续语音识别的有效搜索空间表示
2. A New Approach of Parsing and Search Based on the Divide and Conquer Strategy for Continuous Speech Recognition [J] . Ming-Sheng WANG, Satoshi IMAI IEICE Transactions on Information and Systems . 1995,第4期

机译：基于分而治之策略的连续语音识别解析与搜索新方法
3. Prosodic word boundary detection from Bengali continuous speech [J] . Tanmay Bhowmik, Shyamal Kumar Das Mandal Language Resources and Evaluation . 2020,第3期

机译：孟加拉连续演讲的韵律词界探测
4. Efficient search strategy in large vocbulary continuous speech recognition using prosodic boundary information [C] . Shi-wook Lee, Keikichi Hirose, Nobuaki Minematsu International conference on spoken language processing . 2000

机译：使用博物馆边界信息的大型遗物连续语音识别中的高效搜索策略
5. Parallel Viterbi search for continuous speech recognition on a multi-core architecture [D] . Parihar, Naveen 2009

机译：并行Viterbi搜索可在多核体系结构上进行连续语音识别
6. Gestural coordination at prosodic boundaries and its role for prosodic structure and speech planning processes [O] . Jelena Krivokapić 2014

机译：韵律边界的手势协调及其在韵律结构和语音计划过程中的作用
7. Detection of prosodic word boundaries by statistical modeling of mora transitions of fundamental frequency contours and its use for continuous speech recognition [O] . Keikichi Hirose, Koji Iwano 2000

机译：通过基频轮廓等值线的跃迁统计模型检测韵律词边界，并将其用于连续语音识别

Efficient search strategy in large vocbulary continuous speech recognition using prosodic boundary information

摘要

著录项

相似文献

相关主题

期刊订阅