All-Path Decoding Algorithm for Segmental Based Speech Recognition

Abstract

Conventional speech processing adopts a dividable assumption: a speech utterance can be divided into nonoverlapping feature sequences, and each segment represents one acoustic event or label. The probability of a label sequence given an utterance is then approximated by the probability of the best segmentation of the utterance for that label sequence. In practice, however, the feature sequences of acoustic events may partially overlap, especially for neighboring phonemes within a syllable, and the best-segmentation approximation further reinforces the distortion introduced by the dividable assumption. In this paper, we propose an all-path decoding algorithm that fuses the information obtained from different segmentations (or paths) without adding noticeable computational load, thereby alleviating the weakness of the dividable assumption. Our experiments show that the new decoding algorithm effectively improves system performance in tasks with heavy insertion and deletion errors.
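The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea it contrasts: scoring a label sequence by its single best segmentation versus combining the scores of all segmentations. It is not the authors' implementation. The function names (`decode`, `toy_score`), the toy frame values, and the squared-error segment score are all assumptions made for illustration. Both variants share the same dynamic-programming table and differ only in the combining operator (`max` versus log-sum-exp), which is one way to see why summing over paths need not add much computation.

```python
import math

def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of log scores."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))

def decode(labels, num_frames, seg_log_score, combine):
    """Score a label sequence against num_frames frames.

    combine=max       -> score of the best segmentation only
    combine=logsumexp -> score summed over all segmentations
    seg_log_score(label, start, end) is a hypothetical segment scorer
    for frames [start, end).
    """
    n_labels = len(labels)
    # table[n][t]: score of the first n labels covering frames [0, t)
    table = [[-math.inf] * (num_frames + 1) for _ in range(n_labels + 1)]
    table[0][0] = 0.0
    for n in range(1, n_labels + 1):
        for t in range(n, num_frames + 1):
            # Try every start frame s for the n-th segment [s, t)
            candidates = [
                table[n - 1][s] + seg_log_score(labels[n - 1], s, t)
                for s in range(n - 1, t)
            ]
            table[n][t] = combine(candidates)
    return table[n_labels][num_frames]

# Toy data (assumed): one scalar feature per frame, one mean per label.
FRAMES = [0.1, 0.2, 0.8, 0.9]
MEANS = {"a": 0.0, "b": 1.0}

def toy_score(label, start, end):
    """Negative squared error of a segment's frames to the label mean."""
    return -sum((x - MEANS[label]) ** 2 for x in FRAMES[start:end])

best = decode(["a", "b"], len(FRAMES), toy_score, max)
all_paths = decode(["a", "b"], len(FRAMES), toy_score, logsumexp)
```

In this toy setup the all-path score is never lower than the best-path score, and it exceeds it by at most the log of the number of segmentations (three here), since it adds the contributions of the two non-optimal boundaries.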

Bibliographic record

Source
Chinese Spoken Language Processing; Lecture Notes in Artificial Intelligence; 4274 | 2006 | pp. 435-444 | 10 pages
Conference location
Singapore (SG)
Authors
Yun Tang; Wenju Liu; Bo Xu
Author affiliation
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
Original format
PDF
Language of text
English
Chinese Library Classification
Programming languages, algorithmic languages

Similar literature

1. Corrections to "Segmental minimum Bayes-risk decoding for automatic speech recognition" [J]. Goel V., Kumar S., Byrne W. IEEE Transactions on Audio, Speech and Language Processing. 2006, No. 1
2. Reducing computational load in segmental hidden Markov model decoding for speech recognition [J]. Russell M.J. Electronics Letters. 2005, No. 25
3. Segmental minimum Bayes-risk decoding for automatic speech recognition [J]. Goel V., Kumar S., Byrne W. IEEE Transactions on Speech and Audio Processing. 2004, No. 3
4. All-Path Decoding Algorithm for Segmental Based Speech Recognition [C]. Yun Tang, Wenju Liu, Bo Xu. International Symposium on Chinese Spoken Language Processing. 2006
5. Explicit N-best formant features for segment-based speech recognition [D]. Schmid, Philipp Heinz. 1996
6. Robust EEG-Based Decoding of Auditory Attention With High-RMS-Level Speech Segments in Noisy Conditions [O]. Lei Wang, Ed X. Wu, Fei Chen. 2020
7. All-Path Decoding Algorithm for Segmental based Speech Recognition [O]. Yun Tang, Wen-ju Liu, Bo Xu. 2015
8. Efficient A* Stack Decoder Algorithm for Continuous Speech Recognition with a Stochastic Language Model [R]. Paul, D. B. 1991
