Context dependent phonetic string edit distance for automatic speech recognition


Abstract

An automatic speech recognition system searches for the word transcription with the highest overall score for a given acoustic observation sequence. This overall score is typically a weighted combination of a language model score and an acoustic model score. We propose including a third score, which measures the similarity of the word transcription's pronunciation to the output of a less constrained phonetic recognizer. We show how this phonetic string edit distance can be learned from data, and that including context in the model is essential for good performance. We demonstrate improved accuracy on a business search task.
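The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes phones are plain strings, that edit costs come from a hand-written function rather than being learned from data as in the paper, and that all scores are log-domain values where higher is better. The names `phonetic_edit_distance`, `uniform_cost`, `flap_cost`, `combined_score`, `lm_weight`, and `ped_weight` are hypothetical.

```python
from typing import Callable, Sequence

# Hypothetical cost signature: (operation, expected phone, observed phone,
# left context) -> non-negative cost. "Left context" here means the most
# recently consumed phone of the expected pronunciation ("<s>" at the start).
CostFn = Callable[[str, str, str, str], float]


def uniform_cost(op: str, expected: str, observed: str, left_context: str) -> float:
    """Context-independent baseline: a match costs 0, any edit costs 1."""
    return 0.0 if op == "sub" and expected == observed else 1.0


def flap_cost(op: str, expected: str, observed: str, left_context: str) -> float:
    """Toy context-dependent cost (an assumption, not learned from data):
    hearing "d" for an expected "t" is cheap right after a vowel."""
    if op == "sub" and (expected, observed) == ("t", "d") and left_context in {"ae", "ah"}:
        return 0.25
    return uniform_cost(op, expected, observed, left_context)


def phonetic_edit_distance(expected: Sequence[str],
                           observed: Sequence[str],
                           cost: CostFn = uniform_cost) -> float:
    """Weighted edit distance between a word transcription's pronunciation
    (expected) and a phonetic recognizer's output (observed)."""
    n, m = len(expected), len(observed)
    # d[i][j] = cheapest alignment of expected[:i] with observed[:j]
    d = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        left = expected[i - 2] if i > 1 else "<s>"
        d[i][0] = d[i - 1][0] + cost("del", expected[i - 1], "", left)
    for j in range(1, m + 1):
        d[0][j] = d[0][j - 1] + cost("ins", "", observed[j - 1], "<s>")
    for i in range(1, n + 1):
        left = expected[i - 2] if i > 1 else "<s>"
        for j in range(1, m + 1):
            d[i][j] = min(
                d[i - 1][j] + cost("del", expected[i - 1], "", left),
                d[i][j - 1] + cost("ins", "", observed[j - 1], expected[i - 1]),
                d[i - 1][j - 1] + cost("sub", expected[i - 1], observed[j - 1], left),
            )
    return d[n][m]


def combined_score(acoustic: float, language: float, distance: float,
                   lm_weight: float = 1.0, ped_weight: float = 1.0) -> float:
    """Weighted combination of the three scores; the distance is a penalty."""
    return acoustic + lm_weight * language - ped_weight * distance
```

With `flap_cost`, aligning the expected phones `["b", "ah", "t", "er"]` against the observed `["b", "ah", "d", "er"]` is penalized less than the same substitution after a consonant, which is the kind of context sensitivity the abstract argues for. In the paper the costs are learned from data; the hand-set values above only stand in for such a learned model.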


Bibliographic record

Source
IEEE International Conference on Acoustics Speech and Signal; ICASSP 2010 | 2010 | pp. 4358-4361 | 4 pages
Conference location: Dallas, TX (US)
Authors
Droppo, Jasha; Acero, Alex
Author affiliation

Speech Technology Group, Microsoft Research, Redmond, Washington, USA

Original format: PDF
Keywords
acoustic modeling; speech recognition; string edit distance;


Similar literature

1. Validation of phonetic transcriptions in the context of automatic speech recognition [J]. Christophe Van Bael, Henk van den Heuvel, Helmer Strik. Language Resources and Evaluation. 2007, No. 2
2. Arabic Disordered Speech Phonetic Dictionary Generator for Automatic Speech Recognition [J]. Assal A. M. Alqudah, Mohammad A. M. Alshraideh, Ahmad A. S. Sharieh. Journal of Theoretical and Applied Information Technology. 2020, No. 4
3. Arabic Speaker-Independent Continuous Automatic Speech Recognition Based on a Phonetically Rich and Balanced Speech Corpus [J]. Mohammad Abushariah, Raja Ainon, Roziati Zainuddin. The International Arab Journal of Information Technology. 2012, No. 1
4. Speech recognition using sub-word units dependent on phonetic contexts of both training and recognition vocabularies [C]. Hattori, H., Yamada, . 1996
5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques [D]. Singh, Amriteshwar. 2011
6. Automatically Detecting Likely Edits in Clinical Notes Created Using Automatic Speech Recognition [O]. Kevin Lybarger, Mari Ostendorf, Meliha Yetisgen. 2017
7. Speech Recognition Using Sub-Word Units Dependent On Phonetic Contexts Of Both Training And Recognition Vocabularies [O]. Hiroaki Hattori, Eiko Yamada. 2007
8. Software Package for Speaker Independent or Dependent Speech Recognition Using Standard Objects for Phonetic Speech Recognition [R]. Pfister, M. 1998
