首页> 外文会议>IEEE International Conference on Acoustics Speech and Signal;ICASSP 2010 >Context dependent phonetic string edit distance for automatic speech recognition
【24h】

Context dependent phonetic string edit distance for automatic speech recognition

机译:上下文相关的语音字符串编辑距离,用于自动语音识别

获取原文

摘要

An automatic speech recognition system searches for the word transcription with the highest overall score for a given acoustic observation sequence. This overall score is typically a weighted combination of a language model score and an acoustic model score. We propose including a third score, which measures the similarity of the word transcription's pronunciation to the output of a less constrained phonetic recognizer. We show how this phonetic string edit distance can be learned from data, and that including context in the model is essential for good performance. We demonstrate improved accuracy on a business search task.
机译:自动语音识别系统针对给定的声音观察序列搜索具有最高总分的单词转录。该总体分数通常是语言模型分数和声学模型分数的加权组合。我们建议包括第三个分数,该分数用于测量单词转录的发音与较少受约束的语音识别器输出的相似性。我们展示了如何从数据中学习该语音字符串编辑距离,以及在模型中包括上下文对于取得良好性能至关重要。我们证明了业务搜索任务的准确性得到提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号