首页> 外文会议>European Conference on Speech Communication and Technology v.2; 20010903-20010907; Aalborg; DK >Breadth-First Search for Finding the Optimal Phonetic Transcription from Multiple Utterances
【24h】

Breadth-First Search for Finding the Optimal Phonetic Transcription from Multiple Utterances

机译:广度优先搜索,从多种话语中找到最佳语音转录

获取原文
获取原文并翻译 | 示例

摘要

Extending the vocabulary of a large vocabulary speech recognition system usually requires phonetic transcriptions for all words to be known. With automatic phonetic baseform determination acoustic samples of the words in question can substitute for the required expert knowledge. In this paper we follow a probabilitistic approach to this problem and present a novel breadth-first search algorithm which takes full advantage of multiple samples. An extension to the algorithm to genereate phone graphs as well as an EM based iteration scheme for estimating stochastic pronunciation models is presented. In preliminary experiments phoneme error rates below 5% with respect to the standard pronunciation are achieved without language or word specific prior knowledge.
机译:扩展大型词汇语音识别系统的词汇通常要求所有已知单词的语音转录。通过自动语音基础形式确定,所讨论单词的声音样本可以替代所需的专家知识。在本文中,我们采用概率方法解决此问题,并提出了一种新颖的广度优先搜索算法,该算法充分利用了多个样本。提出了用于生成电话图的算法的扩展,以及基于EM的用于估计随机发音模型的迭代方案。在初步实验中,在没有语言或单词特定先验知识的情况下,相对于标准发音而言,音素错误率低于5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号