首页> 外文会议>IEEE Workshop on Spoken Language Technology >Syllable based keyword search: Transducing syllable lattices to word lattices
【24h】

Syllable based keyword search: Transducing syllable lattices to word lattices

机译:基于音节的关键字搜索:将音节格转换为单词格

获取原文

摘要

This paper presents a weighted finite state transducer (WFST) based syllable decoding and transduction framework for keyword search (KWS). Acoustic context dependent phone models are trained from word forced alignments. Then syllable decoding is done with lattices generated using a syllable lexicon and language model (LM). To process out-of-vocabulary (OOV) keywords, pronunciations are produced using a grapheme-to-syllable (G2S) system. A syllable to word lexical transducer containing both in-vocabulary (IV) and OOV keywords is then constructed and composed with a keyword-boosted LM transducer. The composed transducer is then used to transduce syllable lattices to word lattices for final KWS. We show that our method can effectively perform KWS on both IV and OOV keywords, and yields up to 0.03 Actual Term-Weighted Value (ATWV) improvement over searching keywords directly in subword lattices. Word Error Rates (WER) and KWS results are reported for three different languages.
机译:本文提出了一种基于加权有限状态变换器(WFST)的关键字搜索音节解码和转导框架(KWS)。语音上下文相关的电话模型是通过单词强制对齐进行训练的。然后,使用使用音节词典和语言模型(LM)生成的格完成音节解码。为了处理语音不佳(OOV)关键字,使用音素到音节(G2S)系统产生发音。然后构造一个包含语音中(IV)和OOV关键字的音节到词的词法转换器,并用关键字增强的LM转换器组成。然后,将组合的换能器用于将音节格转换为最终KWS的单词格。我们证明了我们的方法可以有效地对IV和OOV关键字执行KWS,与直接在子词晶格中搜索关键字相比,该方法可以提高高达0.03的实际术语加权值(ATWV)。报告三种不同语言的单词错误率(WER)和KWS结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号