首页> 外文会议>IEEE Workshop on Spoken Language Technology >Syllable based keyword search: Transducing syllable lattices to word lattices
【24h】

Syllable based keyword search: Transducing syllable lattices to word lattices

机译:基于音节的关键字搜索:将音节格式转换为单词格子

获取原文

摘要

This paper presents a weighted finite state transducer (WFST) based syllable decoding and transduction framework for keyword search (KWS). Acoustic context dependent phone models are trained from word forced alignments. Then syllable decoding is done with lattices generated using a syllable lexicon and language model (LM). To process out-of-vocabulary (OOV) keywords, pronunciations are produced using a grapheme-to-syllable (G2S) system. A syllable to word lexical transducer containing both in-vocabulary (IV) and OOV keywords is then constructed and composed with a keyword-boosted LM transducer. The composed transducer is then used to transduce syllable lattices to word lattices for final KWS. We show that our method can effectively perform KWS on both IV and OOV keywords, and yields up to 0.03 Actual Term-Weighted Value (ATWV) improvement over searching keywords directly in subword lattices. Word Error Rates (WER) and KWS results are reported for three different languages.
机译:本文介绍了用于关键字搜索(KWS)的加权有限状态传感器(WFST)的音节解码和转换框架。声学上下文相关的电话模型从强制对齐方式培训。然后使用使用音节词典和语言模型(LM)生成的格子完成音节解码。要处理词汇外(OOV)关键字,请使用GraphEme-to-symllable(G2S)系统生成发音。然后将包含词汇表(IV)和OOV关键词的单词词汇传感器的音节构成并用关键字升压的LM换能器组成。然后,组成的传感器用于将音节格子转换为最终KWS的字格。我们表明我们的方法可以在IV和OOV关键字中有效地执行KWS,并在下字形格子中直接在搜索关键字上产生高达0.03个实际的术语加权值(ATWV)改进。报告了三种不同语言的字错误率(WER)和KWS结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号