...
【24h】

Finite-state transducer based phonology and morphology modeling with applications to Hungarian LVCSR

机译:基于有限状态传感器的音韵和形态学与匈牙利LVCSR的应用

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This article introduces a novel approach to model phonology and morphosyntax in morpheme unit based speech recognizers. The proposed method is evaluated in our recent Hungarian large vocabulary continuous speech recognition (LVCSR) system. The architecture of the recognition system is based on the weighted finite state transducer (WFST) paradigm. The task domain is the recognition of fluently read sentences selected from a major daily newspaper. The vocabulary units used in the system are morpheme based in order to provide sufficient coverage of the large number of word-forms resulting from affixation and compounding. Besides the basic pronunciation model and the morpheme N-gram language model we evaluate a novel phonology model and the novel stochastic morphosyntactic language model (SMLM). Thanks to the flexible transducer-based architecture of the system these new components are integrated seamlessly with the basic modules with no need to modify the decoder itself. The proposed phonology model reduced the error rate by 8.32% and the stochastic morphosyntactic language model decreased the error rate by 17.9% relatively compared to the baseline systems. The morpheme error rate of the best configuration is 14.75% in a 1350 morpheme Hungarian dictation task.
机译:本文介绍了词素基于单元的语音识别器的新方法,以模型音韵学和形态句法。所提出的方法,我们在最近的匈牙利大词汇量连续语音识别(LVCSR)系统进行评估。识别系统的体系结构是基于加权有限状态变换器(WFST)范例。任务域是从主要日报选定读起来顺口的句子的识别。在系统中使用的词汇单元词素基于以提供大量的附加从和配混所得字形式的足够的覆盖。除了基本的语音模型和语素的N-gram语言模型中,我们评价一个新的音韵模型和新的随机形态句法语言模型(SMLM)。由于该系统中,这些新的组件与具有无需修改解码器本身的基本模块无缝集成的柔性基于换能器的体系结构。所提出的音韵模型由8.32%降低了错误率和随机形态句法语言模型相对比基线系统由17.9%降低错误率。最佳配置的语素误差率在1350语素匈牙利听写任务14.75%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号