Finite-state transducer based phonology and morphology modeling with applications to Hungarian LVCSR

Mate Szarvas; Sadaoki Furui

Abstract

This article introduces a novel approach to modeling phonology and morphosyntax in morpheme-based speech recognizers. The proposed method is evaluated in our recent Hungarian large vocabulary continuous speech recognition (LVCSR) system, whose architecture is based on the weighted finite-state transducer (WFST) paradigm. The task domain is the recognition of fluently read sentences selected from a major daily newspaper. The vocabulary units used in the system are morphemes, in order to provide sufficient coverage of the large number of word forms that result from affixation and compounding. Besides the basic pronunciation model and the morpheme N-gram language model, we evaluate a novel phonology model and a novel stochastic morphosyntactic language model (SMLM). Thanks to the flexible transducer-based architecture of the system, these new components are integrated seamlessly with the basic modules, with no need to modify the decoder itself. The proposed phonology model reduced the error rate by 8.32%, and the stochastic morphosyntactic language model reduced it by 17.9% in relative terms, compared to the baseline systems. The morpheme error rate of the best configuration is 14.75% in a 1350 morpheme Hungarian dictation task.
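The abstract's point that new models can be added without touching the decoder follows from transducer composition: a phonology transducer is composed with the pronunciation transducer, and the same search routine runs over the result. The following is a minimal sketch of that idea in plain Python, not the authors' implementation. All names (`compose`, `best_output`, `lexicon`, `rule`) are hypothetical, the transducers are epsilon-free toys, weights are treated as costs that add along a path, and the rule "p becomes b before d" is an invented placeholder rather than a claim about Hungarian phonology.

```python
# Toy weighted transducers. Each arc is (src, in_sym, out_sym, cost, dst).
# Costs add along a path and the cheapest accepting path wins.

def compose(t1, t2):
    """Epsilon-free composition: pair arcs where t1's output equals t2's input."""
    arcs = []
    for (p1, a, b, w1, q1) in t1["arcs"]:
        for (p2, b2, c, w2, q2) in t2["arcs"]:
            if b == b2:
                arcs.append(((p1, p2), a, c, w1 + w2, (q1, q2)))
    return {
        "start": (t1["start"], t2["start"]),
        "finals": {(f1, f2) for f1 in t1["finals"] for f2 in t2["finals"]},
        "arcs": arcs,
    }

def best_output(t, inputs):
    """Return (cost, outputs) for the cheapest accepting path, or None.

    This plays the role of the decoder: it is the same function whether
    t is a single transducer or a composed cascade.
    """
    frontier = {t["start"]: (0.0, [])}  # state -> best (cost, outputs) so far
    for sym in inputs:
        nxt = {}
        for (src, a, b, w, dst) in t["arcs"]:
            if a == sym and src in frontier:
                cost, out = frontier[src]
                cand = (cost + w, out + [b])
                if dst not in nxt or cand[0] < nxt[dst][0]:
                    nxt[dst] = cand
        frontier = nxt
    accepted = [v for s, v in frontier.items() if s in t["finals"]]
    return min(accepted, key=lambda v: v[0]) if accepted else None

# Hypothetical pronunciation model: one letter maps to one phoneme symbol.
lexicon = {
    "start": 0,
    "finals": {0},
    "arcs": [(0, "A", "a", 0.0, 0), (0, "P", "p", 0.0, 0), (0, "D", "d", 0.0, 0)],
}

# Hypothetical phonology rule: "p" is realized as "b" before "d".
# Keeping "p" before "d" stays possible but costs 2.0.
rule = {
    "start": "free",
    "finals": {"free", "after_p"},
    "arcs": [
        ("free", "a", "a", 0.0, "free"),
        ("free", "d", "d", 0.0, "free"),
        ("free", "p", "b", 0.0, "need_d"),    # rewrite, so a "d" must follow
        ("free", "p", "p", 0.0, "after_p"),   # keep "p" unchanged
        ("need_d", "d", "d", 0.0, "free"),
        ("after_p", "a", "a", 0.0, "free"),
        ("after_p", "d", "d", 2.0, "free"),   # penalized unassimilated form
        ("after_p", "p", "p", 0.0, "after_p"),
        ("after_p", "p", "b", 0.0, "need_d"),
    ],
}

cascade = compose(lexicon, rule)
print(best_output(lexicon, ["A", "P", "D", "A"]))  # pronunciation model only
print(best_output(cascade, ["A", "P", "D", "A"]))  # with the phonology rule
```

Only the transducer passed to `best_output` changes between the two calls; the search code is identical. A real system would rely on a WFST toolkit with epsilon handling, determinization and minimization, which this sketch omits.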

Bibliographic details

Source
《電子情報通信学会技術研究報告. 音声. Speech》 | 2002, No. 529 | pp. 141-146 | 6 pages
Authors
Mate Szarvas; Sadaoki Furui
Indexing information
Original format: PDF
Text language: Japanese
Chinese Library Classification: Telegraphy, facsimile
Keywords
Phonology modeling; Language modeling; Morphology modeling; Finite state transducer; Speech recognition; Hungarian
