Finite-state transducer based phonology and morphology modeling with applications to Hungarian LVCSR

Mate Szarvas; Sadaoki Furui

首页> 外文期刊>電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication >Finite-state transducer based phonology and morphology modeling with applications to Hungarian LVCSR

【24h】

Finite-state transducer based phonology and morphology modeling with applications to Hungarian LVCSR

机译：基于有限状态换能器的语音和形态学建模及其在匈牙利LVCSR中的应用

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This article introduces a novel approach to model phonology and morphosyntax in morpheme unit based speech recognizers. The proposed method is evaluated in our recent Hungarian large vocabulary continuous speech recognition (LVCSR) system. The architecture of the recognition system is based on the weighted finite state transducer (WFST) paradigm. The task domain is the recognition of fluently read sentences selected from a major daily newspaper. The vocabulary units used in the system are morpheme based in order to provide sufficient coverage of the large number of word-forms resulting from affixation and compounding. Besides the basic pronunciation model and the morpheme N-gram language model we evaluate a novel phonology model and the novel stochastic morphosyntactic language model (SMLM). Thanks to the flexible transducer-based architecture of the system these new components are integrated seamlessly with the basic modules with no need to modify the decoder itself. The proposed phonology model reduced the error rate by 8.32% and the stochastic morphosyntactic language model decreased the error rate by 17.9% relatively compared to the baseline systems. The morpheme error rate of the best configuration is 14.75% in a 1350 morpheme Hungarian dictation task.

机译：本文介绍了一种新的方法，用于在基于词素单元的语音识别器中对语音和词法语法进行建模。在我们最近的匈牙利大词汇量连续语音识别（LVCSR）系统中对提出的方法进行了评估。识别系统的体系结构基于加权有限状态传感器（WFST）范例。任务领域是识别从主要日报中流利阅读的句子。系统中使用的词汇单位是基于词素的，以便充分覆盖由于粘贴和复合而产生的大量单词形式。除了基本的发音模型和语素N-gram语言模型外，我们还评估了一种新颖的语音模型和新颖的随机形句法语言模型（SMLM）。由于系统采用了基于传感器的灵活架构，因此这些新组件可以与基本模块无缝集成，而无需修改解码器本身。相对于基准系统，所提出的语音模型将错误率降低了8.32％，而随机形态句法语言模型将错误率降低了17.9％。在1350个匈牙利语的听写任务中，最佳配置的语素错误率是14.75％。

著录项

来源
《電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication》 |2002年第527期|共6页
作者
Mate Szarvas; Sadaoki Furui;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 jpn
中图分类通信;
关键词
Phonology modeling; Language modeling; Morphology modeling; Finite state transducer; Speech recognition; Hungarian;

机译：语音建模;语言建模;形态学建模;有限状态传感器;语音识别;匈牙利语;

相似文献

外文文献
中文文献
专利

1. Finite-state transducer based phonology and morphology modeling with applications to Hungarian LVCSR [J] . Mate Szarvas, Sadaoki Furui 電子情報通信学会技術研究報告. 音声. Speech . 2002,第529期

机译：基于有限状态换能器的语音和形态学建模及其在匈牙利LVCSR中的应用
2. Finite-state transducer based phonology and morphology modeling with applications to Hungarian LVCSR [J] . Mate Szarvas, Sadaoki Furui 電子情報通信学会技術研究報告. 音声. Speech . 2002,第529期

机译：基于有限状态传感器的音韵和形态学与匈牙利LVCSR的应用
3. Finite-state transducer based phonology and morphology modeling with applications to Hungarian LVCSR [J] . Mate Szarvas, Sadaoki Furui 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2002,第527期

机译：基于有限状态传感器的音韵和形态学与匈牙利LVCSR的应用
4. FINITE-STATE TRANSDUCER BASED MODELING OF MORPHOSYNTAX WITH APPLICATIONS TO HUNGARIAN LVCSR [C] . IEEE IEEE International Conference on Acoustics, Speech, and Signal Processing . 2003

机译：基于有限状态传感器的Morphosyntax模型与匈牙利LVCSR的应用
5. Validation of an Image-based Subject-specific Dynamic Model of the Ankle Joint Complex and Its Applications to the Study of the Effect of Articular Surface Morphology on Ankle Joint Mechanics [D] . Balakrishnan, Vishnuvardhan. 2018

机译：踝关节复合体的基于图像的对象特异性动态模型及其应用于踝关节力学效果研究的应用
6. Learning stochastic finite-state transducer to predict individual patient outcomes [O] . Patricia Ordoñez, Nelson Schwarz, Adnel Figueroa-Jiménez, -1

机译：学习随机有限状态换能器以预测单个患者的结果
7. Finite-state transducer based modeling of morphosyntax with applications to Hungarian LVCSR [O] . M. Szarvas, S. Furui -1

机译：基于有限状态传感器的Morphosyntax模型与匈牙利LVCSR的应用
8. Current Pressure Transducer Application of Model-based Prognostics Using Steady State Conditions. [R] . Teubert, C., Daigle, M. J. 2014

机译：基于稳态条件的基于模型的预测的当前压力传感器应用。

Finite-state transducer based phonology and morphology modeling with applications to Hungarian LVCSR

摘要

著录项

相似文献

相关主题

期刊订阅