Recent Evaluations of a WFST-Based Speech Recognition Decoder

Paul R. DIXON; Josef R. NOVAK; Tasuku OONISHI; Sadaoki FURUI

首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Recent Evaluations of a WFST-Based Speech Recognition Decoder

【24h】

Recent Evaluations of a WFST-Based Speech Recognition Decoder

机译：基于WFST的语音识别解码器的最新评估

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes the latest performance evaluations on the Tokyo Tech Transducer-based (T~3) speech decoder. These evaluations focus on two particular tasks which include a large-vocabulary continuous speech transcription system with a 460k vocabulary evaluated on the JNAS corpus, and a voice search system developed for an all-Japan train timetables task. This paper provides a detailed explanation of the successful steps taken to construct a large integrated network which achieves high recognition performance, based on an exhaustive comparison of different construction strategies. Furthermore, in the context of the voice search task, this paper provides a performance comparison of two widely popular acoustic model toolkits, HTK and SphinxTrain in the unified context of the T~3 decoder. In particular these results indicate that there is a significant advantage to employing the log semiring for all WFST construction operations. These results also serve to further verify the flexibility and speed of the T~3 decoder on a variety of different tasks.

机译：本文介绍了基于东京技术换能器（T〜3）语音解码器的最新性能评估。这些评估集中在两个特定的任务上，其中包括一个大词汇量连续语音转录系统和一个在JNAS语料库上评估的460k词汇量，以及一个针对全日本列车时间表任务开发的语音搜索系统。本文对不同的构建策略进行了详尽的比较，详细说明了构建大型集成网络并成功实现识别性能的成功步骤。此外，在语音搜索任务的上下文中，本文在T〜3解码器的统一上下文中提供了两种流行的声学模型工具包HTK和SphinxTrain的性能比较。特别是这些结果表明，对所有WFST施工操作采用对数半圆环具有明显的优势。这些结果还有助于进一步验证T〜3解码器在各种不同任务上的灵活性和速度。

著录项

来源
《電子情報通信学会技術研究報告. 音声. Speech》 |2009年第356期|共6页
作者
Paul R. DIXON; Josef R. NOVAK; Tasuku OONISHI; Sadaoki FURUI;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电报、传真;
关键词
Speech Recognition; WFST; LVCSR;

机译：语音识别WFST LVCSR;

相似文献

外文文献
中文文献
专利

1. Recent Evaluations of a WFST-Based Speech Recognition Decoder [J] . Paul R. DIXON, Josef R. NOVAK, Tasuku OONISHI, 電子情報通信学会技術研究報告. 音声. Speech . 2009,第356期

机译：基于WFST的语音识别解码器的最新评估
2. Recent Evaluations of a WFST-Based Speech Recognition Decoder [J] . Paul R. DIXON, Josef R. NOVAK, Tasuku OONISHI, 電子情報通信学会技術研究報告 . 2009,第355期

机译：基于WFST的语音识别解码器的最新评估
3. Recent Evaluations of a WFST-Based Speech Recognition Decoder [J] . Paul R. DIXON, Josef R. NOVAK, Tasuku OONISHI, 電子情報通信学会技術研究報告 . 2009,第356期

机译：基于WFST的语音识别解码器的最新评估
4. An Asynchronous WFST-Based Decoder for Automatic Speech Recognition [C] . Hang Lv, Zhehuai Chen, Hainan Xu, IEEE International Conference on Acoustics, Speech and Signal Processing . 2021

机译：基于异步WFST的解码器，用于自动语音识别
5. Objective speech intelligibility assessment using speech recognition and bigram statistics with application to low bit-rate codec evaluation [D] . Teng, Yan 2006

机译：使用语音识别和双字母组统计的客观语音清晰度评估及其在低比特率编解码器评估中的应用
6. Neural speech recognition: Continuous phoneme decoding using spatiotemporal representations of human cortical activity [O] . David A Moses, Nima Mesgarani, Matthew K Leonard, -1

机译：神经语音识别：使用人类皮层活动的时空表示进行连续音素解码
7. EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding [O] . Miao, Yajie, Gowayyed, Mohammad, Metze, Florian 2015

机译：EEsEN：使用Deep RNN模型和。的端到端语音识别基于WFsT的解码
8. Efficient A* Stack Decoder Algorithm for Continuous Speech Recognition with a Stochastic Language Model. [R] . Paul, D. B. 1991

机译：用随机语言模型进行连续语音识别的高效a *堆栈译码算法。

Recent Evaluations of a WFST-Based Speech Recognition Decoder

摘要

著录项

相似文献

相关主题

期刊订阅