Research on Chinese Character Confusion Network Algorithm for LVCSR

Abstract

In large vocabulary continuous speech recognition (LVCSR), the output of a recognizer that uses the standard maximum a posteriori (MAP) decoding strategy minimizes the sentence error rate, so there is a mismatch between the MAP recognition results and the commonly used performance metric, word error rate (WER). The minimum Bayes risk (MBR) decoding strategy can be used to obtain recognition results with minimum WER. One method of MBR decoding transforms the word lattice into a confusion network in order to obtain the hypothesis with minimum WER. Based on the characteristics of Mandarin, we propose a Chinese character confusion network generation algorithm that builds on previous work. First, a Chinese word lattice is produced by a standard Mandarin large vocabulary continuous speech recognizer; then the word lattice is analyzed and processed according to features of the Chinese language to produce a Chinese character lattice; finally, a Chinese character confusion network is produced by aligning the character lattice. Experimental results on Mandarin large vocabulary continuous speech recognition show that the proposed algorithm yields a lower WER than MAP recognition and the two previous confusion network generation algorithms.
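The three stages named in the abstract (word lattice, character lattice, character confusion network) can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not specify how word edges are split or how the character lattice is aligned, so the sketch assumes that a word's time span is divided evenly among its characters, that each character inherits the word's posterior, and that alignment is approximated by fixed-width time buckets. All function names, frame indices, and posterior values are hypothetical.

```python
from collections import defaultdict


def split_word_edge(word, start, end, posterior):
    """Split one word-lattice edge into per-character edges.

    Assumption (not from the paper): the time span is divided evenly
    and every character inherits the posterior of its word.
    """
    step = (end - start) / len(word)
    return [
        (ch, start + i * step, start + (i + 1) * step, posterior)
        for i, ch in enumerate(word)
    ]


def build_slots(char_edges, slot_width):
    """Group character edges into confusion-network slots.

    Assumption: an edge belongs to the fixed-width time bucket that
    contains its midpoint. Posteriors of identical characters that
    land in the same slot are summed.
    """
    slots = defaultdict(lambda: defaultdict(float))
    for ch, start, end, posterior in char_edges:
        index = int(((start + end) / 2) // slot_width)
        slots[index][ch] += posterior
    return [dict(slots[i]) for i in sorted(slots)]


def consensus(slots):
    """Pick the highest-posterior character in every slot."""
    return "".join(max(slot, key=slot.get) for slot in slots)


# Hypothetical word lattice: (word, start frame, end frame, posterior).
word_edges = [
    ("中国", 0, 40, 0.6),
    ("中", 0, 20, 0.4),
    ("过", 20, 40, 0.4),
    ("人", 40, 60, 1.0),
]

char_edges = [c for edge in word_edges for c in split_word_edge(*edge)]
slots = build_slots(char_edges, slot_width=20)
print(consensus(slots))  # prints 中国人
```

In this toy lattice the character 中 collects posterior mass from both the two-character word 中国 and the single-character word 中, which is the kind of evidence pooling that a character-level network allows and a word-level network does not.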

Bibliographic details

Source
Proceedings of the International Colloquium on Information Fusion 2007 | 2007 | pp. 389-392 | 4 pages
Authors
Bin Wu; Chengli Sun; Jun Guo; Gang Liu
Original format: PDF
Chinese Library Classification: Information processing technology
Keywords
LVCSR; Chinese character confusion network; MBR; WER
