Source (conference proceedings): Computer Processing of Oriental Languages: Beyond the Orient: The Research Challenges Ahead; Lecture Notes in Artificial Intelligence; 4285

Word Error Correction of Continuous Speech Recognition Using WEB Documents for Spoken Document Indexing


Abstract

This paper describes a method for correcting errors in continuous speech recognition using Web documents, with the aim of indexing spoken documents. We ran an error correction experiment on automatically transcribed news speech, focusing on proper nouns. Two LVCSR (large-vocabulary continuous speech recognition) systems were used to distinguish correctly recognized words from misrecognized ones. Keywords for an Internet search engine were selected from the correctly transcribed words, and correction candidates for the misrecognized words were then extracted from the retrieved documents. A dynamic programming (DP) technique with a confusion matrix was used to compare the candidates with the misrecognized words. In the experiment, using Web documents improved the recognition rate of proper nouns by about 10%.
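
The DP comparison step mentioned in the abstract can be illustrated with a confusion-weighted edit distance. The following is a minimal sketch only, not the authors' implementation: the abstract does not say which units are aligned (phonemes, syllables, or characters), so the code assumes generic symbol sequences, and the function names, the cost values, and the example confusion table are all hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical confusion table: (recognized symbol, true symbol) -> substitution cost.
# A low cost means the recognizer often confuses the pair. Values are illustrative.
Confusion = Dict[Tuple[str, str], float]


def weighted_edit_distance(
    hyp: Sequence[str],
    cand: Sequence[str],
    confusion: Confusion,
    default_sub: float = 1.0,  # assumed cost for pairs missing from the table
    indel: float = 1.0,        # assumed cost for an insertion or deletion
) -> float:
    """DP alignment cost between a misrecognized word and a candidate word."""
    n, m = len(hyp), len(cand)
    dp = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = i * indel
    for j in range(1, m + 1):
        dp[0][j] = j * indel
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            a, b = hyp[i - 1], cand[j - 1]
            # Identical symbols cost nothing; otherwise look up the confusion cost.
            sub = 0.0 if a == b else confusion.get((a, b), default_sub)
            dp[i][j] = min(
                dp[i - 1][j] + indel,    # delete a symbol from hyp
                dp[i][j - 1] + indel,    # insert a symbol from cand
                dp[i - 1][j - 1] + sub,  # match or substitute
            )
    return dp[n][m]


def best_candidate(
    hyp: Sequence[str],
    candidates: List[Sequence[str]],
    confusion: Confusion,
) -> Sequence[str]:
    """Return the candidate with the lowest alignment cost to the hypothesis."""
    return min(candidates, key=lambda c: weighted_edit_distance(hyp, c, confusion))


if __name__ == "__main__":
    confusion = {("b", "p"): 0.2}  # assume "p" is often misrecognized as "b"
    hyp = ["b", "a", "r", "i"]
    candidates = [["m", "a", "r", "i"], ["p", "a", "r", "i"]]
    print(best_candidate(hyp, candidates, confusion))
```

In this sketch, a candidate taken from a retrieved document wins when its differences from the misrecognized word are ones the recognizer is likely to make, which is the role a confusion matrix plays in such a comparison. How the paper actually derives its costs is not stated in the abstract.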
