首页> 外国专利> Speech recognition error analysis apparatus, method, program, and recording medium therefor

Speech recognition error analysis apparatus, method, program, and recording medium therefor

机译:语音识别错误分析装置,方法,程序及其记录介质

摘要

P To specify a part being likely to cause error in recognition in a language model. PSOLUTION: In a method and voice recognition processings are performed for a voice signal by using the language model and recognized word strings are assigned. The erroneously recognized word strings constituted by one or a plurality of continuous words which do not agree with the correct word strings corresponding to the recognized word strings in the recognized word strings and a section of erroneous recognition constituted by the erroneously recognized word strings and each of one word before and after the erroneously recognized word strings are extracted out of the recognized word strings. A set of two erroneous words in a START part constituted by the first word in the section of erroneous recognition and the first word in the erroneously recognized word strings is extracted. A set of two correct words in the START part constituted by the first word in the section of erroneous recognition and the first word in the correct word strings corresponding to the erroneously recognized word strings is extracted. Each of word chain probability of the set of two erroneous words in the START part and the set of two correct words in the START part is computed by using the language model. The set of two correct words in the START part having lower word chain probability than the word chain probability of the set of two erroneous words in the START part is extracted. PCOPYRIGHT: (C)2009 and JPO& INPIT
机译:

指定在语言模型中可能导致识别错误的零件。

解决方案:在一种方法中,通过使用语言模型对语音信号执行语音识别处理,并分配识别的单词字符串。由一个或多个连续单词构成的错误识别单词串,这些连续单词与对应于已识别单词串中的已识别单词串的正确单词串不一致,并且由错误识别单词串和每个错误组成的部分错误识别从识别的单词串中提取错误识别的单词串之前和之后的一个单词。提取由错误识别部分中的第一个单词和错误识别词串中的第一个单词组成的START部分中的两个错误单词的集合。提取由错误识别部分中的第一个单词和与错误识别的单词串相对应的正确单词串中的第一个单词构成的START部分中的两个正确单词的集合。使用语言模型计算START部分中两个错误单词的集合和START部分中两个正确单词的集合的单词链概率。提取START部分中的两个正确单词的集合,其词链概率比START部分中的两个错误单词的集合的词链概率低。

版权:(C)2009和JPO&INPIT

著录项

  • 公开/公告号JP4829910B2

    专利类型

  • 公开/公告日2011-12-07

    原文格式PDF

  • 申请/专利权人 日本電信電話株式会社;

    申请/专利号JP20080038468

  • 发明设计人 浅見 太一;野田 喜昭;

    申请日2008-02-20

  • 分类号G10L15/18;G10L15/06;G10L15/22;

  • 国家 JP

  • 入库时间 2022-08-21 17:35:39

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号