首页> 外国专利> METHOD AND SYSTEM FOR AUTOMATIC WORD SPACING OF VOICE RECOGNITION USING NAMED ENTITY RECOGNITION

METHOD AND SYSTEM FOR AUTOMATIC WORD SPACING OF VOICE RECOGNITION USING NAMED ENTITY RECOGNITION

机译:使用命名实体识别自动识别语音的单词分配方法和系统

摘要

The present invention provides a method and a system for compensating voice recognized word spacing using the recognition of an entity name, wherein the method comprises: a step of recognizing an inputted voice and generating voice text; an error section determination step of estimating an error section of the voice recognition via a natural language processing procedure for the voice text, and setting the error section as a compensation target; a category estimation step of extracting the compensation target and a usage pattern using the compensation target in the context of the compensation target from the voice text, and comparing the extracted usage pattern with the usage pattern of an object stored in an entity name usage pattern database for each category to estimate the category corresponding to the compensation target; and a spacing compensation step of analyzing an occurrence frequency of a syllable N-gram for each category for the compensation target based on an entity name dictionary database for each category to compensate the spacing of the compensation target. The present invention first checks a scope of a corresponding word with the category even though words not registered in a voice recognition dictionary such as a proper noun, a new-coined word, a varied word or the like and applying the checked scope to a spacing probability, thereby compensating the spacing error of the voice recognition, correctly and reliably.
机译:本发明提供了一种使用实体名称的识别来补偿语音识别的单词间隔的方法和系统,其中,该方法包括:识别输入的语音并生成语音文本的步骤;以及错误部分确定步骤,该错误部分确定步骤经由用于语音文本的自然语言处理过程来估计语音识别的错误部分,并将该错误部分设置为补偿目标;类别估计步骤:从语音文本中提取补偿目标和在补偿目标的上下文中使用补偿目标的使用模式,并将提取的使用模式与存储在实体名称使用模式数据库中的对象的使用模式进行比较为每个类别估计与补偿目标相对应的类别;间隔补偿步骤,其基于每个类别的实体名称词典数据库,分析所述补偿目标的每个类别的音节N-gram的出现频率,以补偿所述补偿目标的间隔。即使没有在语音识别字典中注册的单词(例如专有名词,新硬币,变体单词等),本发明也首先检查具有该类别的相应单词的范围,并将所检查的范围应用于间隔概率,从而正确,可靠地补偿语音识别的间隔误差。

著录项

  • 公开/公告号KR20150066361A

    专利类型

  • 公开/公告日2015-06-16

    原文格式PDF

  • 申请/专利权人 KT CORPORATION;

    申请/专利号KR20130151798

  • 发明设计人 PARK JAE HAN;

    申请日2013-12-06

  • 分类号G10L15/19;G06F17/20;G10L15/26;

  • 国家 KR

  • 入库时间 2022-08-21 14:59:52

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号