首页> 外国专利> Method and apparatus for large vocabulary continuous speech recognition based on dynamic addition of vocabulary to be recognized

Method and apparatus for large vocabulary continuous speech recognition based on dynamic addition of vocabulary to be recognized

机译：基于待识别词汇动态加法的大词汇量连续语音识别方法及装置

页面导航

摘要
著录项
相似文献

摘要

The present invention improves the recognition rate of the speech recognition system by using the multimedia data such as broadcasting, conference recording, call center conversation recording, and the like when users provide and add words or strings that are expected to be frequently generated errors. A method and apparatus for continuous speech recognition (large vocabulary continuous speech recognition) are proposed. According to an aspect of the present invention, in a continuous speech recognition system, applying a'BLANK' vocabulary to a language model in advance to construct a language model (vocabulary extension language model) capable of extending the vocabulary; Providing a user with a user-defined word/word phrase as an input to the speech recognition system, and inserting the user-defined word/word phrase into the'BLANK' vocabulary previously prepared in the vocabulary-extended language model search space; Provided is a method for continuously recognizing a large vocabulary based on a dynamic recognition target vocabulary, comprising receiving a user's spoken voice and performing a decoding process for the speech.

机译：当用户提供并添加期望经常产生错误的单词或字符串时，本发明通过使用诸如广播，会议记录，呼叫中心对话记录等的多媒体数据来提高语音识别系统的识别率。提出了一种用于连续语音识别（大词汇量连续语音识别）的方法和设备。根据本发明的一个方面，在连续语音识别系统中，预先将“空白”词汇应用于语言模型以构建能够扩展词汇的语言模型（词汇扩展语言模型）。向用户提供用户定义的单词/单词短语作为语音识别系统的输入，并将用户定义的单词/单词短语插入先前在词汇扩展语言模型搜索空间中准备的“ BLANK”词汇中;提供了一种基于动态识别目标词汇表连续识别大词汇表的方法，该方法包括：接收用户的口头语音;以及对该语音进行解码处理。

著录项

公开/公告号KR20200072920A

专利类型
公开/公告日2020-06-23

原文格式PDF
申请/专利权人 한국전자통신연구원;
展开▼

申请/专利号KR20180161034
发明设计人 전형배;오유리;박기영;강병옥;강점자;김현우;박전규;송화전;이성주;이윤경;이윤근;정의석;정호영;정훈;최우용;한란;
展开▼

申请日2018-12-13
分类号G10L15/183;
国家 KR
入库时间 2022-08-21 11:06:39

相似文献

专利
外文文献
中文文献