首页> 外国专利> SYSTEM FOR RECOGNIZING DYNAMIC TIME-WARPING ISOLATED WORD USING VOICED, UNVOICED, AND SILENCE SOUND INFORMATION FOUNDATION

SYSTEM FOR RECOGNIZING DYNAMIC TIME-WARPING ISOLATED WORD USING VOICED, UNVOICED, AND SILENCE SOUND INFORMATION FOUNDATION

机译:利用语音,语音和静音声音信息基础来识别动态时空偏移隔离词的系统

摘要

PURPOSE: A system for recognizing a dynamic time-warping(DTW) isolated word is provided to use voiced/unvoiced/silence sound information extracted from recognition-targeted voice signals to perform a DTW algorithm, so as to reduce calculating amounts consumed in a pattern matching. CONSTITUTION: A database(10) separates voiced/unvoiced/silence sound code sections relating to each recognition-targeted voice, and classifies the voices as standard voice patterns according to code word patterns, composed of combinations of the separated code section. The database(10) stores the classified voices. A pre-emphasis processor(20) extracts first characteristic variables from an inputted original voice signal to perform a pre-emphasis process for an original voice, and extracts second characteristic variables to generate a test voice pattern of the original voice signal by using the first/the second characteristic variables, then extracts boundary information according to each voiced/unvoiced/silence sound code section from the original voice signal. A code word classifier(30) forms code words divided into voiced/unvoiced/silence sound sections by using the first/the second variables, and retrieves standard voice patterns having code word patterns corresponding to the formed code words from the database(10). A pattern matching unit(40) partially applies a dynamic time-warping(DTW) algorithm to the retrieved standard voice patterns by using the boundary information, and performs pattern matching processes to generate a recognition result.
机译:目的:提供一种用于识别动态时空扭曲(DTW)隔离词的系统,该系统使用从识别目标语音信号中提取的有声/无声/无声声音信息来执行DTW算法,从而减少在模式中消耗的计算量匹配。组成:数据库(10)分离与每个识别目标语音相关的有声/无声/无声语音代码段,并根据由分离的代码段的组合组成的代码字模式将语音分类为标准语音模式。数据库(10)存储分类语音。预加重处理器(20)从输入的原始语音信号中提取第一特征变量以对原始语音执行预加重处理,并提取第二特征变量,以通过使用第一特征变量来生成原始语音信号的测试语音模式。 /第二特征变量,然后根据原始语音信号中的每个有声/无声/无声声音代码段提取边界信息。码字分类器(30)通过使用第一/第二变量形成被划分为有声/无声/静音部分的码字,并从数据库(10)中检索具有与所形成的码字相对应的码字模式的标准语音模式。模式匹配单元(40)通过使用边界信息将动态时间规整(DTW)算法部分地应用于检索到的标准语音模式,并且执行模式匹配处理以生成识别结果。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号