首页> 外国专利> Sound envelope deconstruction to identify words and speakers in continuous speech

Sound envelope deconstruction to identify words and speakers in continuous speech

机译:声音包络解构以识别连续语音中的单词和说话者

摘要

A speech recognition capability in which speakers of spoken text are identified based on the contour of sound waves representing the spoken text. Variations in the contour of the sound waves are identified, features are assigned to those variations, and parameters of those features are grouped into predefined characteristics. The predefined characteristics are combined into voice characteristic groups. If a prior voice characteristic group is present, the voice characteristic group from the soundlet is compared to existing voice characteristic groups and, if a match is present, the sound construct is assigned to a speaker identified by the existing voice characteristic group.
机译:语音识别功能,其中基于代表语音文本的声波轮廓识别语音文本的说话者。识别声波轮廓的变化,将特征分配给这些变化,并将这些特征的参数分组为预定义的特征。将预定义的特性组合到语音特性组中。如果存在先前的语音特征组,则将来自小号的语音特征组与现有的语音特征组进行比较,如果存在匹配,则将声音构造分配给由现有语音特征组标识的说话者。

著录项

  • 公开/公告号US9754593B2

    专利类型

  • 公开/公告日2017-09-05

    原文格式PDF

  • 申请/专利权人 INTERNATIONAL BUSINESS MACHINES CORPORATION;

    申请/专利号US201514931943

  • 发明设计人 MUKUNDAN SUNDARARAJAN;

    申请日2015-11-04

  • 分类号G10L17/02;G10L19/00;G10L15/30;G10L25/15;

  • 国家 US

  • 入库时间 2022-08-21 13:42:35

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号