
SPEECH SYLLABLE/VOWEL/PHONE BOUNDARY DETECTION USING AUDITORY ATTENTION CUES


Abstract

PROBLEM TO BE SOLVED: To detect phone, vowel or syllable boundaries in speech.

SOLUTION: In syllable, vowel or phone boundary detection during speech, an auditory spectrum is determined for an input window of sound, and one or more multi-scale features are extracted from the auditory spectrum. Each multi-scale feature can be extracted using a separate two-dimensional spectro-temporal receptive filter. One or more feature maps corresponding to the one or more multi-scale features can be generated, and an auditory gist vector can be extracted from each of the one or more feature maps. A cumulative gist vector is obtained through augmentation of each auditory gist vector extracted from the one or more feature maps. One or more syllable, vowel or phone boundaries in the input window of sound are detected by mapping the cumulative gist vector to one or more syllable, vowel or phone boundary characteristics using a machine learning algorithm.

SELECTED DRAWING: Figure 1A
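The pipeline in the abstract (auditory spectrum, 2D receptive filters, feature maps, gist vectors, cumulative gist vector, learned mapping) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not specify the filter shapes, how a gist vector is computed, or which learning algorithm is used. Here the filters are toy Gabor-like kernels at two sizes, the gist of a feature map is the mean of each cell in a 2x2 grid, "augmentation" is read as concatenation, and the learner is an untrained logistic unit. All function names and parameters are hypothetical.

```python
import math

def gabor_like_kernel(size, theta):
    """Toy oriented 2D kernel standing in for a spectro-temporal receptive filter (assumption)."""
    half = size // 2
    kernel = []
    for i in range(-half, half + 1):          # frequency offset
        row = []
        for j in range(-half, half + 1):      # time offset
            envelope = math.exp(-(i * i + j * j) / (2.0 * half * half))
            phase = 2 * math.pi * (i * math.sin(theta) + j * math.cos(theta)) / size
            row.append(envelope * math.cos(phase))
        kernel.append(row)
    return kernel

def filter2d(spectrum, kernel):
    """Same-size 2D correlation with zero padding; returns one feature map."""
    n_freq, n_time = len(spectrum), len(spectrum[0])
    half = len(kernel) // 2
    out = [[0.0] * n_time for _ in range(n_freq)]
    for f in range(n_freq):
        for t in range(n_time):
            acc = 0.0
            for i, krow in enumerate(kernel):
                for j, k in enumerate(krow):
                    ff, tt = f + i - half, t + j - half
                    if 0 <= ff < n_freq and 0 <= tt < n_time:
                        acc += k * spectrum[ff][tt]
            out[f][t] = acc
    return out

def gist_vector(feature_map, grid=(2, 2)):
    """Summarize a feature map by the mean of each grid cell (assumed gist definition)."""
    n_freq, n_time = len(feature_map), len(feature_map[0])
    rows, cols = grid
    gist = []
    for r in range(rows):
        for c in range(cols):
            f0, f1 = r * n_freq // rows, (r + 1) * n_freq // rows
            t0, t1 = c * n_time // cols, (c + 1) * n_time // cols
            cell = [feature_map[f][t] for f in range(f0, f1) for t in range(t0, t1)]
            gist.append(sum(cell) / len(cell))
    return gist

def cumulative_gist(spectrum, kernels, grid=(2, 2)):
    """Concatenate the gist vectors of all feature maps into one vector."""
    cumulative = []
    for kernel in kernels:
        cumulative.extend(gist_vector(filter2d(spectrum, kernel), grid))
    return cumulative

def boundary_score(vector, weights, bias=0.0):
    """Placeholder for the learned mapping: a logistic unit with given weights."""
    z = bias + sum(w * x for w, x in zip(weights, vector))
    return 1.0 / (1.0 + math.exp(-z))

# Synthetic 8-band x 12-frame "auditory spectrum" for one input window (made-up data).
spectrum = [[math.sin(0.3 * f + 0.5 * t) ** 2 for t in range(12)] for f in range(8)]
kernels = [gabor_like_kernel(size, theta)
           for size in (3, 5) for theta in (0.0, math.pi / 2)]
vec = cumulative_gist(spectrum, kernels)
print(len(vec))  # 4 kernels x 4 grid cells = 16
print(boundary_score(vec, [0.0] * len(vec)))  # untrained (all-zero) weights give 0.5
```

In a real system the weights would come from training on windows labeled with boundary positions, and the spectrum would come from an auditory front end rather than a synthetic array.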

Bibliographic data

  • Publication number: JP2016128935A
  • Publication date: 2016-07-14
  • Original format: PDF
  • Applicant/assignee: SONY INTERACTIVE ENTERTAINMENT LLC
  • Application number: JP20160046781
  • Inventors: OZLEM KALINLI; LUCIAN CHEN
  • Filing date: 2016-03-10
  • Classification: G10L15/04; G10L25/87; G10L21/18; G10L15/02
  • Country: JP
  • Date indexed: 2022-08-21 14:47:21
