首页> 外国专利> Detection of speech syllable / vowel / phoneme boundaries using auditory attention cues

Detection of speech syllable / vowel / phoneme boundaries using auditory attention cues

机译:使用听觉提示信号检测语音音节/元音/音素边界

摘要

In syllable or vowel or phone boundary detection during speech, an auditory spectrum may be determined for an input window of sound and one or more multi-scale features may be extracted from the auditory spectrum. Each multi-scale feature can be extracted using a separate two-dimensional spectro-temporal receptive filter. One or more feature maps corresponding to the one or more multi-scale features can be generated and an auditory gist vector can be extracted from each of the one or more feature maps. A cumulative gist vector may be obtained through augmentation of each auditory gist vector extracted from the one or more feature maps. One or more syllable or vowel or phone boundaries in the input window of sound can be detected by mapping the cumulative gist vector to one or more syllable or vowel or phone boundary characteristics using a machine learning algorithm.
机译:在语音期间的音节或元音或电话边界检测中,可以为声音的输入窗口确定听觉频谱,并且可以从听觉频谱中提取一个或多个多尺度特征。可以使用单独的二维光谱时态接收滤波器提取每个多尺度特征。可以生成与一个或多个多尺度特征相对应的一个或多个特征图,并且可以从一个或多个特征图的每一个提取听觉要点矢量。可以通过增加从一个或多个特征图提取的每个听觉主向量来获得累积主向量。通过使用机器学习算法将累积要点矢量映射到一个或多个音节或元音或电话边界特性,可以检测声音输入窗口中的一个或多个音节或元音或电话边界。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号