首页> 外国专利> SMOOTHENING THE INFORMATION DENSITY OF SPOKEN WORDS IN AN AUDIO SIGNAL

SMOOTHENING THE INFORMATION DENSITY OF SPOKEN WORDS IN AN AUDIO SIGNAL

机译:消除音频信号中口语的信息密度

摘要

A portion of an audio signal is identified corresponding to a spoken word and its phonemes. A set of alternate spoken words satisfying phonetic similarity criteria to the spoken word is generated. A subset of the set of alternate spoken words is also identified; each member of the subset shares the same phoneme in a similar temporal position as the spoken word. A significance factor is then calculated for the phoneme based on the number of alternates in the subset and on the total number of alternates. The calculated significance factor may then be used to lengthen or shorten the temporal duration of the phoneme in the audio signal according to its significance in the spoken word.
机译:识别出音频信号的一部分,该部分对应于语音单词及其音素。产生满足与语音单词相似的语音相似性标准的一组备用语音单词。还标识了一组备用口语单词的子集;子集的每个成员在与口头单词相似的时间位置共享相同的音素。然后,根据子集中的替换数和替换总数,为音素计算一个重要因子。然后,可以根据语音信号在语音中的重要性,将计算出的重要性因子用于延长或缩短音频信号中音素的时间持续时间。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号