SMOOTHENING THE INFORMATION DENSITY OF SPOKEN WORDS IN AN AUDIO SIGNAL

Abstract

A portion of an audio signal is identified corresponding to a spoken word and its phonemes. A set of alternate spoken words satisfying phonetic similarity criteria to the spoken word is generated. A subset of the set of alternate spoken words is also identified; each member of the subset shares the same phoneme in a similar temporal position as the spoken word. A significance factor is then calculated for the phoneme based on the number of alternates in the subset and on the total number of alternates. The calculated significance factor may then be used to lengthen or shorten the temporal duration of the phoneme in the audio signal according to its significance in the spoken word.
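
The abstract names the inputs to the significance factor (the number of alternates that share the phoneme, and the total number of alternates) but not the formula. The following is a minimal sketch under stated assumptions, not the patented method. The toy lexicon, the "differs in at most one phoneme" similarity criterion, the use of the phoneme index as the "temporal position", the formula `1 - shared / total`, and the linear duration scaling are all illustrative choices, and every function name is hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Phonemes = Tuple[str, ...]

# Hypothetical toy lexicon with ARPAbet-style phoneme symbols.
LEXICON: Dict[str, Phonemes] = {
    "cat": ("K", "AE", "T"),
    "bat": ("B", "AE", "T"),
    "hat": ("HH", "AE", "T"),
    "cut": ("K", "AH", "T"),
    "cap": ("K", "AE", "P"),
    "dog": ("D", "AO", "G"),
}


def alternates(word: str, lexicon: Dict[str, Phonemes]) -> List[str]:
    """Words that are phonetically similar to `word`.

    Assumed criterion: same number of phonemes and at most one mismatch.
    """
    target = lexicon[word]
    result = []
    for other, phones in lexicon.items():
        if other == word or len(phones) != len(target):
            continue
        mismatches = sum(a != b for a, b in zip(target, phones))
        if mismatches <= 1:
            result.append(other)
    return result


def significance(word: str, position: int, lexicon: Dict[str, Phonemes]) -> float:
    """Assumed formula: 1 - (alternates sharing the phoneme) / (all alternates).

    A phoneme shared by most alternates does little to tell the word apart
    from them, so it scores low; a phoneme few alternates share scores high.
    """
    target = lexicon[word]
    alts = alternates(word, lexicon)
    if not alts:
        return 0.5  # assumption: neutral value when there is nothing to confuse with
    shared = sum(1 for w in alts if lexicon[w][position] == target[position])
    return 1.0 - shared / len(alts)


def rescale_durations(
    word: str,
    durations_ms: Sequence[float],
    lexicon: Dict[str, Phonemes],
    min_scale: float = 0.8,
    max_scale: float = 1.2,
) -> List[float]:
    """Lengthen significant phonemes and shorten insignificant ones.

    Assumed mapping: significance 0 -> min_scale, significance 1 -> max_scale.
    """
    return [
        d * (min_scale + (max_scale - min_scale) * significance(word, i, lexicon))
        for i, d in enumerate(durations_ms)
    ]


if __name__ == "__main__":
    for i, phone in enumerate(LEXICON["cat"]):
        print(phone, significance("cat", i, LEXICON))
    print(rescale_durations("cat", [80.0, 120.0, 60.0], LEXICON))
```

In this toy lexicon, "cat" has four alternates. Two of them keep the initial K, while three keep the vowel and three keep the final T, so K receives the highest significance and is the phoneme least shortened by the rescaling step. Applying the new durations to real audio would need a time-stretching step, which is outside this sketch.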

Bibliographic details

Publication number: US2015073803A1

Patent type:
Publication date: 2015-03-12

Original format: PDF
Applicant/Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Application number: US201314025323
Inventors: LAV R. VARSHNEY; FLEMMING BOEGELUND

Filing date: 2013-09-12
Classification: G10L15/187
Country: US
Date indexed: 2022-08-21 15:26:05
