【24h】

Spectro-Temporal Interactions in Auditory and Auditory-Visual Speech Processing

机译:听觉和听觉-视觉语音处理中的光谱-时间相互作用

获取原文
获取原文并翻译 | 示例

摘要

Speech recognition often involves the face-to-face communication between two or more individuals. The combined influences of auditory and visual speech information leads to a remarkably robust signal that is greatly resistant to noise, reverberation, hearing loss, and other forms of signal distortion. Studies of auditory-visual speech processing have revealed that speechreading interacts with audition in both the spectral and temporal domain. For example, not all speech frequencies are equal in their ability to supplement speechreading, with low-frequency speech cues providing more benefit than high-frequency speech cues. Additionally, in contrast to auditory speech processing which integrates information across frequency over relatively short time windows (20-40 ms), auditory-visual speech processing appears to use relatively long time windows of integration (roughly 250 ms). In this paper, some of the basic spectral and temporal interactions between auditory and visual speech channels are enumerated and discussed.
机译:语音识别通常涉及两个或更多个人之间的面对面交流。听觉和视觉语音信息的综合影响会产生非常强大的信号,该信号可以极大地抵抗噪声,混响,听力损失和其他形式的信号失真。听觉-视觉语音处理的研究表明,语音阅读与听觉在频谱和时域上都相互作用。例如,并非所有语音频率在补充语音阅读方面的能力都相等,低频语音提示比高频语音提示提供更多的好处。另外,与听觉语音处理在相对较短的时间窗口(20-40 ms)上跨频率集成信息相比,听觉-视觉语音处理似乎使用相对较长的时间窗口(大约250 ms)。在本文中,列举并讨论了听觉和视觉语音通道之间的一些基本的频谱和时间交互作用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号