首页> 外文期刊>Speech Communication >Detection of auditory (cross-spectral) and auditory-visual (cross-modal) synchrony
【24h】

Detection of auditory (cross-spectral) and auditory-visual (cross-modal) synchrony

机译:听觉(跨谱)和听觉-视觉(跨模态)同步检测

获取原文
获取原文并翻译 | 示例
       

摘要

Detection thresholds for temporal synchrony in auditory and auditory-visual sentence materials were obtained on normal-hearing subjects. For auditory conditions, thresholds were determined using an adaptive-tracking procedure to control the degree of temporal asynchrony of a narrow audio band of speech, both positive and negative in separate tracks, relative to three other narrow audio bands of speech. For auditory-visual conditions, thresholds were determined in a similar manner for each of four narrow audio bands of speech as well as a broadband speech condition, relative to a video image of a female speaker. Four different auditory filter conditions, as well as a broadband auditory-visual speech condition, were evaluated in order to determine whether detection thresholds were dependent on the spectral content of the acoustic speech signal. Consistent with previous studies of auditory-visual speech recognition which showed a broad, asymmetrical range of temporal synchrony for which intelligibility was basically unaffected (audio delays roughly between -40ms and +240ms), auditory-visual synchrony detection thresholds also showed a broad, asymmetrical pattern of similar magnitude (audio delays roughly between -45 ms and +200 ms). No differences in synchrony thresholds were observed for the different filtered bands of speech, or for broadband speech. In contrast, detection thresholds for audio-alone conditions were much smaller (between -17ms and +23ms) and symmetrical. These results suggest a fairly tight coupling between a subject's ability to detect cross-spectral (auditory) and cross-modal (auditory visual) asynchrony and the intelligibility of auditory and auditory-visual speech materials. Published by Elsevier B.V.
机译:在听力正常的受试者上获得听觉和听觉-视觉句子材料中时间同步的检测阈值。对于听觉条件,使用自适应跟踪过程确定阈值,以控制相对于其他三个窄音频语音带的窄音频语音带的时间异步程度,该音频带在单独的音轨中为正和负。对于听觉视觉条件,相对于女性讲话者的视频图像,以类似的方式为四个窄音频语音带和宽带语音条件中的每一个确定阈值。为了确定检测阈值是否取决于声学语音信号的频谱含量,对四种不同的听觉滤波器条件以及宽带听觉-视觉语音条件进行了评估。与先前的听觉-视觉语音识别研究一致,该研究显示了时间同步的广泛,不对称范围,其清晰度基本不受影响(音频延迟大约在-40ms到+ 240ms之间),听觉-视觉同步检测阈值也显示出广泛的,不对称大小相似的模式(音频延迟大约在-45毫秒至+200毫秒之间)。对于不同的语音滤波频带或宽带语音,没有观察到同步阈值的差异。相反,仅音频条件的检测阈值要小得多(在-17ms和+ 23ms之间)并且对称。这些结果表明,受试者检测跨谱(听觉)和跨模态(听觉视觉)异步的能力与听觉和听觉语音材料的清晰度之间存在相当紧密的联系。由Elsevier B.V.发布

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号