首页> 外文会议>International Conference on Auditory Display; 20030706-09; Boston,MA(US) >OPTIMIZING THE SPATIAL CONFIGURATION OF A SEVEN-TALKER SPEECH DISPLAY
【24h】

OPTIMIZING THE SPATIAL CONFIGURATION OF A SEVEN-TALKER SPEECH DISPLAY

机译:优化七言语显示的空间配置

获取原文
获取原文并翻译 | 示例

摘要

Although there is substantial evidence that performance in mul-titalker listening tasks can be improved by spatially separating the apparent locations of the competing talkers, very little effort has been made to determine the best locations and presentation levels for the talkers in a multichannel speech display. In this experiment, a call-sign based color and number identification task was used to evaluate the effectiveness of three different spatial configurations and two different level normalization schemes in a seven-channel binaural speech display. When only two spatially-adjacent channels of the seven-channel system were active, overall performance was substantially better with a geometrically-spaced spatial configuration (with far-field talkers at -90°, -30°, -10°, 0°, +10°, +30°, and +90° azimuth) or a hybrid near-far configuration (with far-field talkers at -90°, -30°, 0°, +30°, and +90° azimuth and near-field talkers at ±90°) than with a more conventional linearly-spaced configuration (with far-field talkers at -90°, -60°, -30°, 0°, +30°, +60°, and +90° azimuth). When all seven channels were active, performance was generally better with a "better-ear" normalization scheme that equalized the levels of the talkers in the more intense ear than with a default normalization scheme that equalized the levels of the talkers at the center of the head. The best overall performance in the seven-talker task occurred when the hybrid near-far spatial configuration was combined with the better-ear normalization scheme. This combination resulted in a 20% increase in the number of correct identifications relative to the baseline condition with linearly-spaced talker locations and no level normalization. Although this is a relatively modest improvement, it should be noted that it could be achieved at little or no cost simply by reconfiguring the HRTFs used in a multitalker speech display.
机译:尽管有充分的证据表明,可以通过在空间上分隔竞争讲话者的明显位置来提高多讲话者收听任务的性能,但在确定多声道语音显示中讲话者的最佳位置和呈现级别方面所做的工作很少。在此实验中,基于呼叫符号的颜色和数字识别任务用于评估七通道双耳语音显示中三种不同空间配置和两种不同级别归一化方案的有效性。当七通道系统中只有两个在空间上相邻的通道处于活动状态时,采用几何间隔的空间配置(在-90°,-30°,-10°,0°, + 10°,+ 30°和+ 90°方位角或混合远近配置(在-90°,-30°,0°,+ 30°和+ 90°方位角及附近具有远场发声器场扬声器在±90°上)而不是更常规的线性间隔配置(在-90°,-60°,-30°,0°,+ 30°,+ 60°和+90时具有远场扬声器°方位角)。当所有七个通道均处于活动状态时,使用“更好的耳朵”归一化方案使讲话者的耳朵更加均衡,其性能通常要优于使用默认的归一化方案来使讲话者的声音在扬声器中心均衡的性能更好。头。当混合近距离空间配置与更好的耳朵归一化方案相结合时,七说话者任务中的总体性能最佳。这种组合导致相对于基线条件的正确识别数量增加了20%,基线状态的讲话者位置呈线性间隔,并且未进行电平归一化。尽管这是一个相对适度的改进,但应注意的是,只需重新配置多方通话器语音显示中使用的HRTF,就可以以很少的成本或根本没有的成本来实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号