Searching for audiovisual correspondence in multiple speaker scenarios

Agnès Alsius; Salvador Soto-Faraco

首页> 外文期刊>Experimental Brain Research >Searching for audiovisual correspondence in multiple speaker scenarios

【24h】

Searching for audiovisual correspondence in multiple speaker scenarios

机译：在多个说话者场景中搜索视听对应

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A critical question in multisensory processing is how the constant information flow that arrives to our different senses is organized in coherent representations. Some authors claim that pre-attentive detection of inter-sensory correlations supports crossmodal binding, whereas other findings indicate that attention plays a crucial role. We used visual and auditory search tasks for speaking faces to address the role of selective spatial attention in audiovisual binding. Search efficiency amongst faces for the match with a voice declined with the number of faces being monitored concurrently, consistent with an attentive search mechanism. In contrast, search amongst auditory speech streams for the match with a face was independent of the number of streams being monitored concurrently, as long as localization was not required. We suggest that the fundamental differences in the way in which auditory and visual information is encoded play a limiting role in crossmodal binding. Based on these unisensory limitations, we provide a unified explanation for several previous apparently contradictory findings.

机译：多传感器处理中的一个关键问题是，如何以连贯的表示形式组织到达我们不同感官的恒定信息流。一些作者声称，对注意力之间的相互关系进行细心的检测可以支持跨峰绑定，而其他发现则表明注意力起着至关重要的作用。我们使用视觉和听觉搜索任务来处理人脸，以解决选择性空间注意力在视听绑定中的作用。与同时进行监视的脸部数量同时，与语音匹配的脸部搜索效率下降，这与注意力搜索机制一致。相反，只要不需要本地化，在听觉语音流中搜索与面部的匹配与并发监视的流的数量无关。我们建议听觉和视觉信息的编码方式的根本差异在跨峰绑定中起着限制作用。基于这些单感的局限性，我们为先前几个明显矛盾的发现提供了统一的解释。

著录项

来源
《Experimental Brain Research》 |2011年第3期|p.175-183|共9页
作者
Agnès Alsius; Salvador Soto-Faraco;
展开▼
作者单位

Department of Psychology, Queen’s University, 62 Arch st., Kingston, Ontario, K7L3N6, Canada;

Departament de Tecnologies de la Informació i les Comunicacions, Universitat Pompeu Fabra, Barcelona, Spain;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Multisensory integration; Audiovisual speech perception; Spatial attention; Visual search; Auditory search;

机译：多感觉整合;视听语音感知;空间注意力;视觉搜索;听觉搜索;

相似文献

外文文献
中文文献
专利

1. Audiovisual Probabilistic Tracking of Multiple Speakers in Meetings [J] . Gatica-Perez D., Lathoud G., Odobez J.-M., IEEE transactions on audio, speech and language processing . 2007,第2期

机译：会议中多个发言人的视听概率跟踪
2. Audiovisual Localization of Multiple Speakers in a Video Teleconferencing Setting [J] . Bill Kapralos, Michael R. M. Jenkin, Evangelos Milios International journal of imaging systems and technology . 2003,第1期

机译：视频电话会议设置中多个发言人的视听本地化
3. Audiovisual Speaker Identification Based on Lip and Speech Modalities [J] . Chelali Fatma, Djeradi Amar The international arab journal of information technology . 2017,第1期

机译：基于嘴唇和语音模态的视听说话人识别
4. DANTE Speaker Recognition Module. An Efficient and Robust Automatic Speaker Searching Solution for Terrorism-Related Scenarios [C] . Jesus Jorrin, Luis Buera International conference on multimedia modeling . 2019

机译：DANTE说话人识别模块。针对恐怖主义场景的高效健壮的自动说话人搜索解决方案
5. Probabilistic correspondence mapping for audiovisual speaker modeling [D] . Liu, Ming 2007

机译：视听说话人建模的概率对应映射
6. Audiovisual perceptual learning with multiple speakers [O] . Aaron D. Mitchel, Chip Gerfen, Daniel J. Weiss -1

机译：多个说话人的视听感知学习
7. Searching for audiovisual correspondence in multiple speaker scenarios [O] . Alsius, Agnès, Soto-Faraco, Salvador, 1970- 2011

机译：在多个演讲者场景中搜索视听通信
8. Transcription of Multiple Speakers Using Speaker Dependent Speech Recognition [R] . 2003

机译：使用说话人相关语音识别转录多个扬声器

Searching for audiovisual correspondence in multiple speaker scenarios

摘要

著录项

相似文献

相关主题

期刊订阅