Audiovisual speaker localization in medium smart meeting room

Abstract

This paper considers the problem of automatically selecting the currently active speaker among more than thirty participants in a medium-sized meeting room. Video tracking and sound source localization techniques are implemented to record AVI files of speaker remarks in the developed smart meeting room. Video streams from five cameras are processed to register participants in fixed chair positions and to track the main speaker, using histogram comparison and an AdaBoosted cascade classifier for face detection. Multichannel sound source localization based on the GCC-PHAT method is used to estimate the speaker position with four microphone arrays. At an SNR of 18 dB, the sound source localization rate was about 97% and the fine RMSE was below 0.23 m.
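
The abstract names GCC-PHAT as the basis of the sound source localization but gives no implementation details. The following is a minimal sketch of the general technique for a single microphone pair, not the authors' code. It assumes NumPy, and the function name `gcc_phat` and its parameters `max_tau` and `interp` are illustrative choices that do not come from the paper.

```python
import numpy as np


def gcc_phat(sig, ref, fs, max_tau=None, interp=1):
    """Estimate the delay of `sig` relative to `ref`, in seconds, with GCC-PHAT.

    Hypothetical helper for illustration; names and defaults are assumptions.
    """
    # Zero-pad to the combined length so the circular correlation does not wrap.
    n = sig.shape[0] + ref.shape[0]
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)

    # Cross-power spectrum with PHAT weighting: discard magnitude, keep phase.
    cross = SIG * np.conj(REF)
    cross /= np.abs(cross) + 1e-12

    # Back to the lag domain; interp > 1 upsamples the correlation.
    cc = np.fft.irfft(cross, n=interp * n)

    # Limit the search to physically plausible lags if max_tau is given.
    max_shift = interp * n // 2
    if max_tau is not None:
        max_shift = max(1, min(int(interp * fs * max_tau), max_shift))

    # Reorder so that index max_shift corresponds to zero lag.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(np.abs(cc))) - max_shift
    return shift / float(interp * fs)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    fs = 16000                                  # assumed sampling rate, Hz
    ref = rng.standard_normal(4096)             # synthetic reference channel
    delay = 12                                  # synthetic delay in samples
    sig = np.concatenate((np.zeros(delay), ref[:-delay]))
    tau = gcc_phat(sig, ref, fs, max_tau=0.002)
    print(f"estimated delay: {tau * 1e3:.3f} ms")
```

In a multi-array setup such as the one the abstract describes, a pairwise delay like this would typically be converted into a direction or position estimate using the known microphone geometry; that step depends on details the abstract does not give and is left out here.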

Bibliographic record

Source: Information, Communications and Signal Processing (ICICS) 2011 8th International Conference on | 2011 | pp. 1-5 | 5 pages
Conference location: Singapore (SG)
Authors: Ronzhin A.; Ronzhin A.; Budkov V.
Original format: PDF
Language: English (eng)
CLC classification: Communications
