Audiovisual speaker localization in medium smart meeting room

机译：中等智能会议室的视听演讲人定位

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The issue of automatic selection of the current active speaker among more than thirty participants located in the medium-sized meeting room is considered. Techniques of video tracking and sound source localization are implemented for recording AVI files of speaker remarks in the developed smart meeting room. Video processing of streams from five cameras serves for registration of participants in fixed chair positions, tracking main speaker based on histogram comparison and AdaBoosted cascade classifier for face detection. Multichannel sound source localization based on GCC-PHAT method is used for estimation of the speaker position by four microphone arrays. In the 18dB SNR case the sound source localization rate was about 97% and fine RMSE was lower 0.23 m.

机译：考虑了位于中等大型会议室的30多个参与者中当前活跃扬声器自动选择的问题。视频跟踪和声源定位的技术用于在开发的智能会议室中录制扬声器备注的AVI文件。从五个摄像机的流式流式处理用于固定椅子位置的参与者的登记，基于直方图比较和Adaboosted级联分类器进行脸部检测的主扬声器。基于GCC-PHAT方法的多通道声源定位用于通过四个麦克风阵列估计扬声器位置。在18dB的SNR情况下，声源定位速率约为97％，细微的RMSE为0.23米。

著录项

来源
《International Conference on Information, Communications and Signal Processing》|2011年||共5页
会议地点
作者
Ronzhin A.; Budkov V.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN91-53;
关键词

相似文献

外文文献
中文文献
专利

1. Audiovisual Probabilistic Tracking of Multiple Speakers in Meetings [J] . Gatica-Perez D., Lathoud G., Odobez J.-M., IEEE transactions on audio, speech and language processing . 2007,第2期

机译：会议中多个发言人的视听概率跟踪
2. Voice activity detection and speaker localization using audiovisual cues [J] . Dante A. Blauth, Vicente P. Minotto, Claudio R. Jung, Pattern recognition letters . 2012,第4期

机译：使用视听提示进行语音活动检测和说话人定位
3. Audiovisual Localization of Multiple Speakers in a Video Teleconferencing Setting [J] . Bill Kapralos, Michael R. M. Jenkin, Evangelos Milios International journal of imaging systems and technology . 2003,第1期

机译：视频电话会议设置中多个发言人的视听本地化
4. Audiovisual speaker localization in medium smart meeting room [C] . Ronzhin A., Ronzhin A., Budkov V. Information, Communications and Signal Processing (ICICS) 2011 8th International Conference on . 2011

机译：中型智能会议室中的视听演讲者本地化
5. Probabilistic correspondence mapping for audiovisual speaker modeling [D] . Liu, Ming 2007

机译：视听说话人建模的概率对应映射
6. Audiovisual perceptual learning with multiple speakers [O] . Aaron D. Mitchel, Chip Gerfen, Daniel J. Weiss -1

机译：多个说话人的视听感知学习
7. Audiovisual probabilistic tracking of multiple speakers in meetings [O] . Daniel Gatica-perez, Jean-marc Odobez, Guillaume Lathoud, 2007

机译：会议中多个发言者的视听概率跟踪
8. Small and Medium Power Reactors 1985. Report of Two Meetings on Small and Medium Power Reactors: A Scientific Afternoon Held on 25 September 1985 and Technical Committee Meeting Held on 26 September 1985. Both Meetings Were Held in Vienna During the General Conference of the IAEA (International Atomic Energy Agency) [R] . 1986

机译：中小型电力反应堆1985.两次中小型电力反应堆会议的报告：1985年9月25日举行的科学下午会议和1985年9月26日举行的技术委员会会议。两次会议在原子能机构大会期间在维也纳举行（国际原子能机构）

Audiovisual speaker localization in medium smart meeting room

摘要

著录项

相似文献

相关主题

期刊订阅