Using speaker clustering to switch between different camera views in a video conference system

Abstract

A video conference endpoint includes one or more cameras to capture video of different views and a microphone array to sense audio. One or more closeup views are defined. The endpoint detects faces in the captured video and active audio sources from the sensed audio. The endpoint detects any active talker whose detected face position coincides with a detected active audio source, and also uses speaker clustering to detect whether any active talker is associated with a previously stored closeup view. Based on whether an active talker is detected in any of the stored closeup views, the endpoint switches between capturing video of one of the closeup views and a best overview of the participants in the conference room.
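The switching logic described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `CloseupView` class, the representation of positions as azimuth angles in degrees, the 5-degree matching tolerance, the integer speaker-cluster IDs, and the `"overview"` label are all assumptions introduced here for illustration. Face detection, audio source localization, and the speaker clustering itself are assumed to happen upstream and are passed in as plain values.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class CloseupView:
    """A stored closeup view (hypothetical representation)."""
    name: str
    azimuth_range: Tuple[float, float]     # assumed: view covers [low, high] degrees
    speaker_cluster: Optional[int] = None  # assumed: cluster ID stored with the view


def face_matching_audio(face_azimuths: List[float],
                        audio_azimuth: float,
                        tolerance_deg: float = 5.0) -> Optional[float]:
    """Return the first face position that coincides with the audio source."""
    for azimuth in face_azimuths:
        if abs(azimuth - audio_azimuth) <= tolerance_deg:
            return azimuth
    return None


def select_view(views: List[CloseupView],
                face_azimuths: List[float],
                audio_azimuth: Optional[float],
                talker_cluster: Optional[int]) -> str:
    """Pick a closeup view for the active talker, else fall back to the overview."""
    # 1. Position-based detection: a face that coincides with the active audio source.
    if audio_azimuth is not None:
        talker_azimuth = face_matching_audio(face_azimuths, audio_azimuth)
        if talker_azimuth is not None:
            for view in views:
                low, high = view.azimuth_range
                if low <= talker_azimuth <= high:
                    return view.name
    # 2. Speaker clustering: the talker's voice matches a cluster stored with a view.
    if talker_cluster is not None:
        for view in views:
            if view.speaker_cluster == talker_cluster:
                return view.name
    # 3. No active talker in any stored closeup view: show the overview.
    return "overview"


views = [
    CloseupView("left", (-40.0, -10.0), speaker_cluster=0),
    CloseupView("right", (10.0, 40.0), speaker_cluster=1),
]
chosen = select_view(views, face_azimuths=[-20.0, 25.0],
                     audio_azimuth=24.0, talker_cluster=None)
```

Checking face and audio positions first and falling back to the voice cluster is one possible ordering; the abstract says only that both cues are used, not how they are prioritized.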

Bibliographic details

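  • Publication number: US9633270B1
  • Patent type: (not listed)
  • Publication date: 2017-04-25
  • Original format: PDF
  • Applicant/assignee: CISCO TECHNOLOGY INC.
  • Application number: US201615091056
  • Filing date: 2016-04-05
  • Classification: H04N7/15; G06K9; H04N5/232; G10L17; G10L17/10
  • Country: US
  • Record added: 2022-08-21 13:44:07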
