首页>
外国专利>
Using speaker clustering to switch between different camera views in a video conference system
Using speaker clustering to switch between different camera views in a video conference system
展开▼
机译:使用扬声器群集在视频会议系统中的不同摄像机视图之间切换
展开▼
页面导航
摘要
著录项
相似文献
摘要
A video conference endpoint includes one or more cameras to capture video of different views and a microphone array to sense audio. One or more closeup views are defined. The endpoint detects faces in the captured video and active audio sources from the sensed audio. The endpoint detects any active talker having detected face positions that coincide with detected active audio sources, and also uses speaker clustering to detect whether any active talker is associated with a previously stored closeup views. Based on whether an active talker is detected in any of the stored closeup views, the endpoint switches between capturing video of one of the closeup views and a best overview of the participants in the conference room.
展开▼