首页> 外文会议>Image Processing: Algorithms and Systems IV >Current speaker detection system using lip motion information
【24h】

Current speaker detection system using lip motion information

机译:当前使用嘴唇运动信息的说话者检测系统

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose a system that detects the current speaker in multi-speaker videoconferencing by using lip motion. First, the system detects the face and lip region of each of the candidate speakers using face color and shape information. Then, to detect the current speaker, it calculates the change between the current frame and the previous frame in lip region. To close-up the detected current speaker, we used two CCD cameras. One is a general CCD camera, the other is a PTZ camera controlled by RS-232C serial port. The experimental result is the proposed system capable of detecting the face of current speaker in a video feed with more than three people, regardless of orientation of the faces. With this system, it only takes 4 to 5 seconds to zoom in on the speaker from the initial reference image. Also, it is a more efficient image transmission system for such things as video conferencing and internet broadcasting because it offers a close up face image at a resolution of 320x240, while at the same tune providing a whole background image.
机译:在本文中,我们提出了一种通过嘴唇运动检测多说话者视频会议中当前说话者的系统。首先,系统使用面部颜色和形状信息来检测每个候选发言人的面部和嘴唇区域。然后,要检测当前说话者,它会计算嘴唇区域中当前帧和前一帧之间的变化。为了关闭检测到的当前扬声器,我们使用了两个CCD摄像机。一个是普通的CCD摄像机,另一个是由RS-232C串行端口控制的PTZ摄像机。实验结果是所提出的系统能够检测具有三个以上人员的视频馈送中当前说话者的面部,而无需考虑面部的方向。使用此系统,只需4到5秒钟即可从初始参考图像放大扬声器。同样,它是用于视频会议和互联网广播等事物的更有效的图像传输系统,因为它以320x240的分辨率提供特写的人脸图像,而同时提供完整的背景图像。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号