首页> 外文会议>IEEE International Conference on Cybernetics >Multi-view video based tracking and audio-visual identification of persons in a human-computer-interaction scenario
【24h】

Multi-view video based tracking and audio-visual identification of persons in a human-computer-interaction scenario

机译:人机交互场景下基于多视角视频的人员跟踪和视听识别

获取原文

摘要

User identification and tracking are definitely the basic tasks in any human computer interaction (HCI) scenario. For these tasks we propose a multi-view approach utilizing multi-camera systems and audio processing systems. Face detectors and face recognizers are based on orientation histogram and eigenface techniques, and Mel Frequency Cepstral Coefficients (MFCC) are applied for speaker identification. In order to achieve a robust user identification and localization spatio-temporal classifier fusion methods have been integrated into the overall classifier system, support vector machines (SVM) and k nearest neighbor (kNN) models are used as base classifiers. A general office environment with up to six persons was the test bed for data collection and numerical evaluation.
机译:用户识别和跟踪绝对是任何人机交互(HCI)方案中的基本任务。对于这些任务,我们提出了一种利用多摄像机系统和音频处理系统的多视图方法。面部检测器和面部识别器基于方向直方图和特征脸技术,并且梅尔频率倒谱系数(MFCC)用于说话人识别。为了实现鲁棒的用户识别和定位,时空分类器融合方法已集成到整个分类器系统中,支持向量机(SVM)和k最近邻(kNN)模型用作基本分类器。最多可容纳六人的一般办公环境是数据收集和数值评估的试验台。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号