Source: Multimodal Technologies for Perception of Humans; Lecture Notes in Computer Science; 4122

Audio, Video and Multimodal Person Identification in a Smart Room

Abstract

In this paper, we address modality integration using the example of a smart room environment, with the aim of enabling person identification by combining acoustic features and 2D face images. First, we introduce the monomodal audio and video identification techniques; we then present the use of combined speech and face-image input for person identification. The two sensory modalities, speech and faces, are processed both individually and jointly. The multimodal approach is shown to improve performance in identifying the participants.
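
The abstract does not say how the audio and video results are combined. As an illustration only, the sketch below shows one common option, weighted score-level fusion: each modality produces a score per enrolled identity, the scores are min-max normalized, and a weighted sum selects the identity. The function names, the weight, and the example scores are all hypothetical and are not taken from the paper.

```python
# Hypothetical sketch of weighted score-level fusion for person identification.
# This is NOT the paper's method; names, weights, and scores are assumptions.

def min_max_normalize(scores):
    """Rescale a dict of identity -> raw score to the range [0, 1]."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores equal: the modality carries no discriminating information.
        return {name: 0.0 for name in scores}
    return {name: (s - lo) / (hi - lo) for name, s in scores.items()}


def fuse_scores(audio_scores, video_scores, audio_weight=0.5):
    """Combine normalized per-identity scores with a weighted sum."""
    audio = min_max_normalize(audio_scores)
    video = min_max_normalize(video_scores)
    return {
        name: audio_weight * audio[name] + (1.0 - audio_weight) * video[name]
        for name in audio
    }


def identify(audio_scores, video_scores, audio_weight=0.5):
    """Return the identity with the highest fused score."""
    fused = fuse_scores(audio_scores, video_scores, audio_weight)
    return max(fused, key=fused.get)


if __name__ == "__main__":
    # Made-up scores: the audio classifier alone would pick "bob",
    # the video classifier alone would pick "alice".
    audio = {"alice": 2.0, "bob": 3.0, "carol": 1.0}
    video = {"alice": 9.0, "bob": 2.0, "carol": 1.0}
    print(identify(audio, video))
```

In this made-up case the two modalities disagree, and the fused score decides between them. In practice the weight would be tuned on held-out data, and the fusion rule itself is a design choice.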