Audio, Video and Multimodal Person Identification in a Smart Room

Abstract

In this paper, we address the modality integration problem using the example of a smart room environment, with the aim of enabling person identification by combining acoustic features and 2D face images. First, we introduce the monomodal audio and video identification techniques, and then we present the use of combined input speech and face images for person identification. The two sensory modalities, speech and faces, are processed both individually and jointly. It is shown that the multimodal approach improves performance in identifying the participants.


Bibliographic record

Source
Multimodal Technologies for Perception of Humans; Lecture Notes in Computer Science; 4122 | 2006 | pp. 258-269 | 12 pages
Conference location: Southampton (GB)
Authors
J. Luque; R. Morros; A. Garde; J. Anguita; M. Farrus; D. Macho; F. Marques; C. Martinez; V. Vilaplana; J. Hernando
Affiliation
Universitat Politecnica de Catalunya, Jordi Girona 1-3, Campus Nord, Edifici D5, 08034 Barcelona, Spain
Original format: PDF
Language: English
Chinese Library Classification: Artificial intelligence theory

Similar literature

1. Context-based person identification framework for smart video surveillance [J]. Liyan Zhang, Dmitri V. Kalashnikov, Sharad Mehrotra. Machine Vision and Applications, 2014, No. 7.
2. A Multimodal Saliency Model for Videos With High Audio-Visual Correspondence [J]. Min Xiongkuo, Zhai Guangtao, Zhou Jiantao. IEEE Transactions on Image Processing, 2020.
3. Multimodal framework based on audio-visual features for summarisation of cricket videos [J]. Javed Ali, Irtaza Aun, Malik Hafiz. Image Processing, IET, 2019, No. 4.
4. 3D face reconstruction and multimodal person identification from video captured using smartphone camera [C]. Raghavendra R., Raja Kiran B., Pflug Anika. 2013 IEEE International Conference on Technologies for Homeland Security, 2013.
5. Multimodal Sensing and Data Processing for Speaker and Emotion Recognition Using Deep Learning Models with Audio, Video and Biomedical Sensors [D]. Abtahi, Farnaz. 2018.
6. Lights Camera…Citizen Science: Assessing the Effectiveness of Smartphone-Based Video Training in Invasive Plant Identification [O]. Jared Starr, Charles M. Schweik, Nathan Bush.
7. Audio, Video and Multimodal Person Identification in a Smart Room [O]. J. Luque, R. Morros, A. Garde. 2008.
