首页> 外文会议>International Conference on Image Processing and Its Applications >Feature-level data fusion for bimodal person recognition
【24h】

Feature-level data fusion for bimodal person recognition

机译:BimoDal人识别的特征级数据融合

获取原文

摘要

Consistently high person recognition accuracy is difficult to attain using a single recognition modality. This paper assesses the fusion of voice and outer lip-margin features for person identification. Feature fusion is investigated in the form of audio-visual feature vector concatenation, principal component analysis, and linear discriminant analysis. The paper shows that, under mismatched test and training conditions, audio-visual feature fusion is equivalent to an effective increase in the signal-to-noise ratio of the audio signal. Audio-visual feature vector concatenation is shown to be an effective method for feature combination, and linear discriminant analysis is shown to possess the capability of packing discriminating audio-visual information into fewer coefficients than principal component analysis. The paper reveals a high sensitivity of bimodal person identification to a mismatch between LDA or PCA feature-fusion module and speaker model training noise-conditions. Such a mismatch leads to worse identification accuracy than unimodal identification.
机译:始终如一的高人识别准确性难以使用单一识别方式获得。本文评估了人物识别的语音和外唇缘特征的融合。以音频视觉特征向量级联,主成分分析和线性判别分析的形式研究了特征融合。本文表明,在不匹配的测试和训练条件下,视听特征融合等同于音频信号的信噪比的有效增加。视听特征向量级联被示出为特征组合的有效方法,并且显示线性判别分析具有比主要成分分析相比将视听信息判别识别视听信息的能力。本文揭示了双峰人物识别对LDA或PCA特征融合模块和扬声器模型训练噪声条件不匹配的高度敏感性。这种不匹配导致比单峰识别更差的识别精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号