首页> 外文会议>Annual conference of the International Speech Communication Association >Speech Pattern Discovery using Audio-Visual Fusion and Canonical Correlation Analysis
【24h】

Speech Pattern Discovery using Audio-Visual Fusion and Canonical Correlation Analysis

机译:使用视听融合和典范相关分析的语音模式发现

获取原文

摘要

In this paper, we address the problem of automatic discovery of speech patterns using audio-visual information fusion. Unlike those previous studies based on single audio modality, our work not only uses the acoustic information, but also takes into account the visual features extracted from the mouth region. To improve the effectiveness of the use of multimodal information, several audio-visual fusion strategies, including feature concatenation, similarity weighting and decision fusion, are utilized. Specifically, our decision fusion approach retains the reliable patterns discovered in the audio and visual modalities. Moreover, we use canonical correlation analysis (CCA) to address the issue of temporal asynchrony between audio and visual speech modalities and unbounded dynamic time warping (UDTW) is adopted to search for the speech patterns through audio and visual similarity matrices calculated on the aligned audio and visual sequence. Experiments on an audio-visual corpus show that, for the first time, speech pattern discovery can be improved by the use of visual information. The decision fusion approach shows superior performance compared with standard feature concatenation and similarity weighting. CCA-based audio-visual synchronization plays an important role in the performance improvement.
机译:在本文中,我们解决了使用视听信息融合自动发现语音模式的问题。与以前基于单个音频模态的研究不同,我们的工作不仅使用声学信息,还考虑了从嘴巴区域提取的视觉特征。为了提高使用多模式信息的有效性,利用了几种视听融合策略,包括特征串联,相似性加权和决策融合。具体来说,我们的决策融合方法保留了在音频和视频模式中发现的可靠模式。此外,我们使用规范相关分析(CCA)来解决音频和视觉语音模态之间的时间异步问题,并采用无界动态时间规整(UDTW)通过在对齐音频上计算出的音频和视觉相似性矩阵来搜索语音模式和视觉顺序。视听语料库的实验表明,语音模式的发现首次可以通过使用视觉信息来改善。与标准特征串联和相似度加权相比,决策融合方法显示出更高的性能。基于CCA的视听同步在性能提高中起着重要作用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号