首页> 外文会议>Iberoamerican congress on pattern recognition >Class Confusability Reduction in Audio-Visual Speech Recognition Using Random Forests
【24h】

Class Confusability Reduction in Audio-Visual Speech Recognition Using Random Forests

机译:使用随机森林的视听语音识别中的类混淆性降低

获取原文

摘要

This paper presents an audio-visual speech classification system based on Random Forests classifiers, aiming to reduce the intra-class misclassification problems, which is a very usual situation, specially in speech recognition tasks. A novel training procedure is proposed, introducing the concept of Complementary Random Forests (CRF) classifiers. Experimental results over three audio-visual databases, show that a good performance is achieved with the proposed system for the different types of input information considered, viz., audio-only information, video-only information and fused audio-video information. In addition, these results also indicate that the proposed method performs satisfactorily over the three databases using the same configuration parameters.
机译:本文提出了一种基于随机森林分类器的视听语音分类系统,旨在减少类内误分类问题,这是一种很常见的情况,特别是在语音识别任务中。提出了一种新颖的训练程序,引入了互补随机森林(CRF)分类器的概念。在三个视听数据库上的实验结果表明,对于所考虑的不同类型的输入信息(即纯音频信息,纯视频信息和融合的音频视频信息),该系统可实现良好的性能。此外,这些结果还表明,使用相同的配置参数,该方法在三个数据库上的性能令人满意。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号