An information fusion approach to recognizing microphone array speech in the CHiME-3 challenge based on a deep learning framework

Abstract

We present an information fusion approach to robust recognition of microphone array speech for the recently launched 3rd CHiME Challenge. It is based on a deep learning framework with a large neural network consisting of subnets with different architectures. Multiple knowledge sources are integrated via an early fusion of normalized noisy features with different beamforming techniques, speech enhanced features, speaker related features, and other auxiliary features concatenated as the input to each subnet, and a late fusion by combining the outputs of all subnets to produce one single output set. Our experiments demonstrate that all information sources are complementary in our proposed framework. Our best system achieves an average word error rate reduction of 68% from the officially released baseline results on the test set of real data.
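
The two fusion stages described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the feature values, the two stand-in subnets (`subnet_a`, `subnet_b`), and the choice of a weighted average as the late-fusion rule are all assumptions made for illustration. The abstract only states that the subnet outputs are combined into a single output set.

```python
import math


def early_fusion(*feature_streams):
    """Concatenate per-frame feature vectors from several sources into one input vector."""
    fused = []
    for stream in feature_streams:
        fused.extend(stream)
    return fused


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def late_fusion(subnet_posteriors, weights=None):
    """Combine subnet outputs by a weighted average (assumed rule; uniform by default)."""
    count = len(subnet_posteriors)
    if weights is None:
        weights = [1.0 / count] * count
    dim = len(subnet_posteriors[0])
    return [
        sum(w * post[k] for w, post in zip(weights, subnet_posteriors))
        for k in range(dim)
    ]


# Hypothetical stand-ins for subnets with different architectures.
# Each maps the same fused input to posteriors over three toy classes.
def subnet_a(x):
    s = sum(x)
    return softmax([s, 0.0, -s])


def subnet_b(x):
    return softmax([x[0], x[1], x[2]])


# Toy per-frame features (made-up values) for one frame.
beamformed_features = [0.2, -1.0]   # e.g. normalized features after beamforming
enhanced_features = [0.5]           # e.g. speech-enhanced features
speaker_features = [1.0, 0.0]       # e.g. speaker-related features

# Early fusion: every subnet receives the same concatenated input.
fused_input = early_fusion(beamformed_features, enhanced_features, speaker_features)

# Late fusion: the subnet outputs are merged into one output set.
combined = late_fusion([subnet_a(fused_input), subnet_b(fused_input)])
```

Because each subnet emits a probability distribution and the default weights sum to one, the combined output is again a distribution over the same classes.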

Bibliographic record

Source
IEEE Workshop on Automatic Speech Recognition and Understanding, 2015, 6 pages
Authors
Jun Du; Qing Wang; Yan-Hui Tu; Xiao Bao; Li-Rong Dai; Chin-Hui Lee
Original format: PDF
Chinese Library Classification: Electroacoustic technology and speech signal processing
Keywords
CHiME Challenge; deep learning; information fusion; microphone array; robust speech recognition

Similar literature

1. An information fusion framework with multi-channel feature concatenation and multi-perspective system combination for the deep-learning-based robust recognition of microphone array speech [J]. Tu Yanhui, Du Jun, Wang Qing. Computer Speech and Language. 2017
2. Speech-recognizer-based filter optimization for microphone array processing [J]. Seltzer M.L., Raj B. IEEE Signal Processing Letters. 2003, no. 3
3. An information fusion approach to recognizing microphone array speech in the CHiME-3 challenge based on a deep learning framework [C]. Jun Du, Qing Wang, Yan-Hui Tu. IEEE Workshop on Automatic Speech Recognition and Understanding. 2015
4. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition [D]. Zhang, Xianxian. 2005
5. A Real-Time Speech Separation Method Based on Camera and Microphone Array Sensors Fusion Approach [O]. Ching-Feng Liu, Wei-Siang Ciou, Peng-Ting Chen. 2020
6. Speech recognizer-based microphone array processing for robust hands-free speech recognition [O]. Michael L. Seltzer, Bhiksha Raj, Richard M. Stern. 2002
