机译:通过多通道直方图均衡化实现环境鲁棒的语音和说话人识别
MediaLabs, Department of Information Engineering, Universita Politecnica delle Marche, Via Brecce Bianche 1. 60131, Ancona, Italy;
MediaLabs, Department of Information Engineering, Universita Politecnica delle Marche, Via Brecce Bianche 1. 60131, Ancona, Italy;
MediaLabs, Department of Information Engineering, Universita Politecnica delle Marche, Via Brecce Bianche 1. 60131, Ancona, Italy;
MediaLabs, Department of Information Engineering, Universita Politecnica delle Marche, Via Brecce Bianche 1. 60131, Ancona, Italy;
multi-channel audio processing; feature statistics normalization; histogram equalization; speech recognition; speaker recognition;
机译:语音特征语境统计的直方图均衡化,可增强语音识别能力
机译:语音表示的直方图均衡化,可增强语音识别能力
机译:贝叶斯估计噪声鲁棒语音识别的直方图均衡
机译:改进的基于直方图的特征补偿,可实现可靠的语音识别和无监督的说话人适应
机译:具有有限学习数据的自动语音识别中的环境和说话者鲁棒性。
机译:识别消息和使者:仿生频谱分析可增强语音和说话者识别能力
机译:语音表示的直方图均衡化,可增强语音识别能力
机译:强大的语音处理和识别:说话者ID,语言ID,语音识别/关键字识别,Diarization / Co-Channel /环境表征,说话者状态评估。