机译:基于稀疏的健壮说话人识别音频模型
School of Computer Science and Technology Harbin Institute of Technology, 92 West Dazhi Street Nan Gang District, Harbin, 150001, P. R. China;
School of Computer Science and Technology Harbin Institute of Technology, 92 West Dazhi Street Nan Gang District, Harbin, 150001, P. R. China;
School of Computer Science and Technology Harbin Institute of Technology, 92 West Dazhi Street Nan Gang District, Harbin, 150001, P. R. China;
School of Computer Science and Technology Harbin Institute of Technology, 92 West Dazhi Street Nan Gang District, Harbin, 150001, P. R. China;
sparse representation; selectivity gain; robust feature; speaker recognition;
机译:通过语音优化的说话人建模,可实现可靠的说话人识别
机译:基于张量结构的健壮说话人识别的听觉稀疏表示
机译:使用由MLLR转换生成的伪扬声器特征进行声学模型训练,以实现与扬声器无关的可靠语音识别
机译:健壮的说话人识别的模拟听觉感知模型
机译:语音和听觉建模对强大语音识别的协同作用。
机译:通过状态空间建模对来自演讲者环境中MEG的选择性听觉注意力进行可靠解码
机译:基于张量结构的健壮说话人识别的听觉稀疏表示
机译:强大的语音处理和识别:说话者ID,语言ID,语音识别/关键字识别,Diarization / Co-Channel /环境表征,说话者状态评估。