机译:基于音频事件和主题模型的音频场景识别
Shandong Normal Univ, Inst Biomed Sci, Sch Phys & Elect, Shandong Prov Key Lab Med Phys & Image Proc Techn, Jinan 250014, Peoples R China;
Shandong Normal Univ, Inst Biomed Sci, Sch Phys & Elect, Shandong Prov Key Lab Med Phys & Image Proc Techn, Jinan 250014, Peoples R China;
Nanchang Hangkong Univ, Sch Informat, Nanchang 330063, Jiangxi, Peoples R China;
Shandong Coll Elect Technol, Dept Comp Sci & Technol, Jinan 250014, Peoples R China;
Shandong Normal Univ, Inst Biomed Sci, Sch Phys & Elect, Shandong Prov Key Lab Med Phys & Image Proc Techn, Jinan 250014, Peoples R China;
Shandong Normal Univ, Inst Biomed Sci, Sch Phys & Elect, Shandong Prov Key Lab Med Phys & Image Proc Techn, Jinan 250014, Peoples R China;
Univ Jinan, Shandong Prov Key Lab Network Based Intelligent C, Sch Informat Sci & Engn, Jinan 250014, Peoples R China;
Shandong Normal Univ, Inst Biomed Sci, Sch Phys & Elect, Shandong Prov Key Lab Med Phys & Image Proc Techn, Jinan 250014, Peoples R China;
Audio scene recognition; Audio event; Topic model; PLSA; LDA; Support vector machine;
机译:用于环境音频场景和声音事件识别的混合框架中的生成模型驱动表示学习
机译:基于上下文的环境音频事件识别,可用于场景理解
机译:实现电影情感场景分类的情感视听词与潜在主题驱动模型
机译:音频事件和场景识别:使用强标签和弱标签数据的统一方法
机译:基于麦克风阵列,视听和帧选择的强大语音处理功能,可实现车载语音识别和内置说话人识别。
机译:Meta-Analyzes支持人类大脑中不同类别的视听相互作用事件的陈述的分类模型
机译:音频事件和场景识别:一种强有力的和谐的统一方法 弱标签数据
机译:用于音频场景分析的基于视觉的音频纹理分割。