Fuzzy Audio-Visual Feature Maps for Speaker Identification

Abstract

Speech-based person recognition by machine has not reached the level of technological maturity required by some of its potential applications. The deficiencies revolve around sub-optimal pre-processing, feature extraction or selection, and classification, particularly under conditions of input data variability. The joint use of audible and visible manifestations of speech aims to alleviate these shortcomings, but the development of effective combination techniques is challenging. This paper proposes and evaluates a combination approach for speaker identification based on fuzzy modelling of acoustic and visual speaker characteristics. The proposed audio-visual model has been evaluated experimentally on a speaker identification task. The results show that the joint model outperforms its isolated components in terms of identification accuracy. In particular, the cross-modal coupling of audio-visual streams is shown to improve identification accuracy.


Bibliographic details

Source
International Conference on Recent Advances in Soft Computing, 12-13 December 2002, Nottingham (GB); 2002; pp. 317-322; 6 pages
Conference location: Nottingham (GB)
Author
Claude C. Chibelushi
Author affiliation

School of Computing, Staffordshire University, Beaconside, Stafford ST18 0DG, UK;

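Conference organizer
Original format: PDF
Language: English (eng)
Chinese Library Classification: Computing technology, computer technology
Keywords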

