AUDIOVISUAL-BASED ADAPTIVE SPEAKER IDENTIFICATION

机译：基于视听的自适应扬声器识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

An adaptive speaker identification system is presented in this paper, which aims to recognize speakers in feature films by exploiting both audio and visual cues. Specifically, the audio source is first analyzed to identify speakers using a likelihood-based approach. Meanwhile, the visual source is parsed to recognize talking faces using face detection/recognition and mouth tracking techniques. These two information sources are then integrated under a probabilistic framework for improved system performance. Moreover, to account for speakers' voice variations along time, we update their acoustic models on the fly by adapting to their newly contributed speech data. An average of 80% identification accuracy has been achieved on two test movies. This shows a promising future of the proposed audiovisual-based adaptive speaker identification approach.

机译：本文提出了一种自适应扬声器识别系统，其目的是通过利用音频和视觉线索来识别特征胶片中的扬声器。具体地，首先分析音频源以使用基于可能性的方法来识别扬声器。同时，解析视觉源以识别使用面部检测/识别和嘴巴跟踪技术识别谈话面。然后，这两个信息源在概率框架下集成了改进的系统性能。此外，要考虑沿时间的发言者的语音变化，我们通过适应新贡献的语音数据来更新他们的声学模型。平均在两个测试电影上实现了80％的识别准确性。这表明了建议基于视听的自适应扬声器识别方法的有希望的未来。

著录项

来源
《International Conference on Multimedia and Expo》|2003年||共4页
会议地点
作者
Ying Li; Shrikanth Narayanan; C.-C. Jay Kuo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP37-53;
关键词

相似文献

外文文献
中文文献
专利

1. Speaker identification by combining speaker specific GMM with speaker adapted syllable-based HMM [J] . Seiichi Nakagawa, Wei Zhang 電子情報通信学会技術研究報告. 音声. Speech . 2003,第94期

机译：通过将特定于说话人的GMM与基于说话人的基于音节的HMM相结合来进行说话人识别
2. Speaker identification by combining speaker specific GMM with speaker adapted syllable-based HMM [J] . Seiichi Nakagawa, Wei Zhang 電子情報通信学会技術研究報告. 音声. Speech . 2003,第94期

机译：演讲者识别通过将扬声器特定的GMM与扬声器适应的基于音节的HMM组合
3. An Adaptively Enhanced Auditory Transform Based Feature Extraction Algorithm for Robust Speaker Identification [J] . S.D. Umarani, R.S.D. Wahidabanu, P. Raviram International journal of soft computing . 2013,第1期

机译：基于自适应增强听觉变换的特征提取算法
4. AUDIOVISUAL-BASED ADAPTIVE SPEAKER IDENTIFICATION [C] . Ying Li, Shrikanth Narayanan, C.-C. Jay Kuo International Conference on Multimedia and Expo . 2003

机译：基于视听的自适应扬声器识别
5. Statistical recursive estimation algorithms for speaker adaption. [D] . Wang, Shaojun. 2001

机译：用于说话人适应的统计递归估计算法。
6. Decoding the Attended Speaker From EEG Using Adaptive Evaluation Intervals Captures Fluctuations in Attentional Listening [O] . Manuela Jaeger, Bojana Mirkovic, Martin G. Bleichner, 2020

机译：使用自适应评估间隔解码来自EEG的扬声器捕获注意力倾听的波动
7. AUDIOVISUAL-BASED ADAPTIVE SPEAKER IDENTIFICATION [O] . Ying Li, Shrikanth Narayanan, C. -c. Jay Kuo 2009

机译：基于音频的自适应扬声器识别

AUDIOVISUAL-BASED ADAPTIVE SPEAKER IDENTIFICATION

摘要

著录项

相似文献

相关主题

期刊订阅