首页> 外文会议>IEEE Conference on Systems, Process Control >Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition

【24h】

Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition

机译：日语语音素的增强分类和视觉语音识别的分层加权判别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

For the purpose of automatic speech recognition and speech animation synthesis, speaker verification and so on, there have been studies on ‘viseme’. Viseme is a visually identifiable unit of utterance or the equivalent unit in the visual domain of the phoneme in audio domain. The classification and the discrimination method of visemes are still important topics. This paper focuses on the number of classification units and a discrimination procedure of Japanese visemes: We extend the number of visemes from 6 to 9 to expanse the word representation by their series, then propose the hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance the discriminative ability. In order to verify and discuss the availability of our proposals, visemes discrimination and word recognition experiments were conducted. From these results, the validity of the proposed methods was confirmed.

机译：为了自动语音识别和语音动画合成，说话者验证等目的，已经对“ viseme”进行了研究。 Viseme是语音的可视可识别单位，或者是音频域中音素的可视域中的等效单位。视位素的分类和判别方法仍然是重要的话题。本文着重于日语视位素的分类单位数量和判别程序：我们将视位素的数量从6个扩展到9个，以扩大单词表示的系列，然后提出使用多重判别分析（MDA）的分层加权判别方法增强判别能力。为了验证和讨论我们的建议的可用性，进行了视位识别和单词识别实验。从这些结果，证实了所提出方法的有效性。

著录项

来源
《IEEE Conference on Systems, Process Control》|2013年|62-67|共6页
会议地点 Kuala Lumpur(MY)
作者
Okita Shinsuke; Mitsukura Yasue; Hamada Nozomu;
展开▼
作者单位

Department of System Design Engineering Keio University 3-14-1 Hiyoshi Yokohama 223-8522 Japan;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
image processing; pattern recognition; visemes; visual speech recognition;

机译：图像处理;模式识别;视位视觉语音识别;

相似文献

外文文献
中文文献
专利

1. About Neural-Network Algorithms Application in Viseme Classification Problem with Face Video in Audiovisual Speech Recognition Systems [J] . A. V. Savchenko, Ya. I. Khokhlova Optical memory & neural networks . 2014,第1期

机译：关于神经网络算法在视听语音识别系统中带有面部视频的Viseme分类问题中的应用
2. A hybrid approach for automatic lip localization and viseme classification to enhance visual speech recognition [J] . Walid Mahdi, Salah Werda, Abdelmajid Ben Hamadou Integrated Computer-Aided Engineering . 2008,第3期

机译：自动嘴唇定位和视位分类的混合方法，以增强视觉语音识别
3. Lip Localization and Viseme Classification for Visual Speech Recognition [J] . Salah Werda, Walid Mahdi, Abdelmajid Ben Hamadou International Journal of Computing and Information Sciences . 2007,第1期

机译：视觉语音识别的嘴唇定位和Viseme分类
4. Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition [C] . Okita Shinsuke, Mitsukura Yasue, Hamada Nozomu IEEE Conference on Systems, Process Control . 2013

机译：用于视觉语音识别的日本鼠标和分层加权歧视的增强分类
5. The effects of augmented visual feedback on the learning of non-native speech sounds: English speakers' acquisition of the Japanese flap . [D] . Levitt, June S. 2009

机译：增强视觉反馈对非母语语音学习的影响：英语使用者对日语襟翼的掌握。
6. R/DWD: distance-weighted discrimination for classification visualization and batch adjustment [O] . Hanwen Huang, Xiaosun Lu, Yufeng Liu, -1

机译：R / DWD：用于分类可视化和批量调整的距离加权判别
7. Accuracy increase for automatic visual Russian speech recognition: viseme classes optimization [O] . D.V. Ivanko, D.V. Fedotov, A.A. Karpov 2018

机译：自动视觉俄语语音识别的准确性增加：Viseme类优化

Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅