首页> 外文会议>IEEE Conference on Systems, Process Control >Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition
【24h】

Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition

机译:日语语音素的增强分类和视觉语音识别的分层加权判别

获取原文

摘要

For the purpose of automatic speech recognition and speech animation synthesis, speaker verification and so on, there have been studies on ‘viseme’. Viseme is a visually identifiable unit of utterance or the equivalent unit in the visual domain of the phoneme in audio domain. The classification and the discrimination method of visemes are still important topics. This paper focuses on the number of classification units and a discrimination procedure of Japanese visemes: We extend the number of visemes from 6 to 9 to expanse the word representation by their series, then propose the hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance the discriminative ability. In order to verify and discuss the availability of our proposals, visemes discrimination and word recognition experiments were conducted. From these results, the validity of the proposed methods was confirmed.
机译:为了自动语音识别和语音动画合成,说话者验证等目的,已经对“ viseme”进行了研究。 Viseme是语音的可视可识别单位,或者是音频域中音素的可视域中的等效单位。视位素的分类和判别方法仍然是重要的话题。本文着重于日语视位素的分类单位数量和判别程序:我们将视位素的数量从6个扩展到9个,以扩大单词表示的系列,然后提出使用多重判别分析(MDA)的分层加权判别方法增强判别能力。为了验证和讨论我们的建议的可用性,进行了视位识别和单词识别实验。从这些结果,证实了所提出方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号