Conference: International Conference on Information Science, Electronics and Electrical Engineering

Korean speech recognition using phonemics for lip-sync animation

Abstract

A speaker-dependent voice recognition algorithm has been developed to automatically produce natural animation of a character's mouth shape for small and medium-sized animation productions and e-learning content productions. Because the basic techniques for recognizing Korean speech build on research results from other languages such as English and Japanese, they should be verified at least once, or given some margin, before being applied to the Korean vocal sound system. One reason is that Korean phonemes always have the same phonetic value. The scope of this study, however, is limited to the recognition of single vowels for digital content production, particularly lip-sync animation. Lip-sync production generally requires a great deal of tedious manual work by animators, and achieving high-quality lip animation seriously affects production cost and development time. In this research, a real-time automatic lip-sync algorithm for virtual characters, serving as the animation key in digital contents, is studied with the Korean vocal sound system taken into account. The proposed algorithm helps produce natural, acceptable lip animation at a lower production cost and with a shorter development period. The recognition process consists of speech signal input, filtering, a Fast Fourier Transform, and identification. The results show that the proposed speaker-dependent single-vowel recognition system is able to distinguish Korean single vowels in the dialogue of a dubbing artist in real time. The average recognition rate was 97.3% in a laboratory environment.
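The four stages named in the abstract (speech input, filtering, Fast Fourier Transform, identification) can be illustrated with a minimal sketch. The abstract does not specify the filter, the features, or the classifier, so everything below is an assumption made for illustration: a first-order pre-emphasis filter, normalised band energies as features, nearest-template matching against per-speaker enrolment data, and synthetic two-tone signals standing in for recorded vowels. The function names, the 8 kHz sample rate, and the tone frequencies are hypothetical and are not taken from the paper.

```python
import cmath
import math


def pre_emphasis(signal, alpha=0.97):
    """Assumed filtering stage: y[n] = x[n] - alpha * x[n-1]."""
    return [signal[0]] + [signal[n] - alpha * signal[n - 1]
                          for n in range(1, len(signal))]


def fft(x):
    """Recursive radix-2 Cooley-Tukey FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def band_energies(frame, n_bands=8):
    """Hann-window a frame, take its FFT, return normalised band energies."""
    n = len(frame)
    windowed = [s * (0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)))
                for i, s in enumerate(frame)]
    power = [abs(c) ** 2 for c in fft(windowed)[: n // 2]]
    width = len(power) // n_bands
    bands = [sum(power[b * width:(b + 1) * width]) for b in range(n_bands)]
    total = sum(bands) or 1.0  # avoid division by zero on silent frames
    return [b / total for b in bands]


def identify(frame, templates):
    """Assumed identification stage: nearest template by squared distance."""
    feats = band_energies(pre_emphasis(frame))
    return min(templates,
               key=lambda label: sum((f - t) ** 2
                                     for f, t in zip(feats, templates[label])))


def synth(freqs, amp=1.0, phase=0.0, n=512, rate=8000):
    """Hypothetical stand-in for a recorded vowel frame: a sum of sine tones."""
    return [amp * sum(math.sin(2 * math.pi * f * i / rate + phase)
                      for f in freqs)
            for i in range(n)]


if __name__ == "__main__":
    # Speaker-dependent enrolment: one template per vowel label.
    # The tone pairs are arbitrary placeholders, not measured Korean formants.
    templates = {
        "a": band_energies(pre_emphasis(synth([700, 1100]))),
        "i": band_energies(pre_emphasis(synth([300, 2300]))),
    }
    quieter_a = synth([700, 1100], amp=0.5, phase=0.3)
    print(identify(quieter_a, templates))
```

Normalising the band energies makes the features independent of input loudness, which is one plausible way a speaker-dependent system could tolerate level changes between enrolment and dubbing sessions. A real-time system would run `identify` on successive short frames and map each returned label to a mouth-shape key for the character.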
