Korean speech recognition using phonemics for lip-sync animation

机译：使用音素进行口型同步动画的韩语语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A speaker dependent voice recognition algorithm has been developed for producing an autonomic natural animating of the character' s mouth shape for small and medium sized animation productions or e-learning contents productions. Since the basic technique for recognizing Korean speech has been based on research results of other languages such as English and Japanese, it should check once at least or a margin for applying the Korean vocal sound system. One of reason is that Korean phonemes always have a same phonetic value. However, the scope of this study is the recognition of single vowels for a digital contents producing, particularly lip sync animation, since the lip sync producing generally requires lots of tedious hand work of animators and it seriously affects the animation producing cost and development period to get a high quality of lip animation. In this research, a real time processed automatic lip sync algorithm for virtual characters as the animation key in digital contents is studied by considering Korean vocal sound system. The proposed algorithm contributes to produce a natural condonable lip animation with the lower producing cost and the shorter development period. The recognition process consists of speech signal as the input, filtering, Fast Fourier Transform and identification. The result shows the proposed speaker dependent single vowel recognition system is able to distinguish Korean single vowels from dialogue of a dubbing artist with real-time. The average of the recognition ratio was 97.3% in the laboratory environment.

机译：已经开发了与说话者相关的语音识别算法，用于为中小型动画制作或电子学习内容制作生成角色嘴形的自主自然动画。由于识别朝鲜语语音的基本技术是基于其他语言（例如英语和日语）的研究结果，因此它至少应检查一次或不适用朝鲜语声音系统。原因之一是韩语音素始终具有相同的语音价值。但是，本研究的范围是对数字内容制作（尤其是口型同步动画）的单个元音的识别，因为口型同步生产通常需要大量繁琐的动画师手工工作，并且严重影响了动画制作成本和开发周期。获得高质量的嘴唇动画。在这项研究中，通过考虑韩国人的声音系统，研究了一种实时处理的自动嘴唇同步算法，该算法以虚拟角色为数字内容中的动画关键。所提出的算法有助于以较低的生产成本和较短的开发周期产生自然的可容许的唇部动画。识别过程包括语音信号作为输入，滤波，快速傅立叶变换和识别。结果表明，所提出的依赖于说话人的单元音识别系统能够实时区分配音艺术家的对话中的韩国单元音。在实验室环境中，识别率的平均值为97.3％。

著录项

来源
《International Conference on Information Science, Electronics and Electrical Engineering》|2014年|1011-1014|共4页
会议地点
作者
Hwang Sun-Min; Song Bok-Hee; Yun Han-Kyung;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Animation; Feature extraction; Real-time systems; Shape; Speech; Speech recognition; Synchronization; Korean phoneme; phonemics; speaker dependent; voice recognition;

机译：动画;特征提取;实时系统;形状;语音;语音识别;同步;韩语音素;音素;与说话者相关;语音识别;

相似文献

外文文献
中文文献
专利

1. Real-Time Continuous Phoneme Recognition System Using Class-Dependent Tied-Mixture HMM With HBT Structure for Speech-Driven Lip-Sync [J] . Park J., Ko H. IEEE transactions on multimedia . 2008,第7期

机译：实时连续音素识别系统，该系统使用基于类的具有HBT结构的绑定混合HMM进行语音驱动的口型同步
2. Psychometric functions for shortened administrations of a speech recognition approach using Tri-Word presentations and phonemic scoring [J] . GelfandS.A., GelfandJ.T. Journal of speech, language, and hearing research: JSLHR . 2012,第3期

机译：心理测量功能可简化使用Tri-Word演示和音位评分的语音识别方法的管理
3. Tri-word presentations with phonemic scoring for practical high-reliability speech recognition assessment. [J] . Gelfand SA Journal of speech, language, and hearing research: JSLHR . 2003,第2期

机译：具有语音评分的三字演示，实现实用高可靠性语音识别评估。
4. Korean speech recognition using phonemics for lip-sync animation [C] . Hwang Sun-Min, Song Bok-Hee, Yun Han-Kyung International Conference on Information Science, Electronics and Electrical Engineering . 2014

机译：韩国语音识别使用唇部同步动画的音素
5. A laminar cortical model of conscious speech perception: Phonemic restoration and speech category learning [D] . Kazerounian, Sohrob 2012

机译：意识语音感知的层状皮层模型：音素恢复和语音类别学习
6. Automatic Classification of the Korean Triage Acuity Scale in Simulated Emergency Rooms Using Speech Recognition and Natural Language Processing: a Proof of Concept Study [O] . Dongkyun Kim, Jaehoon Oh, Heeju Im, 2021

机译：使用语音识别和自然语言处理的模拟急诊室中韩国分流刻度的自动分类：概念研究证明
7. A Study on Phonemic Analysis for the Recognition of Korean Speech [O] . Jeong Young Song, Min Wook Kil, Il Seok Ko 2007

机译：韩国言论认可的音素分析研究

Korean speech recognition using phonemics for lip-sync animation

摘要

著录项

相似文献

相关主题

期刊订阅