International Conference on Computational Intelligence and Communication Networks

Performance Analysis of Lip Synchronization Using LPC, MFCC and PLP Speech Parameters

机译:LPC,MFCC和PLP语音参数唇部同步性能分析


Abstract

Many multimedia applications and entertainment-industry products, such as games, cartoons, and film dubbing, require speech-driven face animation and audio-video synchronization. An Automatic Speech Recognition (ASR) system alone does not give good results in noisy environments. An Audio-Visual Speech Recognition system plays a vital role in such harsh environments because it uses both audio and visual information. In this paper, we propose a novel approach with improved performance over the traditional methods reported so far. Our algorithm combines acoustic and visual parameters to achieve better results. We have tested our system on English using the LPC, MFCC, and PLP parameters of the speech signal. Lip parameters such as lip width and lip height are extracted from the video, and both the acoustic and visual parameters are used to train systems such as an Artificial Neural Network (ANN), Vector Quantization (VQ), Dynamic Time Warping (DTW), and a Support Vector Machine (SVM). In our work we have employed a neural network with the LPC, MFCC, and PLP parameters. Results show that our system responds very well to the tested vowels.
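As a rough sketch of the audio-visual pipeline the abstract outlines (not the authors' implementation), the example below pairs per-utterance MFCC features with hypothetical lip width/height measurements and trains a small neural-network vowel classifier; LPC and PLP features would be substituted in the same way. The file paths, the landmark dictionary, and the `utterances` list are placeholder assumptions, and librosa plus scikit-learn stand in for whatever toolchain the paper actually used.

```python
# Minimal audio-visual fusion sketch (assumed libraries: librosa, scikit-learn).
import numpy as np
import librosa
from sklearn.neural_network import MLPClassifier
from sklearn.model_selection import train_test_split

def mfcc_features(wav_path, n_mfcc=13):
    """One mean MFCC vector per utterance; LPC or PLP features could replace this."""
    y, sr = librosa.load(wav_path, sr=None)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
    return mfcc.mean(axis=1)                       # shape: (n_mfcc,)

def lip_features(landmarks):
    """Lip width and height from mouth-corner and lip-midpoint coordinates.
    `landmarks` is a hypothetical dict of 2-D numpy points from the video track."""
    width = np.linalg.norm(landmarks["left_corner"] - landmarks["right_corner"])
    height = np.linalg.norm(landmarks["upper_mid"] - landmarks["lower_mid"])
    return np.array([width, height])

def build_dataset(utterances):
    """`utterances` is assumed to be a list of (wav_path, landmark_dict, vowel_label)."""
    X, y = [], []
    for wav_path, landmarks, label in utterances:
        acoustic = mfcc_features(wav_path)
        visual = lip_features(landmarks)
        X.append(np.concatenate([acoustic, visual]))   # audio-visual fusion
        y.append(label)
    return np.array(X), np.array(y)

def train_vowel_classifier(utterances):
    """Train a small ANN on the fused features and report held-out accuracy."""
    X, y = build_dataset(utterances)
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)
    ann = MLPClassifier(hidden_layer_sizes=(32,), max_iter=1000, random_state=0)
    ann.fit(X_tr, y_tr)
    return ann, ann.score(X_te, y_te)
```

The fusion step is simply feature concatenation before classification; other classifiers named in the abstract (VQ, DTW, SVM) would consume the same fused vectors.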
