Lip feature extraction for visual speech recognition using Hidden Markov Model

Abstract

Visual speech recognition is the task of recognizing spoken words from the visual information in lip movements. This paper presents a new approach to lip reading. Visual speech recognition has applications in scientific areas, such as speech recognition systems, and in social settings, such as recognizing the words spoken by hearing-impaired persons. The visual speech video is given as input to a face localization module, which detects the face region. The mouth region is then determined relative to the face region. Several feature extraction methods were used; of these, the 16-point DCT method achieved a recognition accuracy of 93.5% in the experiments. The feature vectors from each method are applied separately as inputs to a Hidden Markov Model (HMM) to recognize the visual speech. Ten participants uttered 35 different isolated words. For each word, 20 samples were collected for training and testing the HMM.
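
The pipeline described in the abstract (mouth region, DCT features, one HMM per word) can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not say how the "16-point DCT" is computed, so the code assumes it means keeping the 16 lowest-frequency coefficients (the top-left 4x4 block) of a 2-D DCT of a square grayscale mouth patch. The function names (dct2, dct_features, hmm_loglik, recognize), the diagonal-Gaussian emissions, and the hand-set model parameters are all hypothetical choices made for illustration; a real system would train the HMM parameters from the collected samples.

```python
import math

def dct2(block):
    """Orthonormal 2-D DCT-II of a square patch given as a list of rows."""
    n = len(block)
    scale = [math.sqrt(1.0 / n)] + [math.sqrt(2.0 / n)] * (n - 1)
    # basis[k][i] = cos(pi * (2i + 1) * k / (2n))
    basis = [[math.cos(math.pi * (2 * i + 1) * k / (2 * n)) for i in range(n)]
             for k in range(n)]
    out = [[0.0] * n for _ in range(n)]
    for u in range(n):
        for v in range(n):
            s = 0.0
            for x in range(n):
                for y in range(n):
                    s += block[x][y] * basis[u][x] * basis[v][y]
            out[u][v] = scale[u] * scale[v] * s
    return out

def dct_features(mouth_patch, keep=4):
    """Assumed '16-point' feature: the keep x keep low-frequency coefficients."""
    coeffs = dct2(mouth_patch)
    return [coeffs[u][v] for u in range(keep) for v in range(keep)]

def log_gauss(x, mean, var):
    """Log density of a diagonal-covariance Gaussian."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))

def logsumexp(values):
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))

def hmm_loglik(obs, log_pi, log_A, means, variances):
    """Forward algorithm in the log domain; obs is a list of feature vectors."""
    n = len(log_pi)
    alpha = [log_pi[s] + log_gauss(obs[0], means[s], variances[s])
             for s in range(n)]
    for x in obs[1:]:
        alpha = [logsumexp([alpha[r] + log_A[r][s] for r in range(n)])
                 + log_gauss(x, means[s], variances[s]) for s in range(n)]
    return logsumexp(alpha)

def recognize(obs, models):
    """Return the word whose HMM gives the observation sequence the highest likelihood."""
    return max(models, key=lambda word: hmm_loglik(obs, *models[word]))

# Usage: a flat 8x8 patch has energy only in the DC coefficient.
frame = [[10.0] * 8 for _ in range(8)]
features = dct_features(frame)  # 16 values per video frame
```

In use, each video frame of an utterance would yield one 16-dimensional vector from dct_features, the sequence of vectors would be passed to recognize, and models would hold one tuple (log_pi, log_A, means, variances) per vocabulary word.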

Bibliographic record

Source: International Conference on Computing, Communication and Applications, 2012, 5 pages
Author: Sujatha P.
Original format: PDF
Chinese Library Classification: TN91-53

