Lip Location Normalized Training for Visual Speech Recognition

Oscar Vanegas; Keiichi Tokuda; Tadashi Kitamura

Abstract

This paper describes a method for normalizing lip position to improve the performance of a speech recognition system based on visual information. Two types of information are useful in speech recognition: the speech signal itself and the visual information from the lips in motion. The paper addresses problems that arise when using images of moving lips, such as the effect of variation in lip location.

Bibliographic details

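Source: IEICE Transactions on Information and Systems, 2000, No. 11, pp. 1969-1977 (9 pages)
Authors: Oscar Vanegas; Keiichi Tokuda; Tadashi Kitamura
Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Language: English
Chinese Library Classification: Radio electronics and telecommunications technology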
Keywords: hidden Markov model; lip location normalization; lipreading

Date added to database: 2022-08-18 00:34:01

Similar literature

1. A Low-Complexity Parabolic Lip Contour Model With Speaker Normalization for High-Level Feature Extraction in Noise-Robust Audiovisual Speech Recognition [J]. Borgstrom B.J., Alwan A. IEEE Transactions on Systems, Man, and Cybernetics. Part A, Systems and Humans. 2008, No. 6
2. Normalized Training for HMM-Based Visual Speech Recognition [J]. Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura. Electronics and Communications in Japan. Part 3, Fundamental Electronic Science. 2006, No. 11
3. Lip Detection and Lip Geometric Feature Extraction using Constrained Local Model for Spoken Language Identification using Visual Speech Recognition [J]. Aparna Brahme, Umesh Bhadade. Indian Journal of Science and Technology. 2016, No. 32
4. Normalized Training for HMM-Based Visual Speech Recognition [C]. Nankaku Y., Tokuda K., Kitamura T. Image Processing, 2000. Proceedings. 2000 International Conference on. 2000
5. Speech recognition: The interpretation of training and using speech recognition software from the perspectives of postsecondary students with learning challenges [D]. Soenksen, Delann. 2006
6. Hearing Lips and Seeing Voices: How Cortical Areas Supporting Speech Production Mediate Audiovisual Speech Perception [O]. Jeremy I. Skipper, Virginie van Wassenhove, Howard C. Nusbaum
7. Normalized Training for HMM-Based Visual Speech Recognition [O]. Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura. 2000
