Journal of Electronic Imaging

Lipreading model based on a two-way convolutional neural network and feature fusion


Abstract

Lipreading feature extraction is essentially feature extraction over continuous video frame sequences. A lipreading model based on a two-way convolutional neural network and feature fusion is proposed to obtain more reasonable visual spatial-temporal characteristics. Unlike other deep-learning-based lipreading methods, the rank pooling method transforms the lip video into a standard RGB image that can be fed directly into the convolutional neural network, which effectively reduces the input dimension. In addition, to compensate for the lack of spatial information, the apparent shape and depth features are fused, and a joint cost function is then used to guide network training toward more discriminative features. The method was evaluated on the public GRID and OuluVS2 databases. The results show that the proposed method reaches an accuracy of more than 93%, which validates its effectiveness. (C) 2021 SPIE and IS&T
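The abstract does not give the exact rank-pooling formulation the authors use; as a rough illustration only, the Python sketch below shows one common variant (approximate rank pooling, often called a dynamic image) that collapses a clip of lip frames into a single RGB image suitable for a standard 2-D convolutional network. All function names, array shapes, and the example clip are hypothetical.

    import numpy as np

    def dynamic_image(frames):
        """Collapse a clip of shape (T, H, W, 3) into one RGB image via
        approximate rank pooling, so it can be fed to a 2-D CNN."""
        T = len(frames)
        # Harmonic numbers H_t = sum_{i=1..t} 1/i, with H_0 = 0.
        harmonics = np.concatenate([[0.0], np.cumsum(1.0 / np.arange(1, T + 1))])
        # Per-frame coefficient from the approximate rank-pooling derivation:
        # alpha_t = 2*(T - t + 1) - (T + 1)*(H_T - H_{t-1}).
        alphas = np.array([2 * (T - t + 1) - (T + 1) * (harmonics[T] - harmonics[t - 1])
                           for t in range(1, T + 1)])
        # Weighted sum over the time axis collapses the clip to (H, W, 3).
        pooled = np.tensordot(alphas, frames.astype(np.float64), axes=(0, 0))
        # Rescale to an 8-bit RGB image so it matches a normal CNN input.
        pooled -= pooled.min()
        if pooled.max() > 0:
            pooled = pooled / pooled.max() * 255.0
        return pooled.astype(np.uint8)

    # Example: a 30-frame, 64x64 mouth-region clip reduced to one image.
    clip = np.random.randint(0, 256, size=(30, 64, 64, 3), dtype=np.uint8)
    rgb_input = dynamic_image(clip)  # shape (64, 64, 3)

In the two-branch setup the abstract describes, such a pooled image would presumably feed one CNN branch while the depth features feed the other; the fusion scheme and the joint cost function are not detailed in the abstract.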


