Speaker Recognition Based on Image Information

刘培培; 杨祥来

Abstract

This paper proposes a speaker recognition scheme based on image information. A dataset of 916 samples is constructed, each consisting of 20 consecutive image frames. Speaker recognition is carried out in two stages: first, face recognition technology locates the mouth region of every face and lip movement detection is performed on it; second, face recognition is applied to the faces in which lip movement was detected. Two methods are designed for building the lip movement detection model. In the first, the distance between the upper and lower lips and the width of the nose are measured on the face in each frame of a sample, the ratio of lip distance to nose width is taken as that frame's feature, and a support vector machine is trained on the features of all samples. In the second, the lip region of the face is cropped from each frame, a convolutional neural network extracts features from the cropped lip images, and these features are fed to a long short-term memory network, which is trained for temporal classification. Experimental results show that speaker recognition based on image information can achieve high accuracy.
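The first lip movement feature described in the abstract (lip distance divided by nose width, one value per frame) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say which landmark detector is used, so the landmark names `nose_left`, `nose_right`, `upper_lip`, and `lower_lip` and the `Frame` dictionary layout are hypothetical. Dividing by nose width presumably makes the feature less sensitive to how large the face appears in the image.

```python
from math import hypot
from typing import Dict, List, Tuple

Point = Tuple[float, float]
# Hypothetical layout: landmark name -> (x, y) pixel coordinates for one frame.
Frame = Dict[str, Point]


def distance(a: Point, b: Point) -> float:
    """Euclidean distance between two landmark points."""
    return hypot(a[0] - b[0], a[1] - b[1])


def lip_ratio(frame: Frame) -> float:
    """Lip distance divided by nose width for a single frame."""
    nose_width = distance(frame["nose_left"], frame["nose_right"])
    if nose_width == 0:
        raise ValueError("nose width is zero; landmarks are degenerate")
    lip_gap = distance(frame["upper_lip"], frame["lower_lip"])
    return lip_gap / nose_width


def sample_features(frames: List[Frame]) -> List[float]:
    """One ratio per frame; a 20-frame sample yields a 20-value vector."""
    return [lip_ratio(f) for f in frames]


def make_frame(lip_gap: float) -> Frame:
    """Build a synthetic frame with a fixed 20-pixel nose width (illustration only)."""
    return {
        "nose_left": (40.0, 50.0),
        "nose_right": (60.0, 50.0),
        "upper_lip": (50.0, 70.0),
        "lower_lip": (50.0, 70.0 + lip_gap),
    }


# Synthetic 20-frame sample: the mouth alternates between closed and open.
speaking_sample = [make_frame(5.0 if i % 2 else 0.0) for i in range(20)]
features = sample_features(speaking_sample)
```

A vector like `features`, computed for each of the labeled samples, is the kind of input a support vector machine classifier could then be trained on to separate speaking from non-speaking samples.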

Bibliographic details

Source
《中国科技论文》 | 2018, Issue 20 | pp. 2388-2393 | 6 pages
Authors
刘培培; 杨祥来
Author affiliations

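College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao, Shandong 266590

Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190

State Grid of China Technology College, Jinan 250002
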
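Original format: PDF
Language of text: Chinese
Chinese Library Classification: Multimedia technology and multimedia computers
Keywords
face recognition; speaker recognition; lip movement detection; support vector machine; convolutional neural network; long short-term memory network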

