首页> 外文期刊>IAENG Internaitonal journal of computer science >Dynamic Facial Dataset Capture and Processing for Visual Speech Recognition using an RGB-D Sensor
【24h】

Dynamic Facial Dataset Capture and Processing for Visual Speech Recognition using an RGB-D Sensor

机译:使用RGB-D传感器的可视语音识别动态面部数据集捕获和处理

获取原文
获取原文并翻译 | 示例
           

摘要

This work presents a new RGB-D acquisition system to capture a comprehensive dynamic facial dataset that can be used for visual speech recognition. The RGB-D facial dataset acquisition system uses a Kinect to record detailed facial features of a person. The dynamic facial dataset is comprised of the facial data of 20 individuals saying 20 common English words or phrases. The acquisition system employs Kinect facial tracking, which records a large number of dynamic facial features. These features include: facial points, facial outline, RGB data, depth data, mapping between RGB and depth data, facial animation units, facial shape units, and finally 2D and 3D face representations of the face along with the 3D head orientation. The effectiveness of acquired RGB-D dynamic facial dataset is demonstrated by presenting a new visual speech recognition method that employs three-dimensional spatiotemporal data of different facial feature points. A number of visual speech recognition methods from the literature are also tested on the new dataset and they obtain a comparable or favorable visual speech recognition results. The results demonstrate the effectiveness of the proposed RGB-D dynamic facial dataset and show that it can be effectively employed in a visual speech recognition system.
机译:这项工作提出了一个新的RGB-D采集系统,可以捕获一个可用于可视语音识别的全面动态面部数据集。 RGB-D面部数据集采集系统使用Kinect来记录一个人的详细面部特征。动态面部数据集包括20个个人的面部数据,称为20个常见的英语单词或短语。采集系统采用Kinect面部跟踪,记录了大量的动态面部特征。这些特征包括:面部点,面部轮廓,RGB数据,深度数据,RGB和深度数据之间的映射,面部动画单元,面部形状单元,以及最后2D和3D面向3D头部方向的2D和3D面部表示。通过呈现采用不同面部特征点的三维时空数据的新视觉语音识别方法来证明所获取的RGB-D动态面部数据集的有效性。来自文献的许多可视语音识别方法也在新数据集上进行测试,并且它们获得了可比或有利的视觉语音识别结果。结果证明了所提出的RGB-D动态面部数据集的有效性,并表明它可以在视觉语音识别系统中有效地使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号