首页> 外文会议>IEEE Visual Communications and Image Processing Conference >Synthesizing 3D Acoustic-Articulatory Mapping Trajectories: Predicting Articulatory Movements by Long-Term Recurrent Convolutional Neural Network
【24h】

Synthesizing 3D Acoustic-Articulatory Mapping Trajectories: Predicting Articulatory Movements by Long-Term Recurrent Convolutional Neural Network

机译:合成3D声学-发音映射轨迹:通过长期递归卷积神经网络预测发音运动

获取原文

摘要

Robust and accurate predicting of articulatory movements has various important applications, such as 3D articulatory animations and visual communication. Various approaches have been proposed to solve the acoustic-articulatory mapping problem. However, their precision is not high enough. Recently, deep neural network (DNN), especially convolutional neural network (CNN) and recurrent neural network (RNN), has brought tremendous success in speech recognition and synthesis. To increase the accuracy, we propose a new network architecture for acoustic-articulatory mapping, called long-term recurrent convolutional neural network (LTRCNN). The network consists of CNN, RNN and a skip connection. CNN can model the spectral correlation among acoustic features efficiently. RNN, like long short-term memory (LSTM), can learn the temporal context information from sequential data powerfully. Besides, skip connections can increase the input representation from different levels to preserve the feature information. Experiments show that LTRCNN achieves the state-of-the-art root-mean-squared error (RMSE) with 0.690 mm and the correlation coefficient with 0.949 in this prediction task.
机译:关节运动的鲁棒且准确的预测具有各种重要的应用,例如3D关节动画和视觉传达。已经提出了各种方法来解决声学发音映射问题。但是,它们的精度不够高。近年来,深度神经网络(DNN),尤其是卷积神经网络(CNN)和递归神经网络(RNN)在语音识别和合成方面取得了巨大的成功。为了提高准确性,我们提出了一种用于声学发音映射的新网络架构,称为长期递归卷积神经网络(LTRCNN)。该网络由CNN,RNN和跳过连接组成。 CNN可以有效地模拟声学特征之间的频谱相关性。像长短期记忆(LSTM)一样,RNN可以从顺序数据中强大地学习时间上下文信息。此外,跳过连接可以从不同级别增加输入表示,以保留特征信息。实验表明,在该预测任务中,LTRCNN实现了0.690 mm的最新均方根误差(RMSE)和0.949的相关系数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号