A Minimum Converted Trajectory Error (MCTE) Approach to High Quality Speech-to-Lips Conversion

Abstract

High-quality speech-to-lips conversion, investigated in this work, renders realistic lip movements (video) consistent with input speech (audio) without knowledge of its linguistic content. Instead of memoryless frame-based conversion, we adopt maximum likelihood estimation of the visual parameter trajectories using an audio-visual joint Gaussian Mixture Model (GMM). We propose a minimum converted trajectory error (MCTE) approach to further refine the converted visual parameters. First, we reduce the conversion error by training the joint audio-visual GMM with weighted audio and visual likelihoods. Then, MCTE uses the generalized probabilistic descent algorithm to minimize a conversion error of the visual parameter trajectories, defined on the optimal Gaussian kernel sequence for the input speech. We demonstrate the effectiveness of the proposed methods on the LIPS 2009 Visual Speech Synthesis Challenge dataset, without knowing the linguistic (phonetic) content of the input speech.
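
The contrast between memoryless frame-based conversion and maximum likelihood trajectory estimation can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the commonly used formulation in which each frame carries a Gaussian over a static visual parameter and its delta (dynamic) feature, the Gaussian sequence is already fixed, and the trajectory is obtained by solving a weighted least-squares system. The function names `delta_window_matrix` and `ml_trajectory`, the one-dimensional visual parameter, and the central-difference delta window are all assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np

def delta_window_matrix(T):
    """Build W (2T x T) mapping a static trajectory y to [static; delta].

    The delta row is a central difference 0.5 * (y[t+1] - y[t-1]),
    clamped at the sequence edges (an assumed window, not from the paper).
    """
    static_rows = np.eye(T)
    delta_rows = np.zeros((T, T))
    for t in range(T):
        delta_rows[t, min(t + 1, T - 1)] += 0.5
        delta_rows[t, max(t - 1, 0)] -= 0.5
    return np.vstack([static_rows, delta_rows])

def ml_trajectory(mean, var):
    """Maximum likelihood static trajectory for a fixed Gaussian sequence.

    mean, var: arrays of shape (T, 2) holding per-frame [static, delta]
    means and diagonal variances. Solves (W' P W) y = W' P mu, where P is
    the diagonal precision matrix.
    """
    T = mean.shape[0]
    W = delta_window_matrix(T)
    mu = np.concatenate([mean[:, 0], mean[:, 1]])
    precision = 1.0 / np.concatenate([var[:, 0], var[:, 1]])
    A = W.T @ (precision[:, None] * W)
    b = W.T @ (precision * mu)
    return np.linalg.solve(A, b)

if __name__ == "__main__":
    # A step in the static means; a frame-by-frame conversion would copy it.
    mean = np.column_stack([np.array([0.0, 0.0, 0.0, 1.0, 1.0, 1.0]), np.zeros(6)])
    var = np.column_stack([np.ones(6), np.full(6, 0.01)])
    print("frame-based:", mean[:, 0])
    print("trajectory: ", ml_trajectory(mean, var))
```

A frame-based converter would output the static means unchanged, including the abrupt step. The trajectory solution trades fidelity to the static means against the delta constraints, so the step is smoothed across neighboring frames.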

Publication details

Source
Annual Conference of the International Speech Communication Association (INTERSPEECH 2010) | 2011 | pp. 1736-1739 | 4 pages
Authors
Xiaodan Zhuang; Lijuan Wang; Frank Soong; Mark Hasegawa-Johnson
Original format: PDF
Chinese Library Classification: Communications
Keywords
visual speech synthesis; speech-to-lips conversion; minimum conversion error; minimum generation error

Similar literature

1. Trajectory Planning with Minimum Synthesis Error for Industrial Robots Using Screw Theory [J]. Liu Zhifeng, Xu Jingjing, Cheng Qiang. International Journal of Precision Engineering and Manufacturing. 2018, No. 2
2. Improving Trajectory Modelling for DNN-Based Speech Synthesis by Using Stacked Bottleneck Features and Minimum Generation Error Training [J]. Zhizheng Wu, Simon King. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2016, No. 7
3. Minimum Time Trajectory Optimization of CNC Machining with Tracking Error Constraints [J]. Qiang Zhang, Shurong Li, Jianxin Guo. Abstract and Applied Analysis. 2014, No. 2
4. A Minimum Converted Trajectory Error (MCTE) Approach to High Quality Speech-to-Lips Conversion [C]. Xiaodan Zhuang, Lijuan Wang, Frank Soong. Annual Conference of the International Speech Communication Association. 2010
5. Minimum jerk trajectory planning for trajectory constrained redundant robots [D]. Freeman, Philip. 2012
6. A Formal Approach to the Selection by Minimum Error and Pattern Method for Sensor Data Loss Reduction in Unstable Wireless Sensor Network Communications [O]. Changhwa Kim, DongHyun Shin. 2017
7. Improving Trajectory Modelling for DNN-based Speech Synthesis by using Stacked Bottleneck Features and Minimum Generation Error Training [O]. Wu, Zhizheng, King, Simon. 2016
