首页> 外文期刊>EURASIP journal on audio, speech, and music processing >On the Importance of Audiovisual Coherence for the Perceived Quality of Synthesized Visual Speech
【24h】

On the Importance of Audiovisual Coherence for the Perceived Quality of Synthesized Visual Speech

机译:视听连贯性对合成视觉语音感知质量的重要性

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Audiovisual text-to-speech systems convert a written text into an audiovisual speech signal. Typically, the visual mode of the synthetic speech is synthesized separately from the audio, the latter being either natural or synthesized speech. However, the perception of mismatches between these two information streams requires experimental exploration since it could degrade the quality of the output. In order to increase the intermodal coherence in synthetic 2D photorealistic speech, we extended the well-known unit selection audio synthesis technique to work with multimodal segments containing original combinations of audio and video. Subjective experiments confirm that the audiovisual signals created by our multimodal synthesis strategy are indeed perceived as being more synchronous than those of systems in which both modes are not intrinsically coherent. Furthermore, it is shown that the degree of coherence between the auditory mode and the visual mode has an influence on the perceived quality of the synthetic visual speech fragment. In addition, the audio quality was found to have only a minor influence on the perceived visual signal's quality.
机译:视听文本到语音系统将书面文本转换为视听语音信号。通常,合成语音的视觉模式与音频分开合成,后者是自然语音或合成语音。但是,这两个信息流之间不匹配的感知需要进行实验探索,因为这可能会降低输出质量。为了增加合成的2D真实感语音中的模式间连贯性,我们扩展了众所周知的单位选择音频合成技术,以处理包含音频和视频原始组合的多模式段。主观实验证实,由我们的多模式合成策略创建的视听信号确实比其中两种模式本质上不相干的系统的同步性更高。此外,表明听觉模式和视觉模式之间的连贯程度对合成视觉语音片段的感知质量有影响。另外,发现音频质量仅对感知的视觉信号的质量有较小的影响。

著录项

  • 来源
    《EURASIP journal on audio, speech, and music processing》 |2009年第suppla期|P.51-62|共12页
  • 作者单位

    Department of ETRO-DSSP, Interdisciplinary Institute for Broadband Technology IBBT, Vrije Universiteit Brussel, Pleinlaan 2, B-1050 Brussels, Belgium;

    rnDepartment of ETRO-DSSP, Interdisciplinary Institute for Broadband Technology IBBT, Vrije Universiteit Brussel, Pleinlaan 2, B-1050 Brussels, Belgium;

    rnDepartment of ETRO-DSSP, Interdisciplinary Institute for Broadband Technology IBBT, Vrije Universiteit Brussel, Pleinlaan 2, B-1050 Brussels, Belgium;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号