Conference: 9th International Conference on Language Resources and Evaluation
The MMASCS multi-modal annotated synchronous corpus of audio, video, facial motion and tongue motion data of normal, fast and slow speech

Abstract

In this paper, we describe and analyze a corpus of speech data that we recorded in multiple modalities simultaneously: facial motion via optical motion capture, tongue motion via electromagnetic articulography, as well as conventional video and high-quality audio. The corpus consists of 320 phonetically diverse sentences uttered by a male Austrian German speaker at normal, fast and slow speaking rates. We analyze the influence of speaking rate on phone durations and on tongue motion. Furthermore, we investigate the correlation between tongue and facial motion.