首页> 外文会议>Spoken Language Technology Workshop >Tal: A Synchronised Multi-Speaker Corpus of Ultrasound Tongue Imaging, Audio, and Lip Videos
【24h】

Tal: A Synchronised Multi-Speaker Corpus of Ultrasound Tongue Imaging, Audio, and Lip Videos

机译:TAL:超声舌头成像,音频和唇视频的同步多扬声器语料库

获取原文

摘要

We present the Tongue and Lips corpus (TaL), a multi-speaker corpus of audio, ultrasound tongue imaging, and lip videos. TaL consists of two parts: TaL1 is a set of six recording sessions of one professional voice talent, a male native speaker of English; TaL80 is a set of recording sessions of 81 native speakers of English without voice talent experience. Overall, the corpus contains 24 hours of parallel ultrasound, video, and audio data, of which approximately 13.5 hours are speech. This paper describes the corpus and presents benchmark results for the tasks of speech recognition, speech synthesis (articulatory-to-acoustic mapping), and automatic synchronisation of ultrasound to audio. The TaL corpus is publicly available under the CC BY-NC 4.0 license.
机译:我们介绍了舌头和嘴唇(TAL),一个音频,超声舌头成像和唇视频的多扬声器语料库。 Tal由两部分组成:Tal1是一套六个录音会话,一个专业语音人才,一个男性母语的英语; Tal80是一组81母语英语的录音会话,没有语音人才体验。总的来说,语料库包含24小时的并行超声,视频和音频数据,其中大约13.5小时是语音。本文介绍了语料库,并为语音识别,语音合成(铰接到声学映射)的任务提供了基准结果,以及超声波到音频的自动同步。 TAL语料库在CC BY-NC 4.0许可证下公开提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号