Venue: International Conference on Language Resources and Evaluation

Open-source Multi-speaker Corpora of the English Accents in the British Isles

Abstract

This paper presents a dataset of transcribed high-quality audio of English sentences recorded by volunteers speaking with different accents of the British Isles. The dataset is intended for linguistic analysis as well as for use in speech technologies. The recording scripts were curated specifically for accent elicitation, covering a variety of phonological phenomena and providing high phoneme coverage. The scripts include pronunciations of global locations, major airlines and common personal names in different accents, as well as native-speaker pronunciations of local words. Lines shared by all speakers were included for idiolect elicitation; these include lines identical or similar to those in existing resources such as the CSTR VCTK corpus and the Speech Accent Archive, to allow easy comparison of personal and regional accents. The resulting corpora include over 31 hours of recordings from 120 volunteers who self-identify as native speakers of Southern England, Midlands, Northern England, Welsh, Scottish and Irish varieties of English.
