首页> 外文会议>International Conference on Language Resources and Evaluation >Call My Net 2: A New Resource for Speaker Recognition
【24h】

Call My Net 2: A New Resource for Speaker Recognition

机译:致电我的网2:扬声器识别的新资源

获取原文

摘要

We introduce the Call My Net 2 (CMN2) Corpus, a new resource for speaker recognition featuring Tunisian Arabic conversations between friends and family, incorporating both traditional telephony and VoIP data The corpus contains data from over 400 Tunisian Arabic speakers collected via a custom-built platform deployed in Tunis, with each speaker making 10 or more calls each lasting up to 10 minutes. Calls include speech in various realistic and natural acoustic settings, both noisy and non-noisy. Speakers used a variety of handsets, including landline and mobile devices, and made VoIP calls from tablets or computers. All calls were subject to a series of manual and automatic quality checks, including speech duration, audio quality, language identity and speaker identity. The CMN2 corpus has been used in two NIST Speaker Recognition Evaluations (SRE18 and SRE19). And the SRE test sets as well as the full CMN2 corpus will be published in the Linguistic Data Consortium Catalog. We describe CMN2 corpus requirements, the telephone collection platform, and procedures for call collection. We review properties of the CMN2 dataset and discuss features of the corpus that distinguish it from prior SRE collection efforts, including some of the technical challenges encountered with collecting VoIP data.
机译:我们介绍了我的网2(CMN2)语料库,这是一个新的发言者识别的新资源,具有朋友和家庭之间的突尼斯阿拉伯对话,包括传统的电话和VoIP数据,语料库包含来自超过400个突尼斯阿拉伯语扬声器的数据,通过自定义构建部署在突尼斯的平台,每个扬声器都会制作10个或更多调用,每个呼叫持续最多10分钟。呼叫包括各种现实和自然声学设置的演讲,嘈杂和非嘈杂。扬声器使用各种手机,包括固定电话和移动设备,并从平板电脑或计算机进行VoIP呼叫。所有呼叫都受到一系列手动和自动质量检查,包括语音持续时间,音频质量,语言身份和扬声器标识。 CMN2语料库已用于两个NIST扬声器识别评估(SRE18和SRE19)。和SRE测试集以及全CMN2语料库将在语言数据联盟目录中发布。我们描述了CMN2语料库要求,电话收集平台和呼叫收集程序。我们审查CMN2数据集的属性,并讨论语料库的特征,将其区分开于先前的SRE收集工作,包括收集VoIP数据的一些技术挑战。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号