INTERSPEECH 2012

The 'Audio-Visual Face Cover Corpus': Investigations into audio-visual speech and speaker recognition when the speaker's face is occluded by facewear


Abstract

The Audio-Visual Face Cover Corpus consists of high-quality audio and video recordings of 10 native British English speakers wearing different types of 'facewear'. Speakers read aloud a set of 64 /C_1VC_2/ syllables embedded in a carrier phrase. 18 English consonants occurred twice each in onset and coda positions. Speakers recited the list 1+8 times, i.e. once in a control condition (no facewear) and eight times while wearing a forensically relevant face covering. Audio recordings were made by simultaneously capturing the speech via a headband microphone and two shotgun microphones placed in front of and behind the speaker. Footage of the subject's head and shoulders was filmed from two camera angles, frontal and half-profile. In total, 6,120 utterances were recorded per device. This paper aims to specify the database design, to introduce forensic-phonetic research utilising the data, and to demonstrate the corpus's potential applications in related fields of study and in casework conducted by forensic speech scientists.
