Conference: International Conference on Spoken Language Processing

Construction of Speech Corpus in Moving Car Environment

Abstract

The Center for Integrated Acoustic Information Research (CIAIR) at Nagoya University has been collecting speech corpora in moving cars; these are made available as resources to advance the research and development of robust ASR and spoken dialogue systems under high-noise conditions. The speech corpus consists of (1) phonetically balanced sentences, (2) digit strings, (3) discrete words, and (4) transcribed spoken dialogues between drivers and information systems for navigation and information retrieval. These data are collected in vehicles under both idling and driving conditions. The language of the corpus is currently Japanese. The number of subjects is currently about 300, the total recording time is over 200 hours, and the total corpus size is about 160 GB. We have also been recording video images from three different angles, vehicle-control signals, and vehicle location, all synchronized with the speech recording. We report the objective of the speech corpus, the recording methods, and the recording vehicle developed.
