...
首页> 外文期刊>Journal of VLSI signal processing >Multimedia Corpus of In-Car Speech Communication
【24h】

Multimedia Corpus of In-Car Speech Communication

机译:车载语音通信多媒体语料库

获取原文
获取原文并翻译 | 示例

摘要

An ongoing project for constructing a multimedia corpus of dialogues under the driving condition is reported. More than 500 subjects have been enrolled in this corpus development and more than 2 gigabytes of signals have been collected during approximately 60 minutes of driving per subject. Twelve microphones and three video cameras are installed in a car to obtain audio and video data. In addition, five signals regarding car control and the location of the car provided by the Global Positioning System (GPS) are recorded. All signals are simultaneously recorded directly onto the hard disk of the PCs onboard the specially designed data collection vehicle (DCV). The in-car dialogues are initiated by a human operator, an automatic speech recognition (ASR) system and a wizard of OZ (WOZ) system so as to collect as many speech disfluencies as possible. In addition to the details of data collection, in this paper, preliminary results on intermedia signal conversion are described as an example of the corpus-based in-car speech signal processing research.
机译:据报道,在驾驶条件下,正在建立一个多媒体对话语料库的正在进行的项目。在这个语料库的开发中,已经招募了500多名受试者,并且在每名受试者驾驶大约60分钟的过程中,已经收集了2 GB以上的信号。汽车上安装了十二个麦克风和三个摄像机,以获取音频和视频数据。此外,还记录了五个由全球定位系统(GPS)提供的有关轿厢控制和轿厢位置的信号。所有信号均同时直接记录在专门设计的数据收集工具(DCV)上的PC硬盘上。车内对话是由操作员,自动语音识别(ASR)系统和OZ向导(WOZ)系统发起的,以便收集尽可能多的语音干扰。除了数据收集的细节外,本文还介绍了有关中间信号转换的初步结果,作为基于语料库的车内语音信号处理研究的一个示例。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号