首页> 外文期刊>IEICE Transactions on Information and Systems >Construction and Evaluation of a Large In-Car Speech Corpus
【24h】

Construction and Evaluation of a Large In-Car Speech Corpus

机译:大型车载语音语料库的构建与评价

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we discuss the construction of a large in-car spoken dialogue corpus and the result of its analysis. We have developed a system specially built into a Data Collection Vehicle (DCV) which supports the synchronous recording of multichannel audio data from 16 microphones that can be placed in flexible positions, multichannel video data from 3 cameras, and vehicle related data. Multimedia data has been collected for three sessions of spoken dialogue with different modes of navigation, during approximately a 60 minute drive by each of 800 subjects. We have characterized the collected dialogues across the three sessions. Some characteristics such as sentence complexity and SNR are found to differ significantly among the sessions. Linear regression analysis results also clarify the relative importance of various corpus characteristics.
机译:在本文中,我们讨论了一个大型的车载语音对话语料库的构建及其分析结果。我们已经开发出一种专门内置在数据收集车(DCV)中的系统,该系统支持同步记录来自16个可灵活放置的麦克风的多通道音频数据,来自3个摄像机的多通道视频数据以及与车辆相关的数据。在800名受试者中,每人大约60分钟的车程内,已经收集了三段具有不同导航方式的口语对话的多媒体数据。我们对这三个会议的对话进行了描述。发现会话之间的某些特性(例如句子复杂度和SNR)明显不同。线性回归分析结果还阐明了各种语料库特征的相对重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号