...
首页> 外文期刊>Computer speech and language >The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments
【24h】

The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments

机译:嘈杂环境中录制的多麦克风原位语音语料库COSINE的设计和收集

获取原文
获取原文并翻译 | 示例
           

摘要

We present an overview of the data collection and transcription efforts for the Conversational Speech In Noisy Environments (COSINE) corpus. The corpus is a set of multi-party conversations recorded in real world environments with background noise. It can be used to train noise-robust speech recognition systems or develop speech de-noising algorithms. We explain the motivation for creating such a corpus, and describe the resulting audio recordings and transcriptions that comprise the corpus. These high quality recordings were captured in situ on a custom wearable recording system, whose design and construction is also described. On separate synchronized audio channels, seven-channel audio is captured with a 4-channel far-field microphone array, along with a close-talking, a monophonic far-field, and a throat microphone. This corpus thus creates many possibilities for speech algorithm research.
机译:我们概述了嘈杂环境中的会话语音(COSINE)语料库的数据收集和转录工作。语料库是在现实环境中记录的带有背景噪音的一组多方对话。它可以用于训练抗噪语音识别系统或开发语音去噪算法。我们解释了创建这种语料库的动机,并描述了构成该语料库的音频记录和转录。这些高质量的记录是在定制的可穿戴记录系统上原位捕获的,还描述了其设计和构造。在单独的同步音频通道上,使用4通道远场麦克风阵列,近距离通话,单声道远场和喉咙麦克风捕获七声道音频。因此,该语料库为语音算法研究创造了许多可能性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号