Conference: 2012 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference

Microphone array processing for distant speech recognition: Spherical arrays

Abstract

Distant speech recognition (DSR) holds out the promise of the most natural human-computer interface because it enables man-machine interaction through speech, without the need to don intrusive body- or head-mounted microphones. With the advent of the Microsoft Kinect, the application of non-uniform linear arrays to the DSR problem has become commonplace. Performance analysis of such arrays is well represented in the literature. Recently, spherical arrays have become the subject of intense research interest in the acoustic array processing community. Such arrays have heretofore been analyzed solely with theoretical metrics under idealized conditions. In this work, we analyze such arrays under realistic conditions. Moreover, we compare a linear array with 64 channels and a total length of 126 cm to a spherical array with 32 channels and a radius of 4.2 cm; we found that these provided word error rates of 9.3% and 10.2%, respectively, on a DSR task. For a speaker positioned at an oblique angle with respect to the linear array, we recorded error rates of 12.8% and 9.7%, respectively, for the linear and spherical arrays. The compact size and outstanding performance of the spherical array recommend it for space-limited and mobile applications such as home gaming consoles and humanoid robots.
