...
首页> 外文期刊>Pattern recognition letters >Exploiting alternative acoustic sensors for improved noise robustness in speech communication
【24h】

Exploiting alternative acoustic sensors for improved noise robustness in speech communication

机译:开发替代声学传感器以提高语音通信中的噪声鲁棒性

获取原文
获取原文并翻译 | 示例
           

摘要

This study investigates the use of non-conventional body-conductive acoustic sensors in human-human speech communication and automatic speech recognition. The body-conductive sensors are directly attached to the speaker and receive the uttered speech through the skin and bones, resulting in higher robustness against environmental noise. In this study, a throat microphone, an ear bone microphone, and a standard microphone were evaluated using subjective speech intelligibility tests and automatic speech recognition experiments. In addition to the use of these sensors on their own, several methods were also applied for sensor integration, thereby achieving higher recognition rates. Namely, multi-stream hidden Markov model (HMM) decision fusion, and late fusion methods were used to integrate several sensors. By using late fusion, a 40% relative recognition rate improvement in a noisy environment, and a 24% relative recognition rate improvement in a clean environment were achieved. In the case of late fusion, a novel adaptive weighting method was introduced that does not require any pre-adjustment of the weights. In this study, a technique to automatically segment noisy speech data by using a body-conductive sensor in conjunction with the desired microphone during recording is presented. The Lombard effect phenomenon when using body-conductive acoustic sensors was also investigated. (c) 2018 Elsevier B.V. All rights reserved.
机译:这项研究调查了非常规的人体导电声学传感器在人与人之间的语音交流和自动语音识别中的使用。人体传导传感器直接连接到扬声器,并通过皮肤和骨骼接收发出的语音,从而具有更高的抗环境噪声能力。在这项研究中,使用主观语音清晰度测试和自动语音识别实验对嗓音麦克风,耳骨麦克风和标准麦克风进行了评估。除了单独使用这些传感器外,还采用了几种方法来集成传感器,从而实现更高的识别率。即,使用多流隐马尔可夫模型(HMM)决策融合,以及后期融合方法来集成多个传感器。通过使用后期融合,在嘈杂的环境中,相对识别率提高了40%,在干净的环境中,相对识别率提高了24%。在后期融合的情况下,引入了一种新颖的自适应加权方法,该方法不需要任何权重的预先调整。在这项研究中,提出了一种通过在录制过程中结合使用人体感应器和所需麦克风来自动分割嘈杂语音数据的技术。还研究了使用人体传导声传感器时的伦巴第效应现象。 (c)2018 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号