首页> 外文会议>International Symposium on Computer Music Modeling and Retrieval >Intelligibility of HE-AAC Coded Japanese Words with Various Stereo Coding Modes in Virtual 3D Audio Space
【24h】

Intelligibility of HE-AAC Coded Japanese Words with Various Stereo Coding Modes in Virtual 3D Audio Space

机译:虚拟3D音频空间中具有各种立体声编码模式的He-AAC编码日语单词的可懂度

获取原文

摘要

In this paper, we investigated the influence of stereo coding on Japanese speech localized in virtual 3-D space. We encoded localized speech using joint stereo and parametric stereo modes within the HE-AAC encoder. First, we tested subjective quality of localized speech at various azimuths on the horizontal plane relative to the listener using the standard MUSHRA tests. We compared the encoded localized speech quality with various stereo encoding modes. The joint stereo mode showed significantly higher MUSHRA scores than the parametric stereo mode at azimuths of ±45 degrees. Next, the Japanese word intelligibility tests were conducted using the Japanese Diagnostic Rhyme Tests. Test speech was first localized at 0 and ±45 degrees and compared with localized speech with no coding. Parametric stereo-coded speech showed lower scores when localized at -45 degrees, but all other speech showed no difference between speech samples with no coding. Next, test speech was localized in front, while competing noise was localized at various angles. The two stereo coding modes with bit rates of 56, 32, and 24 kbps were tested. In most cases, these conditions show just as good intelligibility as speech with no encoding at all noise azimuths. This shows that stereo coding has almost no effect on the intelligibility in the bit rate range tested.
机译:在本文中,我们调查了立体声编码对虚拟3-D空间中的日语语音的影响。我们使用HE-AAC编码器中的联合立体声和参数立体声模式编码本地化语音。首先,我们使用标准脉冲试验测试了相对于听众的水平平面上各方位角的定位语音的主观质量。我们将编码的本地化语音质量与各种立体声编码模式进行比较。联合立体模式显示比±45度的方位角的参数立体模式显着更高。接下来,使用日本诊断押韵进行日语单词可懂度测试。测试语音首先定位为0和±45度,并与本地化语音相比,没有编码。参数立体声编码语音在-45度本地化时显示出较低的分数,但所有其他语音在没有编码的语音样本之间没有显示出差异。接下来,测试语音在前面定位,同时竞争噪声以各种角度定位。测试具有56,32和24kbps的比特率的两个立体编码模式。在大多数情况下,这些条件显示出与所有噪声方位角没有编码的语音一样好。这表明立体声编码几乎没有对测试的比特率范围中的可懂度影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号