Canadian Conference on Electrical and Computer Engineering

PARAMETRIC MIXING FOR CENTRALIZED VOIP CONFERENCING USING ITU-T RECOMMENDATION G.722.2

Abstract

VoIP conferencing with a centralized speech mixing bridge introduces additional end-to-end latency into packetized voice communication. This paper investigates how the full tandem cycle of speech decoding, time-domain mixing, and speech encoding can be circumvented by extracting the coded speech parameters and mixing the speech packets without time-domain reconstruction. By mixing through coded speech parameters, we show that a decrease of nearly 85 percent in computational complexity can be achieved over full tandem mixing of two speakers for G.722.2, significantly reducing the packet latency at the centralized speech mixing bridge. In the G.722.2 parametric mixer presented, linear prediction coefficients (LPCs), pitch lags, fixed codebooks, and gains are extracted from the encoded bit stream without full speech reconstruction, mixed, and then re-encoded, in contrast to the full tandem approach, in which each speech frame must be fully reconstructed. We investigate the mixing in two scenarios: i) mixing two 12.65 kbps G.722.2 speech streams at a mixed rate of 12.65 kbps, and ii) mixing two 12.65 kbps G.722.2 speech streams at a mixed rate of 18.25 kbps. PAMS is used to evaluate the speech quality of the parametric mixer; simulations using typical conversation models show an average distortion of 0.37 MOS relative to tandem mixing.
