首页> 外文会议>International Congress on Acoustics >Restoring Lost Speech Components with Generative Adversarial Networks for Speech Communications in Adverse Conditions
【24h】

Restoring Lost Speech Components with Generative Adversarial Networks for Speech Communications in Adverse Conditions

机译:恢复失去的语音组件,具有生成的对抗性网络,用于在不利条件下进行语音通信

获取原文

摘要

Speech enhancement has been widely implemented to restore quality of speech in communications between humans or between human and machine. For different speech communication scenarios with specific channel and environmental conditions, the types and degrees of speech distortion could vary significantly and many speech enhancement strategies have been developed accordingly. This study deal with a severe distortion problem, i.e., part of the spectral and/or temporal components of the speech are lost completely. The spectral loss is simulated by a transmission channel with very narrow passing bandwidth (lower than 2 kHz) which results in severely degraded speech quality; the temporal loss is simulated by packet loss up to 50% percent in massive communication which results in poor speech intelligibility. A generative adversarial networks (GAN) based speech enhancement scheme is proposed for restoring the missing spectral and temporal components with different network structure and parameters. A set of experiments have been conducted to evaluate the effectiveness of proposed enhancement scheme and promising results have achieved.
机译:致辞增强已被广泛实施,以恢复人类或人工和机器之间的通信中的语音质量。对于具有特定渠道和环境条件的不同语音通信场景,语音失真的类型和程度可以显着变化,并且相应地开发了许多语音增强策略。这项研究处理了严重的扭曲问题,即语音的频谱和/或时间分量的一部分完全丢失。通过具有非常窄的通过带宽(低于2kHz)的传输通道模拟光谱损耗,从而导致严重降级的语音质量;在大规模通信中的数据包损耗模拟了时间损耗,这导致语音清晰度差。提出了一种基于生成的对抗网络(GAN)的语音增强方案,用于恢复具有不同网络结构和参数的缺失的频谱和时间分量。已经进行了一组实验以评估所提升的增强方案的有效性,并实现了有希望的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号