首页> 外文会议>IEEE/ACM International Conference on Computer Systems and Applications >Performance Evaluation for Transform Domain Model-based Single-channel Speech Separation
【24h】

Performance Evaluation for Transform Domain Model-based Single-channel Speech Separation

机译:基于转换域模型的单声道语音分离的性能评估

获取原文

摘要

It is already demonstrated that selected features have a much larger effect to the overall performance in speech applications accuracy than the selected generative models have. In this paper, we propose subband perceptually weighted transformation (SPWT) applied on magnitude spectrum to improve the performance of single-channel separation scenario (SCSS). In particular, we compare three feature types namely, log-spectrum, magnitude spectrum and the proposed SPWT. A comprehensive statistical analysis is performed to evaluate the performance of a VQ-based SCSS framework in terms of the lower error bound. At the core of this approach are two trained codebooks of the quantized feature vectors of speakers, whereby the main evaluation for separation is performed. The simulation results show that the proposed transformation offers an attractive candidate to improve the separation performance of model-based SCSS. It is also observed that the proposed feature can result in a lower-error bound in terms of the spectral distortion (SD) as well as higher SSNR in comparison with other features.
机译:已经证明,所选功能对语音应用精度的整体性能具有比所选择的生成模型的总体性能更大。在本文中,我们提出了在幅度谱上应用的子带感知加权转换(SPWT),以提高单通道分离场景(SCSS)的性能。特别是,我们比较三种特征类型,即log-spectrum,幅度谱和所提出的spwt。执行全面的统计分析,以评估基于VQ的SCSS框架的性能,从而缩小的误差。在这种方法的核心,是扬声器的量化特征向量的两个训练有素的码本,从而执行分离的主要评估。仿真结果表明,该改造提供了一种有吸引力的候选者,可以提高基于模型的SCS的分离性能。还观察到,所提出的特征可以在与其他特征相比,在光谱失真(SD)以及更高的SSNR方面产生较低的误差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号