Performance Evaluation for Transform Domain Model-based Single-channel Speech Separation

机译：基于转换域模型的单声道语音分离的性能评估

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is already demonstrated that selected features have a much larger effect to the overall performance in speech applications accuracy than the selected generative models have. In this paper, we propose subband perceptually weighted transformation (SPWT) applied on magnitude spectrum to improve the performance of single-channel separation scenario (SCSS). In particular, we compare three feature types namely, log-spectrum, magnitude spectrum and the proposed SPWT. A comprehensive statistical analysis is performed to evaluate the performance of a VQ-based SCSS framework in terms of the lower error bound. At the core of this approach are two trained codebooks of the quantized feature vectors of speakers, whereby the main evaluation for separation is performed. The simulation results show that the proposed transformation offers an attractive candidate to improve the separation performance of model-based SCSS. It is also observed that the proposed feature can result in a lower-error bound in terms of the spectral distortion (SD) as well as higher SSNR in comparison with other features.

机译：已经证明，所选功能对语音应用精度的整体性能具有比所选择的生成模型的总体性能更大。在本文中，我们提出了在幅度谱上应用的子带感知加权转换（SPWT），以提高单通道分离场景（SCSS）的性能。特别是，我们比较三种特征类型，即log-spectrum，幅度谱和所提出的spwt。执行全面的统计分析，以评估基于VQ的SCSS框架的性能，从而缩小的误差。在这种方法的核心，是扬声器的量化特征向量的两个训练有素的码本，从而执行分离的主要评估。仿真结果表明，该改造提供了一种有吸引力的候选者，可以提高基于模型的SCS的分离性能。还观察到，所提出的特征可以在与其他特征相比，在光谱失真（SD）以及更高的SSNR方面产生较低的误差。

著录项

来源
《IEEE/ACM International Conference on Computer Systems and Applications》|2009年||共8页
会议地点
作者
Pejman Mowlaee; Abolghasem Sayadiyan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词
Single-channel speech separation; Magnitude spectrum; Transform domain; Vector Quantization; Spectral Distortion;

机译：单通道语音分离;幅度谱;转换域;矢量量化;频谱失真;

相似文献

外文文献
中文文献
专利

1. Evaluating single-channel speech separation performance in transform-domain [J] . Pejman?Mowlaee, Abolghasem?Sayadiyan, Hamid?Sheikhzadeh Journal of Zhejiang university science . 2010,第3期

机译：评估变换域中的单通道语音分离性能
2. Evaluating single-channel speech separation performance in transform-domain [J] . Pejman MOWLAEE, Abolghasem SAYADIYAN, Hamid SHEIKHZADEH 浙江大学学报（英文版）（C辑：计算机与电子） . 2010,第003期

机译：评估变换域中的单通道语音分离性能
3. More general performance evaluation for single-channel PCMA signals blind separation [J] . Chaowei Duan, Yafeng Zhan, Hao Liang Communications, IET . 2017,第15期

机译：更多<？show [AQ ID = Q1]？>单通道PCMA信号盲分离的常规性能评估
4. Performance evaluation for transform domain model-based single-channel speech separation [C] . Mowlaee P., Sayadiyan A. IEEE/ACS International Conference on Computer Systems and Applications (AICCSA 2009) . 2009

机译：基于变换域模型的单通道语音分离的性能评估
5. Transform Domain Model-Based Wideband Speech Enhancement with Hearing Aid Applications. [D] . Laska, Brady N. M. 2010

机译：助听器应用基于变换域模型的宽带语音增强。
6. Impact of phase estimation on single-channel speech separation based on time-frequency masking [O] . Florian Mayer, Donald S. Williamson, Pejman Mowlaee, -1

机译：基于时频掩蔽的相位估计对单通道语音分离的影响
7. A MAP Criterion for Detecting the Number of Speakers at frame level in Model-based Single-Channel Speech Separation [O] . Mowlaee, Pejman, Christensen, Mads Græsbøll, Tan, Zheng-Hua, 2010

机译：基于模型的单通道语音分离中检测帧级扬声器数量的map准则
8. Frequency Domain Speech Compression Using the Karhunen-Loeve Transform [R] . Dryley, D. W. 1993

机译：使用Karhunen-Loeve变换的频域语音压缩

Performance Evaluation for Transform Domain Model-based Single-channel Speech Separation

摘要

著录项

相似文献

相关主题

期刊订阅