首页> 外文学位 >Short-time Independent Component Analysis for blind separation of speech sources.
【24h】

Short-time Independent Component Analysis for blind separation of speech sources.

机译:短时独立分量分析,用于语音源的盲分离。

获取原文
获取原文并翻译 | 示例

摘要

Independent Component Analysis (ICA) has long been regarded as a powerful technique for speech source separation. In practice, however, speaker moving or reverberant environments may necessitate ICA to be implemented in short time intervals, which makes the fundamental assumption of sources' independence collapse in ICA. This leads to two important but often overlooked problems, namely: (1) excursion of global optimum from the desired solution and (2) diffusion of local optima in search of the de-mixing matrix. These two problems occur in most practical situations and greatly degrade the performance of the existing ICA algorithms.; Based on the insight into the effect on the aforementioned problems by the input sources as well as the mixing channel, three basic short time Local Optima Distribution (LOD) types are investigated. Information is derived from the characteristics of these LOD types for: (1) choosing simultaneous or sequential ICA algorithm; (2) shrinking feasible search region; and (3) producing possible initial points in search of the de-mixing matrix. As a result, the technique of LOD-based ICA is developed in this thesis to assign different procedures according to the LOD type of the observed short time mixtures. The analytical and simulation results demonstrated that more accurate de-mixing matrix estimation could be obtained; thereby producing improved separation performance.; Among all the three LOD types, the Dominant LOD manifests to be with comparatively higher efficiency in yielding accurate separation performance. The production mechanism of the Dominant LOD indicates that higher energy ratio of sources helps to build this type of LOD. Considering the sparse energy distribution of speech signals in the time-frequency domain, the Dominant LOD may arise in some short time subbands even though it appears to be Non-dominant LOD in its fullband. Therefore the proposed LOD-based ICA is extended to the frequency subbands for more opportunities to attain such Dominant LOD type.; The effectiveness of the proposed short time LOD-based ICA is validated by applying it to a speaker-moving model and a mixing system with abrupt changes, which approaches the practical applications better since the mixing system is not always constant as in standard ICA model. We have also explored the separation task with noise-contaminated speech signals. This suggests us that: other than the long time analysis, the short time analysis may provide an alternative means with extra information for separation when the independence information is impaired and subsequently fails to yield the desirable separation performance.
机译:长期以来,独立成分分析(ICA)一直被认为是语音源分离的强大技术。然而,实际上,说话者移动或混响环境可能需要在较短的时间间隔内实施ICA,这使ICA中源独立性的基本假设崩溃了。这导致两个重要但经常被忽视的问题,即:(1)从所需解中求出全局最优值,以及(2)寻找解混合矩阵时局部最优值的扩散。这两个问题在大多数实际情况下都会发生,大大降低了现有ICA算法的性能。基于对输入源以及混合通道对上述问题的影响的了解,研究了三种基本的短时局部最优分布(LOD)类型。信息是从以下LOD类型的特征中得出的:(1)选择同时或顺序ICA算法; (2)缩小可行搜索区域; (3)产生可能的初始点以搜索去混合矩阵。因此,本文开发了基于LOD的ICA技术,根据观察到的短时混合物的LOD类型分配不同的程序。分析和仿真结果表明,可以获得更准确的解混矩阵估计。从而产生改进的分离性能。在这三种LOD类型中,优势LOD在产生精确的分离性能方面表现出相对较高的效率。优势LOD的产生机理表明,较高的能量能量比有助于构建这种类型的LOD。考虑到语音信号在时频域中的稀疏能量分布,即使在其全频带中似乎是非主导LOD,也可能在一些短时间子带中出现主导LOD。因此,建议的基于LOD的ICA扩展到频率子带,以获得更多机会获得这种主导LOD类型。提议的基于LOD的短时ICA的有效性已通过将其应用于扬声器移动模型和突然变化的混音系统进行了验证,由于混音系统并不总是像标准ICA模型中那样恒定,因此可以更好地接近实际应用。我们还探讨了受噪声污染的语音信号的分离任务。这建议我们:除了长时间分析之外,短时分析可能​​会在独立性信息受损并且随后无法产生理想的分离性能时,为分离提供额外的信息。

著录项

  • 作者

    Zhang, Jing.;

  • 作者单位

    The Chinese University of Hong Kong (Hong Kong).;

  • 授予单位 The Chinese University of Hong Kong (Hong Kong).;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2007
  • 页码 151 p.
  • 总页数 151
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号