首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Boosted Locality Sensitive Hashing: Discriminative Binary Codes for Source Separation
【24h】

Boosted Locality Sensitive Hashing: Discriminative Binary Codes for Source Separation

机译:增强的本地敏感哈希:用于源分离的区分性二进制代码

获取原文

摘要

Speech enhancement tasks have seen significant improvements with the advance of deep learning technology, but with the cost of increased computational complexity. In this study, we propose an adaptive boosting approach to learning locality sensitive hash codes, which represent audio spectra efficiently. We use the learned hash codes for single-channel speech denoising tasks as an alternative to a complex machine learning model, particularly to address the resource-constrained environments. Our adaptive boosting algorithm learns simple logistic regressors as the weak learners. Once trained, their binary classification results transform each spectrum of test noisy speech into a bit string. Simple bitwise operations calculate Hamming distance to find the $mathcal{K}$-nearest matching frames in the dictionary of training noisy speech spectra, whose associated ideal binary masks are averaged to estimate the denoising mask for that test mixture. Our proposed learning algorithm differs from AdaBoost in the sense that the projections are trained to minimize the distances between the self-similarity matrix of the hash codes and that of the original spectra, rather than the misclassification rate. We evaluate our discriminative hash codes on the TIMIT corpus with various noise types, and show comparative performance to deep learning methods in terms of denoising performance and complexity.
机译:语音增强任务已经随着深度学习技术的进步而得到了显着改善,但代价是计算复杂性增加。在这项研究中,我们提出了一种自适应的增强方法来学习局部敏感的哈希码,该哈希码可以有效地表示音频频谱。我们将学习到的哈希码用于单通道语音降噪任务,以替代复杂的机器学习模型,尤其是解决资源受限的环境。我们的自适应提升算法将简单的逻辑回归器作为弱学习者进行学习。经过训练后,它们的二进制分类结果会将测试噪声语音的每个频谱转换为一个位串。简单的按位运算会计算汉明距离,以在训练有声语音频谱字典中找到$ \ mathcal {K} $最近的匹配帧,将其相关的理想二进制掩码求平均以估计该测试混合物的降噪掩码。我们提出的学习算法与AdaBoost的区别在于,对投影进行训练以最大程度地减小哈希码的自相似矩阵与原始光谱的自相似矩阵之间的距离,而不是错误分类率。我们在TIMIT语料库上评估具有各种噪声类型的区分性哈希码,并在降低性能和复杂度方面显示与深度学习方法相比的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号