Speech Enhancement Based on Discrete Wavelet Packet Transform and Itakura-Saito Nonnegative Matrix Factorisation

Houguang LIU; Wenbo WANG; Lin XUE; Jianhua YANG; Zhihua WANG; Chunli HUA

首页> 外文期刊>Archives of acoustics >Speech Enhancement Based on Discrete Wavelet Packet Transform and Itakura-Saito Nonnegative Matrix Factorisation

【24h】

Speech Enhancement Based on Discrete Wavelet Packet Transform and Itakura-Saito Nonnegative Matrix Factorisation

机译：基于离散小波包变换和Itakura-Saito非负矩阵分子的语音增强

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nonnegative matrix factorization (NMF) is one of the most popular machine learning tools for speech enhancement (SE). However, there are two problems reducing the performance of the traditional NMF-based SE algorithms. One is related to the overlap-and-add operation used in the short time Fourier transform (STFT) based signal reconstruction, and the other is the Euclidean distance used commonly as an objective function; these methods can cause distortion in the SE process. In order to get over these shortcomings, we propose a novel SE joint framework which combines the discrete wavelet packet transform (DWPT) and the Itakura-Saito nonnegative matrix factorisation (ISNMF). In this approach, the speech signal was first split into a series of subband signals using the DWPT. Then, the ISNMF was used to enhance the speech for each subband signal. Finally, the inverse DWPT (IDWT) was utilised to reconstruct these enhanced speech subband signals. The experimental results show that the proposed joint framework effectively enhances the performance of speech enhancement and performs better in the unseen noise case compared to the traditional NMF methods.

机译：非负矩阵分解（NMF）是用于语音增强（SE）最受欢迎的机器学习工具之一。然而，有两个问题降低了传统的基于NMF的SE算法的性能。一种与基于短时间傅里叶变换（STFT）的信号重建中使用的重叠和添加操作有关，另一个是欧几里德距离通常用作目标函数;这些方法可能导致SE过程中的失真。为了克服这些缺点，我们提出了一种新的SE联合框架，该联合框架结合了离散小波分组变换（DWPT）和Itakura-Saito非负矩阵分子（ISNMF）。在这种方法中，使用DWPT首先将语音信号分成一系列子带信号。然后，使用ISNMF来增强每个子带信号的语音。最后，利用逆dwpt（IDWT）来重建这些增强的语音子带信号。实验结果表明，与传统的NMF方法相比，所提出的联合框架有效地提高了语音增强的性能，在看不见的噪声箱中表现得更好。

著录项

来源
《Archives of acoustics》 |2020年第4期|565-572|共8页
作者
Houguang LIU; Wenbo WANG; Lin XUE; Jianhua YANG; Zhihua WANG; Chunli HUA;
展开▼
作者单位

School of Mechatronic Engineering China University of Mining and Technology Xuzhou 221116 China;

School of Mechatronic Engineering China University of Mining and Technology Xuzhou 221116 China;

School of Mechatronic Engineering China University of Mining and Technology Xuzhou 221116 China;

School of Mechatronic Engineering China University of Mining and Technology Xuzhou 221116 China;

School of Mechatronic Engineering China University of Mining and Technology Xuzhou 221116 China;

School of Mechatronic Engineering China University of Mining and Technology Xuzhou 221116 China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
speech enhancement; discrete wavelet packet transform; nonnegative matrix factorisation; Itakura-Saito divergence;

机译：语音增强;离散小波包变换;非负矩阵分子;Itakura-Saito发散;

相似文献

外文文献
中文文献
专利

1. Supervised Single Channel Speech Enhancement Based on Dual-Tree Complex Wavelet Transforms and Nonnegative Matrix Factorization Using the Joint Learning Process and Subband Smooth Ratio Mask [J] . Md Shohidul Islam, Tarek Hasan Al Mahmud, Wasim Ullah Khan, Electronics . 2019,第3期

机译：基于双树复小波变换和非负矩阵分解的联合学习过程和子带平滑率掩码的监督单通道语音增强
2. Robust audio hashing based on discrete-wavelet-transform and non-negative matrix factorisation [J] . Chen N., Wan W., Xiao H.-D. Communications, IET . 2010,第14期

机译：基于离散小波变换和非负矩阵分解的鲁棒音频哈希
3. Comparison of Citrus Fruit Surface Defect Classification using Discrete Wavelet Transform, Stationary Wavelet Transform and Wavelet Packet Transform Based Features [J] . K. Vijayarekha Research journal of applied science, engineering and technology . 2012,第24期

机译：基于离散小波变换，平稳小波变换和小波包变换的柑橘果实表面缺陷分类比较
4. Ultrasound-coupled semi-supervised nonnegative matrix factorisation for speech enhancement [C] . Barker Tom, Virtanen Tuomas, Delhomme Olivier IEEE International Conference on Acoustics, Speech and Signal Processing . 2014

机译：超声耦合半监督非负矩阵分解以增强语音
5. Improvement of Customer Baseline Calculation Methodologies of Demand Response Using Maximal Overlap Discrete Wavelet Packet Transform [D] . Opadere, Johnson. 2020

机译：使用最大重叠离散小波分组变换改进需求响应的客户基准计算方法
6. Application of 1-D discrete wavelet transform based compressed sensing matrices for speech compression [O] . Yuvraj V. Parkale, Sanjay L. Nalbalwar -1

机译：基于一维离散小波变换的压缩感知矩阵在语音压缩中的应用
7. A Lifting-Based Discrete Wavelet Transform and Discrete Wavelet Packet Processor with Support for Higher Order Wavelet Filters [O] . Andre Guntoro, Manfred Glesner 2015

机译：基于提升的离散小波变换和离散小波包处理器支持高阶小波滤波器

Speech Enhancement Based on Discrete Wavelet Packet Transform and Itakura-Saito Nonnegative Matrix Factorisation

摘要

著录项

相似文献

相关主题

期刊订阅