A Deep Low-Dimensional Speech Feature Extraction Method Based on Nonnegative Matrix Factorization

Qin Chuxiong; Zhang Lianhai

Abstract

As a low-dimensional feature extracted with a deep neural network (DNN), the bottleneck feature (BNF) has achieved great success in continuous speech recognition. However, when a bottleneck deep neural network (BNDNN) is trained, the presence of the bottleneck layer lowers the frame accuracy of the output layer, which in turn degrades the performance of the bottleneck feature. To address this problem, this paper proposes a nonnegative matrix factorization based approach that extracts low-dimensional features from a DNN without a bottleneck layer. Specifically, semi-nonnegative matrix factorization and convex-nonnegative matrix factorization are applied to a hidden-layer weight matrix to obtain a basis matrix, which serves as the weight matrix of a new feature layer; the new feature is then extracted by forward-propagating the input data through this layer with no bias vector. Experiments show that the feature follows a relatively stable pattern and is applicable to different recognition tasks and network structures. On a corpus with sufficient training data, the proposed feature achieves almost the same recognition performance as the conventional bottleneck feature. In a low-resource setting, the recognition accuracy of the tandem system built on the new feature is clearly higher than that of both the DNN hybrid system and the bottleneck-feature tandem system.
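The factorization step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it covers only the semi-nonnegative case, using the standard multiplicative update rule for semi-NMF, and it rests on several assumptions that the abstract does not confirm. These are a row-vector layer convention (`y = h @ W + b`, with `W` of shape inputs x outputs), random initialization of the nonnegative factor, and a linear feature output. The names `semi_nmf`, `extract_features`, `W`, `H` and `k` are hypothetical.

```python
import numpy as np


def semi_nmf(X, k, n_iter=200, eps=1e-9, seed=0):
    """Approximate X (p x n) by F @ G.T with F unconstrained and G >= 0."""
    rng = np.random.default_rng(seed)
    n = X.shape[1]
    # Assumption: random positive initialization of the nonnegative factor.
    G = rng.random((n, k)) + 0.1

    def pos(A):
        return (np.abs(A) + A) / 2  # elementwise positive part

    def neg(A):
        return (np.abs(A) - A) / 2  # elementwise negative part (as a >= 0 value)

    for _ in range(n_iter):
        # With G fixed, F has a closed-form least-squares solution.
        F = X @ G @ np.linalg.pinv(G.T @ G)
        XtF = X.T @ F  # n x k
        FtF = F.T @ F  # k x k
        num = pos(XtF) + G @ neg(FtF)
        den = neg(XtF) + G @ pos(FtF)
        # Multiplicative update keeps every entry of G nonnegative.
        G = G * np.sqrt((num + eps) / (den + eps))

    F = X @ G @ np.linalg.pinv(G.T @ G)  # final F consistent with the last G
    return F, G


def extract_features(H, F):
    """Project hidden activations H (frames x p) onto the basis F (p x k).

    No bias vector is added, matching the description in the abstract.
    Assumption: the feature is the linear output of this layer.
    """
    return H @ F


# Toy usage with random stand-ins for a trained weight matrix and activations.
rng = np.random.default_rng(1)
W = rng.standard_normal((64, 48))  # hypothetical hidden-layer weights
H = rng.random((5, 64))            # hypothetical activations for 5 frames
F, G = semi_nmf(W, k=8)
features = extract_features(H, F)  # 5 frames, 8 dimensions each
```

Under these assumptions, `h @ W` is approximated by `(h @ F) @ G.T`, so `h @ F` acts as a k-dimensional linear layer inserted into a network that was trained without a bottleneck. The convex-NMF variant additionally constrains the basis to be a combination of the columns of the factorized matrix and is not shown here.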

Bibliographic details

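Source
Journal of Data Acquisition and Processing (《数据采集与处理》) | 2017, No. 5 | pp. 921-930 | 10 pages
Authors
Qin Chuxiong; Zhang Lianhai
Affiliation
College of Information Systems Engineering, PLA Information Engineering University, Zhengzhou 450001 (both authors)
Original format: PDF
Language of text: Chinese
CLC classification: Speech recognition and equipment
Keywords
continuous speech recognition; deep neural network; semi-nonnegative matrix factorization; convex-nonnegative matrix factorization; low-dimensional feature
Indexed: 2023-07-25 17:42:04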
