首页> 外文会议>2011 IEEE International Conference on Acoustics, Speech and Signal Processing >Localization of non-linguistic events in spontaneous speech by Non-Negative Matrix Factorization and Long Short-Term Memory

【24h】

Localization of non-linguistic events in spontaneous speech by Non-Negative Matrix Factorization and Long Short-Term Memory

机译：非负矩阵分解和长短时记忆对自发语音中非语言事件的定位

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Features generated by Non-Negative Matrix Factorization (NMF) have successfully been introduced into robust speech processing, including noise-robust speech recognition and detection of non-linguistic vocalizations. In this study, we introduce a novel tandem approach by integrating likelihood features derived from NMF into Bidirectional Long Short-Term Memory Recurrent Neural Networks (BLSTM-RNNs) in order to dynamically localize non-linguistic events, i. e., laughter, vocal, and non-vocal noise, in highly spontaneous speech. We compare our tandem architecture to a baseline conventional phoneme-HMM-based speech recognizer, and achieve a relative reduction of the frame error rate by 37.5% in the discrimination of speech and different non-speech segments.

机译：非负矩阵分解（NMF）生成的功能已成功引入健壮的语音处理中，包括噪声健壮的语音识别和非语言发声的检测。在这项研究中，我们通过将源自NMF的似然特征集成到双向长期短期记忆递归神经网络（BLSTM-RNN）中，从而动态定位非语言事件，从而引入了一种新颖的串联方法。例如，高度自发的语音中的笑声，人声和非人声噪声。我们将串联架构与基于基线的传统音素-基于HMM的语音识别器进行比较，并在区分语音和不同的非语音段方面实现了37.5％的帧错误率的相对降低。

著录项

来源
《2011 IEEE International Conference on Acoustics, Speech and Signal Processing 》|2011年|p.5840-5843|共4页
会议地点
作者
Weninger Felix; Schuller Bjorn; Wollmer Martin; Rigoll Gerhard;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信理论 ;
关键词
Non-Linguistic Vocalizations; Non-Negative Matrix Factorization; Recurrent Neural Networks;

机译：非语言发声;非负矩阵分解;递归神经网络;

相似文献

外文文献
中文文献
专利

1. Supervised Single Channel Speech Enhancement Based on Stationary Wavelet Transforms and Non-negative Matrix Factorization with Concatenated Framing Process and Subband Smooth Ratio Mask [J] . Journal of signal processing systems for signal, image, and video technology . 2020 ,第4期

机译：基于级联过程和子带平滑率掩码的平稳小波变换和非负矩阵分解的有监督单通道语音增强
2. Supervised single channel dual domains speech enhancement using sparse non-negative matrix factorization [J] . Digital Signal Processing . 2020 ,第期

机译：使用稀疏非负矩阵分解监督单通道双域语音增强
3. Single-Channel Speech Separation Based on Non-negative Matrix Factorization and Factorial Conditional Random Field [J] . Li Xu, Tu Ming, Wang Xiaofei, Chinese Journal of Electronics . 2018 ,第5期

机译：基于非负矩阵分解和阶乘条件随机场的单通道语音分离
4. LOCALIZATION OF NON-LINGUISTIC EVENTS IN SPONTANEOUS SPEECH BY NON-NEGATIVE MATRIX FACTORIZATION AND LONG SHORT-TERM MEMORY [C] . Felix Weninger, Bjorn Schuller, Martin Wollmer, IEEE International Conference on Acoustics, Speech and Signal Processing . 2011

机译：非负矩阵分解和长短期记忆的自发言语中的非语言事件的本地化
5. Group Convex Orthogonal Non-negative Matrix Tri-Factorization with Applications in FC Fingerprinting [D] . ?Li, Kendrick 2020

机译：集团凸正交非负矩阵三分解与 FC 指纹应用
6. Encoding of rat working memory by power of multi-channel local field potentials via sparse non-negative matrix factorization [O] . Xu Liu, Tiao-Tiao Liu, Wen-Wen Bai, 2013

机译：稀疏非负矩阵分解通过多通道局部场电势对大鼠工作记忆进行编码
7. Discrimination of speech and non-linguistic vocalizations by non-negative matrix factorization [O] . Björn Schuller, Felix Weninger 2010

机译：通过非负矩阵分解对语音和非语言发声进行区分

Localization of non-linguistic events in spontaneous speech by Non-Negative Matrix Factorization and Long Short-Term Memory

摘要

著录项

相似文献

相关主题

期刊订阅