EXTRACTING DOMAIN INVARIANT FEATURES BY UNSUPERVISED LEARNING FOR ROBUST AUTOMATIC SPEECH RECOGNITION

机译：通过无监督学习提取域不变特征，用于强大的自动语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The performance of automatic speech recognition (ASR) systems can be significantly compromised by previously unseen conditions, which is typically due to a mismatch between training and testing distributions. In this paper, we address robustness by studying domain invariant features, such that domain information becomes transparent to ASR systems, resolving the mismatch problem. Specifically, we investigate a recent model, called the Factorized Hierarchical Variational Autoencoder (FHVAE). FHVAEs learn to factorize sequence-level and segment-level attributes into different latent variables without supervision. We argue that the set of latent variables that contain segment-level information is our desired domain invariant feature for ASR. Experiments are conducted on Aurora-4 and CHiME-4, which demonstrate 41% and 27% absolute word error rate reductions respectively on mismatched domains.

机译：自动语音识别（ASR）系统的性能可以通过先前看不见的条件显着地损害，这通常是由于训练和测试分布之间的不匹配。在本文中，我们通过研究域不变特征来解决鲁棒性，使得域信息对ASR系统变得透明，解决了不匹配问题。具体而言，我们研究了最近的模型，称为分解分层变形Autiachoder（FHVAE）。 FHVAES学会在没有监控的情况下将序列级别和分段级别属性分解为不同的潜在变量。我们争辩说，包含段级信息的潜在变量是我们的ASR所需的域不变功能。实验在极光-4和Chime-4上进行，分别在错配域中展示了41％和27％的绝对字错误率减少。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2018年|5089-5738p|共5页
会议地点
作者
Wei-Ning Hsu; James Glass;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词
robust speech recognition; factorized hierarchical variational autoencoder; domain invariant representations;

机译：强大的语音识别;分解分层变化自身拓扑;域不变表示;

相似文献

外文文献
中文文献
专利

1. Bio-inspired unsupervised learning of visual features leads to robust invariant object recognition [J] . Kheradpisheh Saeed Reza, Ganjtabesh Mohammad, Masquelier Timothee Neurocomputing . 2016,第sepa12期

机译：受生物启发的视觉特征无监督学习导致强大的不变对象识别
2. Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition [J] . Shimada Kazuki, Bando Yoshiaki, Mimura Masato, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第5期

机译：基于多通道NMF信息波束形成的无监督语音增强技术，用于强噪声自动语音识别
3. Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition [J] . Shimada Kazuki, Bando Yoshiaki, Mimura Masato, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第5期

机译：基于多通道NMF的噪声强度自动语音识别的无监督语音增强
4. EXTRACTING DOMAIN INVARIANT FEATURES BY UNSUPERVISED LEARNING FOR ROBUST AUTOMATIC SPEECH RECOGNITION [C] . Wei-Ning Hsu, James Glass IEEE International Conference on Acoustics, Speech and Signal Processing . 2018

机译：通过无监督学习提取域不变特征，用于强大的自动语音识别
5. Towards Robust and Domain Invariant Feature Representations in Deep Learning [D] . Sankaranarayanan, Swaminathan. 2018

机译：走向深度学习中的稳健和领域不变特征表示
6. Chemical named entity recognition in patents by domain knowledge and unsupervised feature learning [O] . Yaoyun Zhang, Jun Xu, Hui Chen, 2016

机译：通过领域知识和无监督特征学习来识别专利中的化学命名实体
7. Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition [O] . Wei-Ning Hsu, James Glass 2018

机译：通过无监督学习提取域不变特征，用于强大的自动语音识别

EXTRACTING DOMAIN INVARIANT FEATURES BY UNSUPERVISED LEARNING FOR ROBUST AUTOMATIC SPEECH RECOGNITION

摘要

著录项

相似文献

相关主题

期刊订阅