IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

End-to-End Spoken Language Understanding Using Transformer Networks and Self-Supervised Pre-Trained Features


Abstract

Transformer networks and self-supervised pre-training have consistently delivered state-of-the-art results in the field of natural language processing (NLP); however, their merits in the field of spoken language understanding (SLU) still need further investigation. In this paper we introduce a modular End-to-End (E2E) SLU architecture based on transformer networks, which allows the use of self-supervised pre-trained acoustic features, pre-trained model initialization, and multi-task training. Several SLU experiments for predicting intent and entity labels/values are performed on the ATIS dataset. These experiments investigate the interaction of pre-trained model initialization and multi-task training with either traditional filterbank or self-supervised pre-trained acoustic features. Results show not only that self-supervised pre-trained acoustic features outperform filterbank features in almost all the experiments, but also that, when these features are used in combination with multi-task training, they almost eliminate the need for pre-trained model initialization.
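The abstract describes the architecture only at a high level. Below is a minimal, hypothetical PyTorch sketch of what a transformer-based E2E SLU model with multi-task heads (an utterance-level intent classifier plus a frame-level entity/slot tagger) over acoustic feature sequences could look like; the class name, dimensions, label counts, and pooling choice are illustrative assumptions, not the paper's implementation.

```python
# Hypothetical sketch of a multi-task transformer SLU model (not the paper's code).
# Input features could be filterbanks or self-supervised pre-trained acoustic
# vectors; here they are simply a (batch, time, feat_dim) tensor.
import torch
import torch.nn as nn


class MultiTaskSLUTransformer(nn.Module):
    def __init__(self, feat_dim=80, d_model=256, n_heads=4, n_layers=6,
                 n_intents=26, n_slot_tags=120):
        super().__init__()
        self.input_proj = nn.Linear(feat_dim, d_model)  # project acoustic features
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=n_heads, dim_feedforward=4 * d_model,
            batch_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=n_layers)
        self.intent_head = nn.Linear(d_model, n_intents)   # utterance-level head
        self.slot_head = nn.Linear(d_model, n_slot_tags)   # frame-level head

    def forward(self, feats, pad_mask=None):
        # feats: (batch, time, feat_dim); pad_mask: (batch, time), True = padding
        x = self.encoder(self.input_proj(feats), src_key_padding_mask=pad_mask)
        intent_logits = self.intent_head(x.mean(dim=1))  # pool over time
        slot_logits = self.slot_head(x)                  # per-frame entity tags
        return intent_logits, slot_logits


# Multi-task training objective: a weighted sum of intent and slot losses.
model = MultiTaskSLUTransformer()
feats = torch.randn(2, 300, 80)                  # e.g., 300 frames of 80-dim features
intent_logits, slot_logits = model(feats)
intent_loss = nn.functional.cross_entropy(intent_logits, torch.tensor([3, 7]))
slot_loss = nn.functional.cross_entropy(
    slot_logits.reshape(-1, slot_logits.size(-1)),
    torch.randint(0, 120, (2 * 300,)))
loss = intent_loss + slot_loss
```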

