IEEE International Conference on Acoustics, Speech and Signal Processing

DO as I Mean, Not as I Say: Sequence Loss Training for Spoken Language Understanding



Abstract

Spoken language understanding (SLU) systems extract transcriptions, as well as semantics of intent or named entities from speech, and are essential components of voice-activated systems. SLU models, which either directly extract semantics from audio or are composed of pipelined automatic speech recognition (ASR) and natural language understanding (NLU) models, are typically trained via differentiable cross-entropy losses, even when the relevant performance metrics of interest are word or semantic error rates. In this work, we propose non-differentiable sequence losses based on SLU metrics as a proxy for semantic error and use the REINFORCE trick to train ASR and SLU models with this loss. We show that custom sequence loss training achieves state-of-the-art results on open SLU datasets and leads to a 6% relative improvement in both ASR and NLU performance metrics on large proprietary datasets. We also demonstrate how the semantic sequence loss training paradigm can be used to update ASR and SLU models without transcripts, using semantic feedback alone.
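
To make the REINFORCE-based sequence loss concrete, the sketch below illustrates the general idea under illustrative assumptions (it is not the paper's implementation): a toy autoregressive decoder stands in for the ASR/SLU model, token-level edit distance stands in for the word or semantic error rate, and the mean reward over the batch serves as a simple variance-reduction baseline. Hypotheses are sampled from the model, scored with the non-differentiable metric, and their log-probabilities are weighted by the baseline-subtracted reward.

```python
# Minimal REINFORCE sequence-loss sketch (PyTorch). The model, reward, and
# hyperparameters are illustrative assumptions, not the paper's exact setup.
import torch
import torch.nn as nn


def error_rate(hyp, ref):
    """Token-level edit distance normalized by reference length (stand-in for WER/SemER)."""
    d = [[i + j if i * j == 0 else 0 for j in range(len(ref) + 1)] for i in range(len(hyp) + 1)]
    for i in range(1, len(hyp) + 1):
        for j in range(1, len(ref) + 1):
            d[i][j] = min(d[i - 1][j] + 1,
                          d[i][j - 1] + 1,
                          d[i - 1][j - 1] + (hyp[i - 1] != ref[j - 1]))
    return d[len(hyp)][len(ref)] / max(len(ref), 1)


class TinyDecoder(nn.Module):
    """Toy autoregressive decoder standing in for the ASR/SLU model."""
    def __init__(self, vocab=32, hidden=64):
        super().__init__()
        self.embed = nn.Embedding(vocab, hidden)
        self.rnn = nn.GRUCell(hidden, hidden)
        self.out = nn.Linear(hidden, vocab)

    def sample(self, batch, steps, device="cpu"):
        """Sample token sequences and accumulate their log-probabilities."""
        h = torch.zeros(batch, self.rnn.hidden_size, device=device)
        tok = torch.zeros(batch, dtype=torch.long, device=device)   # BOS token id 0
        seqs, logps = [], torch.zeros(batch, device=device)
        for _ in range(steps):
            h = self.rnn(self.embed(tok), h)
            dist = torch.distributions.Categorical(logits=self.out(h))
            tok = dist.sample()
            logps = logps + dist.log_prob(tok)
            seqs.append(tok)
        return torch.stack(seqs, dim=1), logps


model = TinyDecoder()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
refs = [[3, 5, 7, 2], [4, 4, 9, 1]]                      # toy reference token sequences

for step in range(10):
    samples, logps = model.sample(batch=len(refs), steps=4)
    # Reward = negative error rate of each sampled hypothesis against its reference.
    rewards = torch.tensor([-error_rate(s.tolist(), r) for s, r in zip(samples, refs)])
    baseline = rewards.mean()                             # simple variance-reduction baseline
    # REINFORCE: increase log-prob of samples whose reward beats the baseline.
    loss = -((rewards - baseline) * logps).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
```

Because the gradient flows only through the log-probabilities of the sampled tokens, the reward can be any non-differentiable SLU metric; in practice, sampling several hypotheses per utterance and using a stronger baseline reduces the variance of the gradient estimate.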
