What is best for spoken language understanding: small but task-dependant embeddings or huge but out-of-domain embeddings?

机译：什么是最好的口语理解：小但任务依赖的嵌入或巨大但域外嵌入式？

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Word embeddings are shown to be a great asset for several Natural Language and Speech Processing tasks. While they are already evaluated on various NLP tasks, their evaluation on spoken or natural language understanding (SLU) is less studied. The goal of this study is two-fold: firstly, it focuses on semantic evaluation of common word embeddings approaches for SLU task; secondly, it investigates the use of two different data sets to train the embeddings: small and task-dependent corpus or huge and out-of-domain corpus. Experiments are carried out on 5 benchmark corpora (ATIS, SNIPS, SNIPS70, M2M, MEDIA), on which a relevance ranking was proposed in the literature. Interestingly, the performance of the embeddings is independent of the difficulty of the corpora. Moreover, the embeddings trained on huge and out-of-domain corpus yields to better results than the ones trained on small and task-dependent corpus.

机译：Word Embeddings被证明是几种自然语言和语音处理任务的伟大资产。虽然它们已经评估了各种NLP任务，但他们对口语或自然语言理解（SLU）的评估较少。本研究的目标是两倍：首先，它侧重于对SLU任务的共同词嵌入方法的语义评估; 其次，它调查了两个不同的数据集来训练嵌入式：小型和任务依赖性语料库或巨大和域名语料库。实验是在5个基准（ATIS，Snips，Snips70，M2M，Media）上进行的实验，其中在文献中提出了相关性排名。有趣的是，嵌入式的表现与Group的难度无关。此外，胚胎训练在巨大的域外语料库中，从培训的患者培训的巨大突出的语料库培训，而不是在小型和任务依赖性语料库上培训的结果。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2020年|p8064-8682|共5页
会议地点
作者
Sahar Ghannay; Antoine Neuraz; Sophie Rosset;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词
spoken language understanding; word embeddings;

机译：口语语言理解;Word Embeddings;

相似文献

外文文献
中文文献
专利

1. Neural sentence embedding using only in-domain sentences for out-of-domain sentence detection in dialog systems [J] . Ryu Seonghan, Kim Seokhwan, Choi Junhwi, Pattern recognition letters . 2017,第Mara1期

机译：在对话系统中仅使用域内语句进行神经语句嵌入以进行域外语句检测
2. Comparing Stochastic Approaches to Spoken Language Understanding in Multiple Languages [J] . Hahn S., Dinarelli M., Raymond C., Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第6期

机译：比较使用多种语言进行口语理解的随机方法
3. Incorporating Demographic Embeddings Into Language Understanding [J] . Garten Justin, Kennedy Brendan, Hoover Joe, Cognitive science . 2019,第1期

机译：将人口统计嵌入语言理解中
4. What is best for spoken language understanding: small but task-dependant embeddings or huge but out-of-domain embeddings? [C] . Sahar Ghannay, Antoine Neuraz, Sophie Rosset IEEE International Conference on Acoustics, Speech and Signal Processing . 2020

机译：什么是最能理解口语的语言：小的但依赖于任务的嵌入，还是巨大的但超出域的嵌入？
5. Understanding Word Embedding Stability Across Languages and Applications [D] . Wendlandt Burdick, Laura Anne. 2020

机译：了解跨语言和应用程序的Word嵌入稳定性
6. Corrigendum: Effects of Two Teaching Strategies on Preschoolers Oral Language Skills: Repeated Read-Aloud With Question and Answer Teaching Embedded and Repeated Read-Aloud With Executive Function Activities Embedded [O] . Hsin Ying Chien 2020

机译：勘误：两种教学策略对学龄前儿童口语技能的影响：重复问答式嵌入式朗读和嵌入执行功能活动的重复朗读
7. What is best for spoken language understanding: small but task-dependant embeddings or huge but out-of-domain embeddings? [O] . Sahar Ghannay, Antoine Neuraz, Sophie Rosset 2020

机译：什么是最好的口语理解：小但任务依赖的嵌入或巨大但域外嵌入式？
8. Real-Time Spoken-Language System for Interactive Problem-Solving, Combining Linguistic and Statistical Technology for Improved Spoken Language Understanding. [R] . Moore, R. C., Cohen, M. H. 1993

机译：交互式问题解决的实时语言系统，结合语言和统计技术提高口语理解能力。

What is best for spoken language understanding: small but task-dependant embeddings or huge but out-of-domain embeddings?

摘要

著录项

相似文献

相关主题

期刊订阅