IEEE International Conference on Acoustics, Speech and Signal Processing

What is best for spoken language understanding: small but task-dependant embeddings or huge but out-of-domain embeddings?


Abstract

Word embeddings have been shown to be a great asset for several Natural Language and Speech Processing tasks. While they have already been evaluated on various NLP tasks, their evaluation on spoken language understanding (SLU) is less studied. The goal of this study is two-fold: first, it focuses on the semantic evaluation of common word embedding approaches for the SLU task; second, it investigates the use of two different data sets to train the embeddings: a small, task-dependent corpus or a huge, out-of-domain corpus. Experiments are carried out on 5 benchmark corpora (ATIS, SNIPS, SNIPS70, M2M, MEDIA), on which a relevance ranking was proposed in the literature. Interestingly, the performance of the embeddings is independent of the difficulty of the corpora. Moreover, the embeddings trained on the huge, out-of-domain corpus yield better results than the ones trained on the small, task-dependent corpus.
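As a rough illustration of the two training regimes compared in the abstract, the sketch below builds an embedding matrix for a toy SLU vocabulary in two ways: by training a small Word2Vec model on task utterances, and by loading large pretrained out-of-domain vectors. The gensim models, the GloVe download, the toy corpus, and the 100-dimensional size are illustrative assumptions, not the setup used in the paper.

```python
# Hypothetical sketch of the two embedding regimes compared in the paper:
# small/task-dependent vs. huge/out-of-domain. Not the authors' exact setup.
import numpy as np
import gensim.downloader
from gensim.models import Word2Vec

# A tiny, task-dependent "corpus": tokenized SLU training utterances (ATIS-style).
task_corpus = [
    ["show", "me", "flights", "from", "boston", "to", "denver"],
    ["i", "want", "to", "book", "a", "flight", "to", "paris", "tomorrow"],
]

# Option 1: small but task-dependent embeddings, trained directly on the SLU corpus.
small_model = Word2Vec(task_corpus, vector_size=100, window=5, min_count=1, epochs=50)

# Option 2: huge but out-of-domain embeddings, pretrained on generic web/news text.
big_vectors = gensim.downloader.load("glove-wiki-gigaword-100")  # ~400k-word vocabulary

def embedding_matrix(vocab, keyed_vectors, dim=100):
    """Build the embedding matrix fed to a downstream slot-tagging network.
    Out-of-vocabulary words fall back to a small random vector."""
    rng = np.random.default_rng(0)
    matrix = np.zeros((len(vocab), dim), dtype=np.float32)
    for idx, word in enumerate(vocab):
        if word in keyed_vectors:
            matrix[idx] = keyed_vectors[word]
        else:
            matrix[idx] = rng.normal(scale=0.1, size=dim)
    return matrix

vocab = sorted({w for utt in task_corpus for w in utt})
small_matrix = embedding_matrix(vocab, small_model.wv)
big_matrix = embedding_matrix(vocab, big_vectors)
print(small_matrix.shape, big_matrix.shape)  # both (len(vocab), 100)
```

In an SLU slot tagger, such a matrix would typically initialize the embedding layer; the two regimes differ only in where the word vectors come from, which is the comparison the study evaluates.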
