Multitask Learning with Capsule Networks for Speech-to-Intent Applications

机译：用于语音到意图应用的胶囊网络多任务学习

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Voice controlled applications can be a great aid to society, especially for physically challenged people. However this requires robustness to all kinds of variations in speech. A spoken language understanding system that learns from interaction with and demonstrations from the user, allows the use of such a system in different settings and for different types of speech, even for deviant or impaired speech, while also allowing the user to choose a phrasing. The user gives a command and enters its intent through an interface, after which the model learns to map the speech directly to the right action. Since the effort of the user should be as low as possible, capsule networks have drawn interest due to potentially needing little training data compared to deeper neural networks. In this paper, we show how capsules can incorporate multitask learning, which often can improve the performance of a model when the task is difficult. The basic capsule network will be expanded with a regularisation to create more structure in its output: it learns to identify the speaker of the utterance by forcing the required information into the capsule vectors. To this end we move from a speaker dependent to a speaker independent setting.

机译：语音控制的应用程序可以为社会带来很大的帮助，特别是对于身体残障的人。然而，这要求对语音中的各种变化都具有鲁棒性。从与用户的交互和从用户的演示中学到的口语理解系统，允许在不同的设置和不同类型的语音中使用该系统，甚至用于语音不正确或受损的用户，同时还允许用户选择措辞。用户发出命令并通过界面输入其意图，此后模型学习将语音直接映射到正确的动作。由于用户的精力应尽可能少，因此与更深层的神经网络相比，胶囊网络由于可能需要很少的训练数据而引起了人们的兴趣。在本文中，我们展示了胶囊如何结合多任务学习，当任务很困难时，通常可以提高模型的性能。基本的胶囊网络将通过正则化进行扩展，以在其输出中创建更多结构：它通过将所需的信息强加到胶囊向量中，从而学会识别话语的发话人。为此，我们从与说话者相关的设置过渡到与说话者无关的设置。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2020年|8494-8498|共5页
会议地点
作者
Jakob Poncelet; Hugo Van hamme;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Spoken Language Understanding; Capsule Networks; Multitask Learning; End-to-end; Speaker Identification;

机译：口语理解;胶囊网络;多任务学习;端到端;说话人识别;

相似文献

外文文献
中文文献
专利

1. Multitask Learning of Signaling and Regulatory Networks with Application to Studying Human Response to Flu [J] . Siddhartha Jain, Anthony Gitter, Ziv Bar-Joseph PLoS Computational Biology . 2014,第12期

机译：信号和监管网络的多任务学习及其在研究人类对流感的反应中的应用
2. Heterogeneous and Multitask Wireless Sensor Networks—Algorithms, Applications, and Challenges [J] . Jorge Plata-Chaves, Alexander Bertrand, Marc Moonen, Selected Topics in Signal Processing, IEEE Journal of . 2017,第3期

机译：异构和多任务无线传感器网络—算法，应用和挑战
3. CurveNet: Curvature-Based Multitask Learning Deep Networks for 3D Object Recognition [J] . A.A.M.Muzahid, Wanggen Wan, Ferdous Sohel, 自动化学报（英文版） . 2021,第006期

机译：授权：基于曲率的多任务学习3D对象识别的深网络
4. Multitask Learning with Capsule Networks for Speech-to-Intent Applications [C] . Jakob Poncelet, Hugo Van hamme IEEE International Conference on Acoustics, Speech and Signal Processing . 2020

机译：使用胶囊网络进行语音到Intent应用程序的多任务
5. Multitask learning for Bayesian neural networks. [D] . Kovac, Krunoslav. 2005

机译：贝叶斯神经网络的多任务学习。
6. Multitask Learning of Signaling and Regulatory Networks with Application to Studying Human Response to Flu [O] . Siddhartha Jain, Anthony Gitter, Ziv Bar-Joseph 2014

机译：信号和监管网络的多任务学习及其在研究人类对流感的反应中的应用
7. Multitask Learning with Capsule Networks for Speech-to-Intent Applications [O] . Jakob Poncelet, Hugo Van hamme 2020

机译：使用胶囊网络进行语音到Intent应用程序的多任务

Multitask Learning with Capsule Networks for Speech-to-Intent Applications

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅