首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Multitask Learning with Capsule Networks for Speech-to-Intent Applications
【24h】

Multitask Learning with Capsule Networks for Speech-to-Intent Applications

机译:用于语音到意图应用的胶囊网络多任务学习

获取原文

摘要

Voice controlled applications can be a great aid to society, especially for physically challenged people. However this requires robustness to all kinds of variations in speech. A spoken language understanding system that learns from interaction with and demonstrations from the user, allows the use of such a system in different settings and for different types of speech, even for deviant or impaired speech, while also allowing the user to choose a phrasing. The user gives a command and enters its intent through an interface, after which the model learns to map the speech directly to the right action. Since the effort of the user should be as low as possible, capsule networks have drawn interest due to potentially needing little training data compared to deeper neural networks. In this paper, we show how capsules can incorporate multitask learning, which often can improve the performance of a model when the task is difficult. The basic capsule network will be expanded with a regularisation to create more structure in its output: it learns to identify the speaker of the utterance by forcing the required information into the capsule vectors. To this end we move from a speaker dependent to a speaker independent setting.
机译:语音控制的应用程序可以为社会带来很大的帮助,特别是对于身体残障的人。然而,这要求对语音中的各种变化都具有鲁棒性。从与用户的交互和从用户的演示中学到的口语理解系统,允许在不同的设置和不同类型的语音中使用该系统,甚至用于语音不正确或受损的用户,同时还允许用户选择措辞。用户发出命令并通过界面输入其意图,此后模型学习将语音直接映射到正确的动作。由于用户的精力应尽可能少,因此与更深层的神经网络相比,胶囊网络由于可能需要很少的训练数据而引起了人们的兴趣。在本文中,我们展示了胶囊如何结合多任务学习,当任务很困难时,通常可以提高模型的性能。基本的胶囊网络将通过正则化进行扩展,以在其输出中创建更多结构:它通过将所需的信息强加到胶囊向量中,从而学会识别话语的发话人。为此,我们从与说话者相关的设置过渡到与说话者无关的设置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号