IEEE International Conference on Acoustics, Speech and Signal Processing

Improved End-To-End Spoken Utterance Classification with a Self-Attention Acoustic Classifier



Abstract

While human language provides a natural interface for human-machine communication, several challenges remain in extracting a speaker's intent when interacting with a virtual agent, especially when the speaker is in a noisy acoustic environment. In this paper, we propose a new architecture for end-to-end spoken utterance classification (SUC) and also explore the impact of leveraging lexical information in conjunction with the acoustic information obtained from the end-to-end model. We demonstrate that the model achieves strong performance with acoustic features alone, compared to a text classifier operating on ASR outputs. Furthermore, when acoustic and lexical embeddings from these classifiers are combined, accuracy on par with human agents can be achieved.
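To illustrate the kind of architecture the abstract describes, the following is a minimal, hypothetical sketch (written in PyTorch, which the paper may or may not use) of a self-attention acoustic classifier whose pooled utterance embedding is fused with a lexical embedding from a text classifier. All module names, dimensions, and the mean-pooling and late-fusion choices are assumptions for illustration, not the authors' exact design.

```python
import torch
import torch.nn as nn

class SelfAttentionAcousticClassifier(nn.Module):
    """Sketch: encode acoustic frames with self-attention, pool over time,
    and classify the utterance intent directly from audio features."""
    def __init__(self, feat_dim=80, d_model=256, n_heads=4, n_layers=2, n_intents=20):
        super().__init__()
        self.proj = nn.Linear(feat_dim, d_model)
        enc_layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=n_heads,
                                               batch_first=True)
        self.encoder = nn.TransformerEncoder(enc_layer, num_layers=n_layers)
        self.classifier = nn.Linear(d_model, n_intents)

    def forward(self, frames):                  # frames: (batch, time, feat_dim)
        h = self.encoder(self.proj(frames))     # (batch, time, d_model)
        acoustic_emb = h.mean(dim=1)            # simple mean pooling over time
        return self.classifier(acoustic_emb), acoustic_emb

class FusionClassifier(nn.Module):
    """Sketch: late fusion of acoustic and lexical utterance embeddings."""
    def __init__(self, acoustic_dim=256, lexical_dim=768, n_intents=20):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(acoustic_dim + lexical_dim, 256), nn.ReLU(),
            nn.Linear(256, n_intents))

    def forward(self, acoustic_emb, lexical_emb):
        return self.fc(torch.cat([acoustic_emb, lexical_emb], dim=-1))

# Hypothetical usage: 80-dim frame features and a 768-dim lexical embedding
# (e.g. from a text classifier run on ASR transcripts).
frames = torch.randn(8, 200, 80)
lexical_emb = torch.randn(8, 768)
acoustic_model = SelfAttentionAcousticClassifier()
fusion = FusionClassifier()
acoustic_logits, acoustic_emb = acoustic_model(frames)
fused_logits = fusion(acoustic_emb, lexical_emb)
```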
