首页> 外文会议> >Semantics synchronous understanding for robust spoken language applications

【24h】

Semantics synchronous understanding for robust spoken language applications

机译：健壮的口语应用程序的语义同步理解

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we describe our recent effort in combining speech recognition and understanding into a single pass decoding process. The goal is to utilize the semantic structure not only to better handle disfluencies and improve the overall understanding accuracy, but also to shorten the response time and achieve higher interactivity. Three related techniques are instrumental in our approach. First, we employ the unified language model (ULM) to incorporate semantic schema into the recognition language model, and extend the search process from word synchronous to semantic object synchronous (SOS) decoding. Finally, we utilize sequential detection to defer, reject, or accept semantic hypotheses and execute consequent dialog actions while the user's utterance is ongoing. We incorporated these methods into SALT and HTML and conducted comparative user studies based on the MiPad scenarios. The experimental results show the system can gracefully cope with spontaneous speech and the users prefer the highly interactive nature of such systems even though there are no significant differences in the task completion rate and the understanding accuracy. However, the interactive interface does allow a more effective visual prompting strategy that contributes to the significantly lower out of grammar utterances.

机译：在本文中，我们描述了我们最近在将语音识别和理解结合到单遍解码过程中的工作。目标是利用语义结构不仅可以更好地处理歧义和提高整体理解的准确性，而且可以缩短响应时间并实现更高的交互性。三种相关技术对我们的方法至关重要。首先，我们采用统一语言模型（ULM）将语义模式合并到识别语言模型中，并将搜索过程从单词同步扩展到语义对象同步（SOS）解码。最后，我们利用顺序检测来推迟，拒绝或接受语义假设，并在用户发声进行时执行随后的对话动作。我们将这些方法合并到SALT和HTML中，并根据MiPad方案进行了比较用户研究。实验结果表明，该系统可以很好地应对自发语音，即使任务完成率和理解准确度没有显着差异，用户也喜欢这种系统的高度交互性。但是，交互式界面的确允许使用更有效的视觉提示策略，从而大大降低了语法语调。

著录项

来源
《》|2003年|p.640-645|共6页
会议地点
作者
Kuansan Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词
linguistics; speech recognition; speech-based user interfaces; sequential estimation; spoken language understanding system; semantics synchronous understanding; speech recognition; single pass decoding process; disfluencies; understanding accuracy; interactivity; unified language model; ULM; search process; semantic object synchronous decoding; sequential detection; semantic hypotheses; SALT; HTML; spontaneous speech; interactive user interface; task completion rate; visual prompting strategy; out of grammar utterances;

机译：语言学;语音识别;基于语音的用户界面;顺序估计;语音理解系统;语义同步理解;语音识别;单遍解码过程;差异性;理解准确性;交互性;统一语言模型; ULM;搜索过程;语义对象同步解码;顺序检测;语义假设; SALT; HTML;自发语音;交互式用户界面;任务完成率;视觉提示策略;语法不正确;

相似文献

外文文献
中文文献
专利

1. Spoken Language Understanding of Human-Machine Conversations for Language Learning Applications [J] . Yao Qian, Rutuja Ubale, Patrick Lange, Journal of VLSI signal processing systems for signal, image, and video technology . 2020,第8期

机译：对人机对话进行语言学习应用的口语理解
2. Fuzzy Matching of Semantic Class in Chinese Spoken Language Understanding [J] . Yanling LI, Qingwei ZHAO, Yonghong YAN IEICE transactions on information and systems . 2013,第8期

机译：汉语口语理解中语义类的模糊匹配
3. Fuzzy Matching of Semantic Class in Chinese Spoken Language Understanding [J] . Yanling LI, Qingwei ZHAO, Yonghong YAN IEICE Transactions on Information and Systems . 2013,第8期

机译：汉语口语理解中语义类的模糊匹配
4. SEMANTICS SYNCHRONOUS UNDERSTANDING FOR ROBUST SPOKEN LANGUAGE APPLICATIONS [C] . Kuansan Wang IEEE Workshop on Automatic Speech Recognition and Understanding . 2003

机译：语义同步了解强大的口语语言应用程序
5. Speech to Text to Semantics: A Sequence-to-sequence System for Spoken Language Understanding [D] . Dodson, John. 2020

机译：发表文本到语义：用于口语语言理解的序列到序列系统
6. Fast mapping semantic features: Performance of adults with normal language history of disorders of spoken and written language and attention deficit hyperactivity disorder on a word learning task [O] . Mary Alt, Michelle L. Gutmann -1

机译：快速映射语义特征：正常的语言口语和书面语言的障碍病史注意缺陷多动障碍的成年人的表现就一个字学习任务
7. Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora [O] . Lefèvre Fabrice, Mostefa Djamel, Besacier Laurent, 2012

机译：利用跨语言和跨域的口语理解系统的鲁棒性和可移植性研究：PORTMEDIA语料库
8. Integrating Syntax and Semantics into Spoken Language Understanding. [R] . Hirschman, L., Seneff, S., Goodine, D., 1991

机译：将语法和语义集成到口语理解中。

Semantics synchronous understanding for robust spoken language applications

摘要

著录项

相似文献

相关主题

期刊订阅