首页> 外文会议>IEEE Workshop on Spoken Language Technology >Using lexical, syntactic and semantic features for non-terminal grammar rule induction in Spoken Dialogue Systems
【24h】

Using lexical, syntactic and semantic features for non-terminal grammar rule induction in Spoken Dialogue Systems

机译:在口语对话系统中使用非终端语法规则诱导的词法,句法和语义特征

获取原文

摘要

In this work, we propose an algorithm for the automatic induction of non-terminal grammar rules for Spoken Dialogue Systems (SDS). Initially, a grammar developer provides the system with a minimal set of rules that serve as seeding examples. Using these seed rules and (optionally) a seed corpus, in-domain data are harvested and filtered from the web. A challenging task is identifying relevant chunks (phrases) in the web-harvested corpus that are good candidates for enhancing the seed grammar. We propose and evaluate rule-based and statistical classification algorithms for this purpose that use lexical, syntactic and semantic features. Induced grammars are evaluated in terms of accuracy of the proposed rules for two spoken dialogue domains. Results show up to four times absolute precision improvement compared to the naive grammar induction approach using semantic phrase similarity.
机译:在这项工作中,我们提出了一种用于自动诱导非终端语法规则的算法(SDS)。最初,语法开发人员提供了具有最小一组规则的系统,该规则用作播种示例。使用这些种子规则和(任选地)种子语料库,收获域内数据并从网中过滤。一个具有挑战性的任务是识别网络收获的语料库中的相关块(短语),这是加强种子语法的好候选者。我们为此目的提出并评估了基于规则和统计分类算法,该算法使用词法,句法和语义特征。在两个口头对话域的拟议规则的准确性方面评估了语法。结果显示,与使用语义短语相似性的幼稚语法诱导方法相比,绝对精确改善的达到了四倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号