首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Bootstrapping Domain Detection Using Query Click Logs for New Domains
【24h】

Bootstrapping Domain Detection Using Query Click Logs for New Domains

机译:使用查询单击日志为新域引导域检测

获取原文

摘要

Domain detection in spoken dialog systems is usually treated as a multi-class, multi-label classification problem, and training of domain classifiers requires collection and manual annotation of example utterances. In order to extend a dialog system to new domains in a way that is seamless for users, domain detection should be able to handle utterances from the new domain as soon as it is introduced. In this work, we propose using web search query logs, which include queries entered by users and the links they subsequently click on, to bootstrap domain detection for new domains. While sampling user queries from the query click logs to train new domain classifiers, we introduce two types of measures based on the behavior of the users who entered a query and the form of the query. We show that both types of measures result in reductions in the error rate as compared to randomly sampling training queries. In controlled experiments over five domains, we achieve the best gain from the combination of the two types of sampling criteria.
机译:口语对话系统中的域检测通常被视为多类,多标签的分类问题,并且对域分类器的训练需要对示例话语进行收集和手动注释。为了以对用户来说无缝的方式将对话系统扩展到新域,域检测应该能够在引入新域后立即处理来自新域的讲话。在这项工作中,我们建议使用网络搜索查询日志,其中包括用户输入的查询以及他们随后单击的链接,以引导对新域的域检测。在从查询点击日志中采样用户查询以训练新的域分类器时,我们基于输入查询的用户的行为和查询的形式介绍两种类型的度量。我们显示,与随机抽样的训练查询相比,这两种类型的措施均会降低错误率。在五个领域的受控实验中,我们通过两种类型的采样标准的组合获得了最佳增益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号