首页> 外文会议>Perception in Multimodal Dialogue Systems >Call Classification with Hundreds of Classes andHundred Thousands of Training Utterances ... ... and No Target Domain Data
【24h】

Call Classification with Hundreds of Classes andHundred Thousands of Training Utterances ... ... and No Target Domain Data

机译:具有数百个类别和十万个训练演说的呼叫分类... ...并且没有目标域数据

获取原文
获取原文并翻译 | 示例

摘要

This paper reports about an effort to build a large-scale call router able to reliably distinguish among 250 call reasons. Because training data from the specific application (Target) domain was not available, the statistical classifier was built using more than 300,000 transcribed and annotated utterances from related, but different, domains. Several tuning cycles including three re-annotation rounds, in-lab data recording, bag-of-words-based consistency cleaning, and recognition parameter optimization improved the classifier accuracy from 32% to a performance clearly above 70%.
机译:本文报告了有关构建大型呼叫路由器的工作,该路由器能够可靠地区分250个呼叫原因。由于没有来自特定应用程序(目标)域的训练数据,因此使用来自相关但不同域的300,000多个转录和注释话语构建了统计分类器。包括三个重新注释回合,实验室内数据记录,基于词袋的一致性清除以及识别参数优化在内的多个调整周期将分类器的准确性从32%提高到明显高于70%的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号