首页> 外文期刊>Speech Communication >Boosting and combination of classifiers for natural language call routing systems
【24h】

Boosting and combination of classifiers for natural language call routing systems

机译:自然语言呼叫路由系统的分类器的增强和组合

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we present different techniques to improve natural language call routing. We first describe methods to improve a single classifier: boosting, discriminative training (DT) and automatic relevance feedback (ARF). An interesting feature of some of these algorithms is the ability to re-weight the training data in order to focus the classifier on documents judged difficult to classify. We explore ways of deriving and combining uncorrelated classifiers in order to improve accuracy; we discuss specifically the linear interpolation and the constrained minimization techniques. Ail these approaches are probabilistic and are inspired from the information retrieval domain. They are evaluated using two similarity metrics, a common cosine measure from the vector space model, and a beta measure which had given good results in the similar task of e-mail steering. Compared to the baseline classifiers, we show an interesting improvement in the classification accuracy on call routing for a banking task: up to 20% reported for the ARF method, up to 30% for the boosting technique, and more than 45% for the DT approach. Another relative improvement of 11% is also obtained when we combine the classifiers with the constrained minimization approach using a confusion measure and DT. More importantly, synergistic effects of DT on the boosting algorithm were demonstrated: more iterations were possible because DT reduced the classification error rate of individual classifiers trained on re-weighted data by an average of 72%.
机译:在本文中,我们提出了各种改进自然语言呼叫路由的技术。我们首先描述改善单个分类器的方法:增强,判别训练(DT)和自动相关性反馈(ARF)。这些算法中的一些有趣的功能是能够对训练数据进行加权,以便将分类器集中在判断为难以分类的文档上。我们探索导出和组合不相关分类器的方法,以提高准确性。我们专门讨论线性插值和约束最小化技术。这些方法都是概率性的,并且受信息检索领域的启发。使用两个相似性度量对它们进行评估,一个是向量空间模型中的通用余弦度量,另一个是在类似电子邮件转向任务中获得了良好结果的beta度量。与基线分类器相比,我们显示了银行任务的呼叫路由分类准确度有显着提高:ARF方法报告的分类率高达20%,增强技术报告的分类率高达30%,DT的分类率超过45%方法。当我们将分类器与使用混淆度量和DT的约束最小化方法结合使用时,也可以获得11%的相对改进。更重要的是,证明了DT对Boosting算法的协同作用:由于DT将在重新加权数据上训练的单个分类器的分类错误率平均降低了72%,因此可以进行更多的迭代。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号