首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Automatically Optimizing Utterance Classification Performance without Human in the Loop
【24h】

Automatically Optimizing Utterance Classification Performance without Human in the Loop

机译:自动优化话语分类性能,无需人工干预

获取原文

摘要

The Utterance Classification (UC) method has become a developer's choice over traditional Context Free Grammars (CFGs) for voice menus in telephony applications. This data driven method achieves higher accuracy and has great potential to utilize a huge amount of labeled training data. But, having a human manually label the training data can be expensive. This paper provides a robust recipe for training a UC system using inexpensive acoustic data with limited transcriptions or semantic labels. It also describes two new algorithms that use caller confirmation, which naturally occurred within a dialog, to generate pseudo semantic labels. Experimental results show that, after having sufficient labeled data to achieve a reasonable accuracy, both of our algorithms can use unlabeled data to achieve the same performance as a system trained with labeled data, while completely eliminating the need for human supervision.
机译:相对于电话应用中语音菜单的传统上下文无关文法(CFG),语言分类(UC)方法已成为开发人员的选择。这种数据驱动的方法具有更高的准确性,并且在利用大量带标签的训练数据方面具有巨大的潜力。但是,由人工手动标记训练数据可能会很昂贵。本文提供了一个可靠的方法,可以使用廉价的声学数据,有限的转录或语义标签来训练UC系统。它还描述了两种使用调用者确认的新算法,这些算法自然发生在对话框中,以生成伪语义标签。实验结果表明,在拥有足够的标记数据以达到合理的准确性之后,我们的两种算法都可以使用未标记的数据来实现与使用标记数据训练的系统相同的性能,同时完全消除了人工监督的需要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号