首页> 外文会议>Conference on Neural Information Processing Systems;Annual conference on Neural Information Processing Systems >Learning Label Embeddings for Nearest-Neighbor Multi-class Classification with an Application to Speech Recognition
【24h】

Learning Label Embeddings for Nearest-Neighbor Multi-class Classification with an Application to Speech Recognition

机译:学习用于最近邻多类分类的标签嵌入及其在语音识别中的应用

获取原文

摘要

We consider the problem of using nearest neighbor methods to provide a conditional probability estimate, P(y|a), when the number of labels y is large and the labels share some underlying structure. We propose a method for learning label embeddings (similar to error-correcting output codes (ECOCs)) to model the similarity between labels within a nearest neighbor framework. The learned ECOCs and nearest neighbor information are used to provide conditional probability estimates. We apply these estimates to the problem of acoustic modeling for speech recognition. We demonstrate significant improvements in terms of word error rate (WER) on a lecture recognition task over a state-of-the-art baseline GMM model.
机译:我们考虑在标签y的数量较大且标签共享某些基础结构时使用最近邻方法提供条件概率估计值P(y | a)的问题。我们提出了一种学习标签嵌入的方法(类似于纠错输出代码(ECOC)),以对最近邻居框架内的标签之间的相似性进行建模。获悉的ECOC和最邻近信息用于提供条件概率估计。我们将这些估计应用于语音识别的声学建模问题。我们证明,在最新的基线GMM模型上,在演讲识别任务上的单词错误率(WER)方面得到了显着改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号