Learning Label Embeddings for Nearest-Neighbor Multi-class Classification with an Application to Speech Recognition

Abstract

We consider the problem of using nearest neighbor methods to provide a conditional probability estimate, P(y|a), when the number of labels y is large and the labels share some underlying structure. We propose a method for learning label embeddings (similar to error-correcting output codes (ECOCs)) to model the similarity between labels within a nearest neighbor framework. The learned ECOCs and nearest neighbor information are used to provide conditional probability estimates. We apply these estimates to the problem of acoustic modeling for speech recognition. We demonstrate significant improvements in terms of word error rate (WER) on a lecture recognition task over a state-of-the-art baseline GMM model.
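The abstract does not spell out the estimator, so the following is only a minimal sketch of the general idea, not the authors' method. It contrasts a plain k-nearest-neighbor estimate of P(y|a) with a variant in which each neighbor spreads its vote across labels whose embeddings are close to its own. The function names, the squared Euclidean distance, the softmax weighting, and the toy data are all assumptions made for illustration.

```python
import math
from collections import Counter


def knn_indices(query, points, k):
    """Return indices of the k points closest to query (squared Euclidean distance)."""
    dists = [sum((q - p) ** 2 for q, p in zip(query, pt)) for pt in points]
    return sorted(range(len(points)), key=lambda i: dists[i])[:k]


def knn_posterior(query, points, labels, k):
    """Plain k-NN estimate: P(y|a) is the fraction of the k neighbors labeled y."""
    counts = Counter(labels[i] for i in knn_indices(query, points, k))
    return {y: counts[y] / k for y in sorted(set(labels))}


def embedded_knn_posterior(query, points, labels, embeddings, k):
    """Assumed smoothing scheme: each neighbor distributes one vote over all labels,
    weighted by a softmax of negative squared distance between label embeddings."""
    all_labels = sorted(embeddings)
    posterior = {y: 0.0 for y in all_labels}
    for i in knn_indices(query, points, k):
        v_i = embeddings[labels[i]]
        weights = {
            y: math.exp(-sum((a - b) ** 2 for a, b in zip(embeddings[y], v_i)))
            for y in all_labels
        }
        z = sum(weights.values())
        for y in all_labels:
            posterior[y] += weights[y] / (k * z)
    return posterior


# Toy data (hypothetical): 2-D feature vectors with three labels.
points = [(0.0, 0.0), (0.1, 0.0), (1.0, 1.0), (5.0, 5.0)]
labels = ["s", "s", "z", "m"]
# Hand-picked label embeddings: "s" and "z" are placed close together.
embeddings = {"s": (0.0, 0.0), "z": (0.5, 0.0), "m": (3.0, 3.0)}

plain = knn_posterior((0.05, 0.0), points, labels, k=3)
smooth = embedded_knn_posterior((0.05, 0.0), points, labels, embeddings, k=3)
```

In this toy setup the plain estimate gives zero probability to any label absent from the neighborhood, while the embedding-weighted variant shifts some mass toward labels whose embeddings lie near those of the neighbors. In the paper the embeddings are learned rather than hand-picked as they are here.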

Bibliographic record

Source
Conference on Neural Information Processing Systems; Annual Conference on Neural Information Processing Systems | 2009 | pp. 1678-1686 | 9 pages
Authors
Natasha Singh-Miller; Michael Collins
Original format: PDF
Chinese Library Classification: Information processing

Similar literature

1. Classification of Parkinson’s disease utilizing multi-edit nearest-neighbor and ensemble learning algorithms with speech samples [J]. He-Hua Zhang, Liuyang Yang, Yuchuan Liu. BioMedical Engineering OnLine. 2016, No. 1
2. HMM/SVM segmentation and labelling of Arabic speech for speech recognition applications [J]. Frihia Hamza, Bahi Halima El. International Journal of Speech Technology. 2017, No. 3
3. Multi-Class Learning from Label Proportions for Bank Customer Classification [J]. Yaxing Qian, Qiang Tong, Bo Wang. Procedia Computer Science. 2019, No. 19
4. Efficient multi-label ranking for multi-class learning: Application to object recognition [C]. Bucak S.S., Kumar Mallapragada P., Rong Jin. 2009 IEEE 12th International Conference on Computer Vision (ICCV 2009). 2009
5. Learning embeddings for indexing, retrieval, and classification, with applications to object and shape recognition in image databases [D]. Athitsos, Vassilis. 2006
6. Classification of Parkinson’s disease utilizing multi-edit nearest-neighbor and ensemble learning algorithms with speech samples [O]. He-Hua Zhang, Liuyang Yang, Yuchuan Liu. 2016
7. Efficient Multi-label Ranking for Multi-class Learning: Application to Object Recognition [O]. Serhat S. Bucak, Pavan Kumar Mallapragada, Rong Jin. 2010
