Kernel metric learning for phonetic classification

机译：核度量学习用于语音分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

While a sound spoken is described by a handful of frame-level spectral vectors, not all frames have equal contribution for either human perception or machine classification. In this paper, we introduce a novel framework to automatically emphasize important speech frames relevant to phonetic information. We jointly learn the importance of speech frames by a distance metric across the phone classes, attempting to satisfy a large margin constraint: the distance from a segment to its correct label class should be less than the distance to any other phone class by the largest possible margin. Furthermore, an universal background model structure is proposed to give the correspondence between statistical models of phone types and tokens, allowing us to use statistical models of each phone token in a large margin speech recognition framework. Experiments on TIMIT database demonstrated the effectiveness of our framework.

机译：尽管通过少数帧级频谱矢量描述了语音，但并非所有帧对于人类感知或机器分类都具有同等的贡献。在本文中，我们介绍了一种新颖的框架来自动强调与语音信息相关的重要语音框架。我们试图通过跨电话类别的距离度量来共同学习语音帧的重要性，试图满足较大的裕度约束：从片段到其正确标签类别的距离应小于与任何其他电话类别的距离，最大余量。此外，提出了一种通用的背景模型结构来给出电话类型和令牌的统计模型之间的对应关系，从而使我们能够在大幅度语音识别框架中使用每个电话令牌的统计模型。 TIMIT数据库上的实验证明了我们框架的有效性。

著录项

来源
《Automatic Speech Recognition amp; Understanding, 2009. ASRU 2009》|2009年|141-145|共5页
会议地点 Merano(IT);Merano(IT)
作者
Huang Jui-Ting; Zhou Xi; Hasegawa-Johnson Mark; Huang Thomas;
展开▼
作者单位

Beckman Institute, University of Illinois at Urbana-Champaign, 61801, USA;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Graph Embedding Multi-Kernel Metric Learning for Image Set Classification With Grassmannian Manifold-Valued Features [J] . Rui Wang, Xiao-Jun Wu, Josef Kittler Multimedia, IEEE Transactions on . 2021,第1期

机译：图表嵌入多核度量学习的图像集分类与基层歧义的功能
2. A dual-kernel spectral-spatial classification approach for hyperspectral images based on Mahalanobis distance metric learning [J] . Li Li, Sun Chao, Lin Lianlei, Information Sciences: An International Journal . 2018,第期

机译：基于Mahalanobis距离度量学习的高光谱图像双核光谱 - 空间分类方法
3. A Mahalanobis metric learning-based polynomial kernel for classification of hyperspectral images [J] . Li Li, Sun Chao, Lin Lianlei, Neural computing & applications . 2018,第4期

机译：基于Mahalanobis公制学习的多项式内核，用于高光谱图像分类
4. Kernel Metric Learning For Phonetic Classification [C] . Jui-Ting Huang, Xi Zhou, Mark Hasegawa-Johnson, IEEE Workshop on Automatic Speech Recognition Understanding . 2009

机译：封校刻度学习的核心分类
5. Image annotation and tag completion via kernel metric learning and noisy matrix recovery. [D] . Feng, Zheyun. 2016

机译：通过内核度量学习和噪声矩阵恢复实现图像注释和标签完成。
6. Kernel-based distance metric learning for microarray data classification [O] . Huilin Xiong, Xue-wen Chen 2006

机译：基于核的距离度量学习用于微阵列数据分类
7. Kernel Metric Learning For Phonetic Classification [O] . Jui-ting Huang, Xi Zhou, Mark Hasegawa-johnson, 2010

机译：用于语音分类的内核度量学习
8. Kernel Multi-Metric Learning for Multi-Channel Transient Acoustic Signal Classification. [R] . Zhang, H., Zhang, Y., Nasrabadi, N. M., 2012

机译：多通道瞬态声信号分类的核多指标学习。

Kernel metric learning for phonetic classification

摘要

著录项

相似文献

相关主题

期刊订阅