首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >Phoneme Classification and Lattice Rescoring Based on a k-NN Approach
【24h】

Phoneme Classification and Lattice Rescoring Based on a k-NN Approach

机译:基于k-NN方法的音素分类和格记录

获取原文

摘要

In this paper we propose a &-NN/SASH phoneme classification algorithm that competes favourably with state-of-the-art methods. We apply a similarity search algorithm (SASH) that has been used successfully for classification of high dimensional texts and images. Unlike other search algorithms, the computational time of SASH is not affected by the dimensionality of the data. Therefore, we generate fixed-length but high-dimensional feature vectors for phonemes using their underlying frames and those of their boundaries. The k-NN/SASH phoneme classifier is fast, efficient, and could achieve a classification rate of 79.2% for the TIMIT test database. Finally, we apply this algorithm to rescore phoneme lattices, generated by the GMM-HMM monophone recognizer for both context-independent and context-dependent tasks. In both cases, the k-NN/SASH classifier leads to improvements in the recognition rate.
机译:在本文中,我们提出了一种&-NN / SASH音素分类算法,该算法可与最先进的方法竞争。我们应用相似性搜索算法(SASH),该算法已成功用于高维文本和图像的分类。与其他搜索算法不同,SASH的计算时间不受数据维数的影响。因此,我们使用音素的基础帧及其边界为音素生成定长但高维的特征向量。 k-NN / SASH音素分类器快速,高效,对于TIMIT测试数据库可以达到79.2%的分类率。最后,我们将此算法应用于由GMM-HMM单音识别器生成的音素格,以用于上下文无关和上下文相关任务。在这两种情况下,k-NN / SASH分类器均可提高识别率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号