首页> 外文期刊>IEICE Transactions on Information and Systems >Novel Confidence Feature Extraction Algorithm Based on Latent Topic Similarity
【24h】

Novel Confidence Feature Extraction Algorithm Based on Latent Topic Similarity

机译:基于潜在主题相似度的新置信特征提取算法

获取原文
获取原文并翻译 | 示例
       

摘要

In speech recognition, confidence annotation adopts a single confidence feature or a combination of different features for classification. These confidence features are always extracted from decoding information. However, it is proved that about 30% of knowledge of human speech understanding is mainly derived from high-level information. Thus, how to extract a high-level confidence feature statistically independent of decoding information is worth researching in speech recognition. In this paper, a novel confidence feature extraction algorithm based on latent topic similarity is proposed. Each word topic distribution and context topic distribution in one recognition result is firstly obtained using the latent Dirich-let allocation (LDA) topic model, and then, the proposed word confidence feature is extracted by determining the similarities between these two topic distributions. The experiments show that the proposed feature increases the number of information sources of confidence features with a good information complementary effect and can effectively improve the performance of confidence annotation combined with confidence features from decoding information.
机译:在语音识别中,置信度注释采用单个置信度特征或不同特征的组合进行分类。这些置信度特征总是从解码信息中提取的。但是,事实证明,大约30%的人类语音理解知识主要来自高级信息。因此,如何在统计上独立于解码信息地提取高级置信度特征值得在语音识别中研究。提出了一种基于潜在主题相似度的置信度特征提取算法。首先使用潜在狄利克-莱分配(LDA)主题模型获得一个识别结果中的每个单词主题分布和上下文主题分布,然后通过确定这两个主题分布之间的相似性来提取拟议的单词置信度特征。实验表明,所提出的特征增加了置信度特征的信息源数量,具有良好的信息互补效果,可以有效地提高置信度标注的性能,并结合来自解码信息的置信度特征。

著录项

  • 来源
    《IEICE Transactions on Information and Systems》 |2010年第8期|P.2243-2251|共9页
  • 作者单位

    Pattern Recognition and Intelligent System Laboratory, Beijing University of Posts and Telecommunications, Beijing 100876, China;

    rnPattern Recognition and Intelligent System Laboratory, Beijing University of Posts and Telecommunications, Beijing 100876, China;

    rnPattern Recognition and Intelligent System Laboratory, Beijing University of Posts and Telecommunications, Beijing 100876, China;

    rnGraduate School of Engineering, Tohoku University, Sendai-shi 980-8579 Japan;

    rnFaculty of Science and Technology, Tohoku Bunka Gakuen University, Sendai-shi 981-8551 Japan;

    rnPattern Recognition and Intelligent System Laboratory, Beijing University of Posts and Telecommunications, Beijing 100876, China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    speech recognition; confidence annotation; confidence feature; latent topic similarity;

    机译:语音识别;置信度注释;置信度潜在主题相似度;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号