...
首页> 外文期刊>Applied mathematics and computation >A simple statistical speech recognition of mandarin monosyllables
【24h】

A simple statistical speech recognition of mandarin monosyllables

机译:普通话单音节的简单统计语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

Each mandarin syllable is represented by a sequence of vectors of linear predict coding cepstra (LPCC). Since all syllables have a simple phonetic structure, in our speech recognition, we partition the sequence of LPCC vectors of all syllables into equal segments and average the LPCC vectors in each segment. The mean vector of LPCC is used as the feature of a syllable. Our simple feature does not need any time consuming and complicated nonlinear contraction and expansion as adopted by the dynamic time-warping. We propose several probability distributions for the feature values. A simplified Bayes decision rule is used for classification of mandarin syllables. For the speaker-independent mandarin digits, the recognition rate is 98.6% if a normal distribution is used for feature values and the rate is 98.1% if an exponential distribution is used for the absolute values of the features. The feature proposed in this paper to represent a syllable is the simplest one, much easier to be extracted than any other known features. The computation for feature extraction and classification is much faster and more accurate than using the HMM method or any other known techniques. (c) 2005 Elsevier Inc. All rights reserved.
机译:每个普通话音节由线性预测编码倒谱(LPCC)的向量序列表示。由于所有音节都具有简单的语音结构,因此在我们的语音识别中,我们将所有音节的LPCC向量的序列划分为相等的段,并平均每个段中的LPCC向量。 LPCC的平均向量用作音节的特征。我们的简单特征不需要动态时间扭曲所采用的任何耗时且复杂的非线性收缩和扩展。我们提出特征值的几种概率分布。简化的贝叶斯决策规则用于普通话音节的分类。对于独立于说话人的普通话数字,如果将正态分布用于特征值,则识别率为98.6%;如果将指数分布用于特征的绝对值,则识别率为98.1%。本文提出的表示音节的特征是最简单的特征,比其他任何已知特征都容易提取。与使用HMM方法或任何其他已知技术相比,用于特征提取和分类的计算要快得多且更准确。 (c)2005 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号