A simple statistical speech recognition of mandarin monosyllables

Li TF; Chang SC; Lee CB

首页> 外文期刊>Applied mathematics and computation >A simple statistical speech recognition of mandarin monosyllables

【24h】

A simple statistical speech recognition of mandarin monosyllables

机译：普通话单音节的简单统计语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Each mandarin syllable is represented by a sequence of vectors of linear predict coding cepstra (LPCC). Since all syllables have a simple phonetic structure, in our speech recognition, we partition the sequence of LPCC vectors of all syllables into equal segments and average the LPCC vectors in each segment. The mean vector of LPCC is used as the feature of a syllable. Our simple feature does not need any time consuming and complicated nonlinear contraction and expansion as adopted by the dynamic time-warping. We propose several probability distributions for the feature values. A simplified Bayes decision rule is used for classification of mandarin syllables. For the speaker-independent mandarin digits, the recognition rate is 98.6% if a normal distribution is used for feature values and the rate is 98.1% if an exponential distribution is used for the absolute values of the features. The feature proposed in this paper to represent a syllable is the simplest one, much easier to be extracted than any other known features. The computation for feature extraction and classification is much faster and more accurate than using the HMM method or any other known techniques. (c) 2005 Elsevier Inc. All rights reserved.

机译：每个普通话音节由线性预测编码倒谱（LPCC）的向量序列表示。由于所有音节都具有简单的语音结构，因此在我们的语音识别中，我们将所有音节的LPCC向量的序列划分为相等的段，并平均每个段中的LPCC向量。 LPCC的平均向量用作音节的特征。我们的简单特征不需要动态时间扭曲所采用的任何耗时且复杂的非线性收缩和扩展。我们提出特征值的几种概率分布。简化的贝叶斯决策规则用于普通话音节的分类。对于独立于说话人的普通话数字，如果将正态分布用于特征值，则识别率为98.6％;如果将指数分布用于特征的绝对值，则识别率为98.1％。本文提出的表示音节的特征是最简单的特征，比其他任何已知特征都容易提取。与使用HMM方法或任何其他已知技术相比，用于特征提取和分类的计算要快得多且更准确。（c）2005 Elsevier Inc.保留所有权利。

著录项

来源
《Applied mathematics and computation》 |2006年第2期|共8页
作者
Li TF; Chang SC; Lee CB;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类应用数学;
关键词
Bayes decision rule; linear predict coding; speech recognition; HIDDEN MARKOV-MODELS; ISOLATED WORD RECOGNITION; VECTOR QUANTIZATION; MAXIMUM-LIKELIHOOD; LINEAR PREDICTION; EM ALGORITHM; SYSTEM; CHAINS;

机译：贝叶斯决策规则;线性预测编码;语音识别;隐马尔可夫模型;孤立词识别;矢量量化;最大似然;线性预测;EM算法;系统;链;

相似文献

外文文献
中文文献
专利

1. A simple statistical speech recognition of mandarin monosyllables [J] . Li TF, Chang SC, Lee CB Applied mathematics and computation . 2006,第2期

机译：普通话单音节的简单统计语音识别
2. The characteristics of monosyllable recognition in Mandarin-speaking patients with auditory neuropathy [J] . Acta Oto-Laryngologica . 2020,第5a6期

机译：讲术治疗神经病变术患者单音节识别的特征
3. Development of a mandarin monosyllable recognition test. [J] . Tsai KS, Tseng LH, Wu CJ, Ear and hearing. . 2009,第1期

机译：普通话单音节识别测试的发展。
4. A statistical speech recognition of Ningbo dialect monosyllables [C] . 2010 International Conference on Intelligent Systems and Knowledge Engineering . 2010

机译：宁波话单音节的统计语音识别
5. Signal processing in automatic speech recognition for English and Mandarin Chinese. [D] . Liu, Xiaoyu. 2016

机译：英文和中文普通话自动语音识别中的信号处理。
6. The Binaural Masking-Level Difference of Mandarin Tone Detection and the Binaural Intelligibility-Level Difference of Mandarin Tone Recognition in the Presence of Speech-Spectrum Noise [O] . Cheng-Yu Ho, Pei-Chun Li, Yuan-Chuan Chiang, -1

机译：语音频谱噪声下普通话检测的双耳掩蔽水平差异和普通话识别的双耳可懂度水平差异
7. Machine Recognition of Mandarin Monosyllable [O] . Kung‐Pu Li 1964

机译：机器识别普通话单糖基
8. Machine Recognition of Mandarin Monosyllable [R] . Li, K. P. 1964

机译：普通话单音节的机器识别

A simple statistical speech recognition of mandarin monosyllables

摘要

著录项

相似文献

相关主题

期刊订阅