Improving Speech Recognition Rate through Analysis Parameters

Deividas Eringis; Gintautas Tamulevi?ius

首页> 外文期刊>Electrical, Control and Communication Engineering >Improving Speech Recognition Rate through Analysis Parameters

【24h】

Improving Speech Recognition Rate through Analysis Parameters

机译：通过分析参数提高语音识别率

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech signal is redundant and non-stationary by nature. Because of vocal tract inertness these variations are not very rapid and the signal can be considered as stationary in short segments. It is presumed that in short-time magnitude spectrum the most distinct information of speech is contained. This is the main reason for speech signal analysis in frame-by-frame manner. The analyzed speech signal is segmented into overlapping segments (so-called frames) for this purpose. Segments of 15-25 ms with the overlap of 10-15 ms are used usually. In this paper we present results of our investigation of analysis window length and frame shift influence on speech recognition rate. We have analyzed three different cepstral analysis approaches for this purpose: mel frequency cepstral analysis (MFCC), linear prediction cepstral analysis (LPCC) and perceptual linear prediction cepstral analysis (PLPC). The highest speech recognition rate was obtained using 10 ms length analysis window with the frame shift varying from 7.5 to 10 ms (regardless of analysis type). The highest increase of recognition rate was 2.5 %.

机译：语音信号本质上是冗余且不稳定的。由于声道惰性，这些变化不是很快，并且信号可以在短段中被认为是静止的。假定在短时幅度频谱中包含最鲜明的语音信息。这是逐帧分析语音信号的主要原因。为此，被分析的语音信号被分成重叠的段（所谓的帧）。通常使用15-25 ms的段与10-15 ms的重叠。在本文中，我们提出了分析窗口长度和移码对语音识别率影响的调查结果。为此，我们分析了三种不同的倒谱分析方法：梅尔频率倒谱分析（MFCC），线性预测倒谱分析（LPCC）和感知线性倒谱分析（PLPC）。使用10 ms长度的分析窗口可获得最高的语音识别率，其移码范围为7.5到10 ms（与分析类型无关）。识别率最高增加为2.5％。

著录项

来源
《Electrical, Control and Communication Engineering》 |2014年第1期|共6页
作者
Deividas Eringis; Gintautas Tamulevi?ius;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Statistical modeling of speech Poincareacute sections in combination of frequency analysis to improve speech recognition performance [J] . Jafari A, Almasganj F, Bidhendi MN Chaos . 2010,第3期

机译：结合频率分析，对语音Poincareacute部分进行统计建模以提高语音识别性能
2. Speech perception for adult cochlear implant recipients in a realistic background noise: effectiveness of preprocessing strategies and external options for improving speech recognition in noise. [J] . Gifford RH, Revit LJ Journal of the American Academy of Audiology . 2010,第7期

机译：成年人工耳蜗植入者在逼真的背景噪声中的语音感知：预处理策略的有效性和用于改善噪声中语音识别的外部选项。
3. Robust speechon-speech detection based on LDA-derived parameter and voicing parameter for speech recognition in noisy environments [J] . Arnaud Martin, Laurent Mauuary Speech Communication . 2006,第2期

机译：基于LDA派生参数和发声参数的鲁棒语音/非语音检测用于嘈杂环境中的语音识别
4. Improving Automatic Speech Recognition by Classifying Adult and Child Speakers into Separate Groups using Speech Rate Rhythmicity Parameter [C] . S. Shahnawazuddin, Tarun Sai Bandarupalli, R Chakravarthy International Conference on Signal Processing and Communications . 2020

机译：通过使用语音速率节律参数将成人和儿童说话者分为不同的组来改善自动语音识别
5. Strategies for improving audible quality and speech recognition accuracy of reverberant speech. [D] . Gillespie, Bradford Wilson. 2002

机译：改善混响语音的听觉质量和语音识别准确性的策略。
6. Speech Perception for Adult Cochlear Implant Recipients in a Realistic Background Noise: Effectiveness of Preprocessing Strategies and External Options for Improving Speech Recognition in Noise [O] . René H. Gifford, Lawrence J. Revit -1

机译：成人耳蜗植入者在现实背景噪声中的言语感知：预处理策略和外部选择改善噪声语音识别的有效性
7. Improving Speech Recognition Rate through Analysis Parameters [O] . Eringis Deividas, Tamulevičius Gintautas 2014

机译：通过分析参数提高语音识别率
8. A Study of Significant Parameters of Speech for Application in Automatic Speech Recognition Systems [R] . 1964

机译：语音重要参数在自动语音识别系统中的应用研究

Improving Speech Recognition Rate through Analysis Parameters

摘要

著录项

相似文献

相关主题

期刊订阅