Significance of the Modified Group Delay Feature in Speech Recognition

Rajesh M. Hegde; Hema A. Murthy; Venkata Ramana Rao Gadde

首页> 外文期刊>IEEE transactions on audio, speech and language processing >Significance of the Modified Group Delay Feature in Speech Recognition

【24h】

Significance of the Modified Group Delay Feature in Speech Recognition

机译：改进的群时延特征在语音识别中的意义

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Spectral representation of speech is complete when both the Fourier transform magnitude and phase spectra are specified. In conventional speech recognition systems, features are generally derived from the short-time magnitude spectrum. Although the importance of Fourier transform phase in speech perception has been realized, few attempts have been made to extract features from it. This is primarily because the resonances of the speech signal which manifest as transitions in the phase spectrum are completely masked by the wrapping of the phase spectrum. Hence, an alternative to processing the Fourier transform phase, for extracting speech features, is to process the group delay function which can be directly computed from the speech signal. The group delay function has been used in earlier efforts, to extract pitch and formant information from the speech signal. In all these efforts, no attempt was made to extract features from the speech signal and use them for speech recognition applications. This is primarily because the group delay function fails to capture the short-time spectral structure of speech owing to zeros that are close to the unit circle in the z-plane and also due to pitch periodicity effects. In this paper, the group delay function is modified to overcome these effects. Cepstral features are extracted from the modified group delay function and are called the modified group delay feature (MODGDF). The MODGDF is used for three speech recognition tasks namely, speaker, language, and continuous-speech recognition. Based on the results of feature and performance evaluation, the significance of the MODGDF as a new feature for speech recognition is discussed

机译：当同时指定傅立叶变换幅度和相位频谱时，语音的频谱表示就完成了。在传统的语音识别系统中，特征通常是从短时幅度谱中得出的。尽管已经认识到傅立叶变换阶段在语音感知中的重要性，但很少有人尝试从中提取特征。这主要是因为语音信号的共振在相位频谱中表现为相变，而相位频谱的包裹完全掩盖了该语音信号的共振。因此，处理傅立叶变换阶段以提取语音特征的另一种选择是处理可以从语音信号直接计算的群延迟函数。群延迟功能已经用于较早的工作中，以从语音信号中提取音调和共振峰信息。在所有这些努力中，没有尝试从语音信号中提取特征并将其用于语音识别应用。这主要是因为群延迟函数由于接近z平面中单位圆的零以及音高周期性效应而无法捕获语音的短时频谱结构。在本文中，修改了群延迟函数以克服这些影响。倒谱特征是从修改的群延迟特征中提取的，被称为修改的群延迟特征（MODGDF）。 MODGDF用于三种语音识别任务，即说话者，语言和连续语音识别。根据特征和性能评估的结果，讨论了MODGDF作为语音识别新特征的意义

著录项

来源
《IEEE transactions on audio, speech and language processing》 |2007年第1期|p.190-202|共13页
作者
Rajesh M. Hegde; Hema A. Murthy; Venkata Ramana Rao Gadde;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
Fourier transforms; feature extraction; speaker recognition; speech processing; Fourier transform magnitude; cepstral features; continuous-speech recognition; features extraction; group delay function; language recognition; modified group delay feature; phase spect;

机译：傅里叶变换;特征提取;说话人识别;语音处理;傅立叶变换幅度;倒谱特征;连续语音识别;特征提取;群时延函数;语言识别;修改后的群时延特征;相位斑点;

相似文献

外文文献
中文文献
专利

1. Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing [J] . Rajesh M. Hegde, Hema A. Murthy, V. R. R. Gadde EURASIP journal on audio, speech, and music processing . 2006,第1期

机译：改进的群时延函数推导的联合特征在语音处理中的意义
2. Significance of Phonological Features in Speech Emotion Recognition [J] . Wei Wang, Paul A. Watters, Xinyi Cao, International journal of speech technology . 2020,第3期

机译：语音情感识别中语音特征的意义
3. Development of Modified Analytical Model for Investigating Acceptable Delay of TCP-Based Speech Recognition [J] . Advanced Science Letters . 2017,第4期

机译：改进分析模型调查基于TCP的语音识别的可接受的分析模型
4. Significance of Group Delay based Acoustic Features in the Linguistic Search Space for Robust Speech Recognition [C] . Ramya R, Rajesh M Hegde, Hema A Murthy International Speech Communication Association . 2008

机译：基于组延迟基于语言搜索空间的延迟声学特征的意义，实现鲁棒语音识别
5. Two modified methods of feature extraction for automatic speech recognition. [D] . Ge, Wangning. 2013

机译：自动语音识别的特征提取的两种改进方法。
6. Speech delays and behavioral problems are the predominant features in individuals with developmental delays and 16p11.2 microdeletions and microduplications [O] . Jill A. Rosenfeld, Justine Coppinger, Bassem A. Bejjani, 2010

机译：言语延迟和行为问题是发育迟缓16p11.2微缺失和微重复的个体的主要特征
7. Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing [O] . Rajesh M. Hegde, Hema A. Murthy, V. R. R. Gadde 2006

机译：改进的群时延函数推导的联合特征在语音处理中的意义
8. Speech Recognition, Articulatory Feature Detection, and Speech Synthesis in Multiple Languages [R] . Ore, B. M. 2009

机译：语音识别，发音特征检测和多语言语音合成

Significance of the Modified Group Delay Feature in Speech Recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅