Transform representation of the spectra of acoustic speech segments with applications. I. General approach and application to speech recognition

Algazi V.R.; Brown K.L.

Abstract

An approach to modeling and capturing the time-varying structure of the spectral envelope of speech is reported. Acoustic subword decomposition and the Karhunen-Loeve transform (KLT) are used to extract and efficiently represent the highly correlated structure of the spectral envelope. Integration of the KLT with acoustic subword modeling provides concise representation of both steady-state and dynamic features of the spectra in a unified framework that very effectively captures acoustic-phonetic patterns. The physiological and perceptual basis for the approach, the frame-based and acoustic-subword-based spectral representation, and applications to speaker-dependent recognition are presented. The performance of the recognition algorithm based on this approach compares favorably with that of other techniques.
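
As a rough illustration of how a KLT can compactly represent correlated spectral frames, here is a minimal sketch in Python with NumPy. It is not the authors' algorithm: the data are synthetic, the acoustic subword segmentation is omitted entirely, and the function names (`klt_basis`, `klt_project`, `klt_reconstruct`) and the sizes chosen are assumptions made for this example only.

```python
import numpy as np

def klt_basis(frames):
    """Estimate a KLT basis from rows of `frames` (one spectral frame per row).

    Returns the mean frame, the eigenvalues of the sample covariance in
    decreasing order, and the matching eigenvectors as columns.
    """
    mean = frames.mean(axis=0)
    centered = frames - mean
    cov = centered.T @ centered / (len(frames) - 1)
    eigvals, eigvecs = np.linalg.eigh(cov)   # eigh returns ascending order
    order = np.argsort(eigvals)[::-1]        # re-sort to decreasing variance
    return mean, eigvals[order], eigvecs[:, order]

def klt_project(frames, mean, basis, k):
    """Keep only the first k KLT coefficients of each frame."""
    return (frames - mean) @ basis[:, :k]

def klt_reconstruct(coeffs, mean, basis):
    """Map truncated KLT coefficients back to the spectral domain."""
    k = coeffs.shape[1]
    return coeffs @ basis[:, :k].T + mean

# Synthetic stand-in for spectral-envelope frames (assumption, not real speech):
# 16 correlated "bands" driven by 3 hidden factors plus a little noise.
rng = np.random.default_rng(0)
n_frames, n_bands, n_factors = 500, 16, 3
latent = rng.normal(size=(n_frames, n_factors))
mixing = rng.normal(size=(n_factors, n_bands))
frames = latent @ mixing + 0.01 * rng.normal(size=(n_frames, n_bands))

mean, eigvals, basis = klt_basis(frames)
coeffs = klt_project(frames, mean, basis, k=n_factors)
approx = klt_reconstruct(coeffs, mean, basis)

# Fraction of the centered signal norm lost by keeping 3 of 16 coefficients.
rel_err = np.linalg.norm(frames - approx) / np.linalg.norm(frames - mean)
print(f"relative reconstruction error with {n_factors} coefficients: {rel_err:.4f}")
```

Because the synthetic bands are strongly correlated, a few coefficients carry almost all of the variance, which is the property the abstract relies on for the spectral envelope of speech.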

Bibliographic details

Source: IEEE Transactions on Speech and Audio Processing, 1993, No. 2, pp. 180-195 (16 pages)
Authors: Algazi V.R.; Brown K.L.
Original format: PDF
Language: English
CLC classification: Electroacoustic technology and speech signal processing

Similar literature

1. Transform representation of the spectra of acoustic speech segments with applications. II. Speech analysis, synthesis, and coding [J]. Algazi V.R., Brown K.L. IEEE Transactions on Speech and Audio Processing. 1993, No. 3
2. Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech [J]. Krerksak Likitsupin, Proadpran Punyabukkana, Chai Wutiwiwatchai. Engineering journal. 2016, No. 2
3. Transitional speech units and their representation by regressive Markov states: applications to speech recognition [J]. Deng L., Sameti H. IEEE Transactions on Speech and Audio Processing. 1996, No. 4
4. Trajectory Representations and Acoustic Descriptions for a Segment-Modelling Approach to Automatic Speech Recognition [C]. NATO advanced study institute on computational models of speech pattern processing. 1999
5. Spectral analysis methods for automatic speech recognition applications [D]. Parinam, Venkata Neelima Devi. 2013
6. Time-Frequency Feature Representation Using Multi-Resolution Texture Analysis and Acoustic Activity Detector for Real-Life Speech Emotion Recognition [O]. Kun-Ching Wang. 2015
7. A Sinusoidal Model Approach to Acoustic Landmark Detection and Segmentation for Robust Segment-Based Speech Recognition [O]. Tara N. Sainath, Timothy J. Hazen. 2006
