Peripheral Nonlinear Time Spectrum Features Algorithm for Large Vocabulary Mandarin Automatic Speech Recognition

Fadhil H. T. Al-dulaimy; WANG Zuoying

首页> 中文期刊> 《清华大学学报（英文版）》 >Peripheral Nonlinear Time Spectrum Features Algorithm for Large Vocabulary Mandarin Automatic Speech Recognition

Peripheral Nonlinear Time Spectrum Features Algorithm for Large Vocabulary Mandarin Automatic Speech Recognition

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

This work describes an improved feature extractor algorithm to extract the peripheral features of point x(ti,fj) using a nonlinear algorithm to compute the nonlinear time spectrum (NL-TS) pattern. The algorithm observes n×n neighborhoods of the point in all directions, and then incorporates the peripheral features using the Mel frequency cepstrum components (MFCCs)-based feature extractor of the Tsinghua electronic engineering speech processing (THEESP) for Mandarin automatic speech recognition (MASR) system as replacements of the dynamic features with different feature combinations. In this algorithm, the orthogonal bases are extracted directly from the speech data using discrite cosime transformation (DCT) with 3×3 blocks on an NL-TS pattern as the peripheral features. The new primal bases are then selected and simplified in the form of the operator in the time direction and the operator in the frequency direction. The algorithm has 23.29% improvements of the relative error rate in comparison with the standard MFCC feature-set and the dynamic features in tests using THEESP with the duration distribution-based hidden Markov model (DDBHMM) based on MASR system.

著录项

来源
《清华大学学报（英文版）》 |2005年第2期|174-182|共9页
作者
Fadhil H. T. Al-dulaimy; WANG Zuoying;
展开▼
作者单位

Department of Electronic Engineering, Tsinghua University, Beijing 100084, China;

Department of Electronic Engineering, Tsinghua University, Beijing 100084, China;

展开▼
原文格式 PDF
正文语种 chi
中图分类工程基础科学;
关键词
large vocabulary speech recognition; Mandarin automatic speech recognition (MASR); duration distribution-based hidden Markov model (DDBHMM); feature identification;

机译：大词汇量语音识别;普通话自动语音识别（MASR）;基于持续时间分布的隐马尔可夫模型（DDBHMM）;特征识别;

Peripheral Nonlinear Time Spectrum Features Algorithm for Large Vocabulary Mandarin Automatic Speech Recognition

摘要

著录项

相关主题

期刊订阅