Speech recognition using LSP frequency interval and CSM intensity pair

Kenzo Isogawa; Koichi Shinoda; Shigeki Sagayama

首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Speech recognition using LSP frequency interval and CSM intensity pair

【24h】

Speech recognition using LSP frequency interval and CSM intensity pair

机译：Speech recognition using LSP frequency interval and CSM intensity pair

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

This paper discusses two novel acoustic features for speech recognition from a new perspective. Most conventional acoustic features (e.g., cepstrum) are used to compare a pair of spectra in terms of "vertical" differences at the same frequency points. On the other hand, LSP frequencies provide a means of comparing spectra in terms of "horizontal" differences along frequency axis reflecting the formant frequency mismatches. After discussing these existing categories of acoustic feature parameters, we propose another category that represents spectrum intensities with an adaptively stretching frequency axis. We propose two novel features in the new category; one is the logarithmic difference of adjacent LSP (Line Spectrum Pair) frequencies; the other is the CSM (Composite Sinusoidal Modeling) intensity pairs. Their theoretical properties are discussed. Through continuous speech recognition experiments based on triphone HMM using LSP frequencies, MFCC and two new features, it was found that the new features performed better than LSP frequencies but not better than MFCCs.

著录项

来源
《電子情報通信学会技術研究報告. 音声. Speech》 |2002年第160期|1-6|共6页
作者
Kenzo Isogawa; Koichi Shinoda; Shigeki Sagayama;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种日语
中图分类电报、传真;
关键词
LSP (line spectrum pair); LSP frequency; LSP frequency interval; CSM (composite sinsodical modeling); CSM intensity pair;

Speech recognition using LSP frequency interval and CSM intensity pair

摘要

著录项

相关主题

期刊订阅