Acoustic modeling in consideration of unknown variation factors at the time of recognition

Hiroyuki SUZUKI; Heiga ZEN; Yoshihiko NANKAKUChiyomi MIYAJIMAKeiichi TOKUDATadashi KITAMURA

首页> 外文期刊>電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication >Acoustic modeling in consideration of unknown variation factors at the time of recognition

【24h】

Acoustic modeling in consideration of unknown variation factors at the time of recognition

机译：Acoustic modeling in consideration of unknown variation factors at the time of recognition

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

This paper proposes a speech recognition technique based on acoustic models which can take unknown variation factors (speaker voice characteristic, noise environment) into account at the time of recognition. Context-dependent acoustic models, which are typically triphone HMMs, are often used in continuous speech recognition systems. These methods enhance the accuracy of acoustic models via the respective modeling of phonemes according to the factors in acoustic variation. This work hypothesizes that the speaker voice characteristics that humans can perceive by listening and the noise environments are also factors of acoustic variation in construction of acoustic models, and a tree-based clustering technique is also applied to speaker voice characteristics and noise environments to construct proposed acoustic models. In speech recognition using triphone models, the neighboring phonetic context is given from the linguistic-phonetic knowledge in advance; in contrast, the variation factors as voice characteristics and noise environments of input speech are unknown in recognition using proposed acoustic models. This paper proposes a method of recognizing speech even under conditions where the variation factors of the input speech are unknown. The result of a gender-dependent speech recognition experiment shows that the proposed method achieves higher recognition performance in comparison to conventional methods.

著录项

来源
《電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication》 |2003年第517期|157-162|共6页
作者
Hiroyuki SUZUKI; Heiga ZEN; Yoshihiko NANKAKUChiyomi MIYAJIMAKeiichi TOKUDATadashi KITAMURA;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类通信;
关键词
Voice characteristic; Noise; Speech recognition; Acoustic model; Clustering;

Acoustic modeling in consideration of unknown variation factors at the time of recognition

摘要

著录项

相关主题

期刊订阅