Phoneme Recognition based on AF-HMMs with Optimal State Configuration

Narpendyah W. ARIWARDHANI; Yurie IRIBE; Kouichi KATSURADA; Tsuneo NITTA

首页> 外文期刊>電子情報通信学会技術研究報告 >Phoneme Recognition based on AF-HMMs with Optimal State Configuration

【24h】

Phoneme Recognition based on AF-HMMs with Optimal State Configuration

机译：基于具有最佳状态配置的AF-HMM的音素识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

認識と合成の双方に共通の調音運動HMM を利用する，ワンモデル音声認識・合成方式の音声認識モジュールに対して性能を改善する研究を行っている。本文では，調音特徴（AF）を用いたHMM の構成方法に焦点をあて，音素認識正解率と精度を改善するための評価実験を行った結果を報告する。種々のHMM 構成を対象に，JANAS コーパスによる評価を通して，AF に基づくアプローチは，特徴抽出器に音素文脈情報を埋め込むことで，monophone model や少ない混合数においても，高い性能を達成できることを明らかにする。%Speeh recognition based on one-model of articulatory movement HMMs that are commonly applied to both speech recognition (SR) and speech synthesis (SS) is described. In a SR module, speaker-invariant HMMs are applied to recognize articulatory feature (AF) sequence. This paper focuses on our approaches in designing an optimal state configuration for accurate phoneme recognizer based on articulatory movement. We consider several strategies to improve Articulatory Feature (AF) based phoneme recognition and compare its performance. In the experiments on Japanese Newspaper Article Sentences (JNAS) utterances, the approach of separating short vowel and long vowel in 5-states HMM triphone models provides higher accuracy compared to other approaches.

机译：我们正在进行研究，以提高一种模型的语音识别/合成语音识别模块的性能，该模块使用在识别和合成中都通用的关节式HMM。在本文中，我们重点介绍了使用发音特征（AF）构造HMM的方法，并报告了为提高音素识别的准确性和准确性而进行的评估实验的结果。通过JANAS语料库对各种HMM配置的评估，可以清楚地发现，基于AF的方法甚至可以通过在特征提取器中嵌入音素上下文信息，甚至在使用单音素模型和少量混合音的情况下也可以实现高性能。 ..描述了基于一种通常用于语音识别（SR）和语音合成（SS）的关节运动HMM的语音识别％。在SR模块中，不变说话者HMM被用于识别关节特征（AF）序列。本文重点研究基于发音运动设计准确音素识别器的最佳状态配置的方法，我们考虑了几种改进基于音素特征（AF）的音素识别并比较其性能的策略。（JNAS）话语，在五状态HMM三音器模型中分离短元音和长元音的方法与其他方法相比具有更高的准确性。

著录项

来源
《電子情報通信学会技術研究報告》 |2011年第364期|p.49-54|共6页
作者
Narpendyah W. ARIWARDHANI; Yurie IRIBE; Kouichi KATSURADA; Tsuneo NITTA;
展开▼
作者单位

Graduate School of Engineering, Toyohashi University of Technology 1-1 Hibarigaoka, Tenpaku-chou, Toyohashi, 441-8580 Japan;

Graduate School of Engineering, Toyohashi University of Technology 1-1 Hibarigaoka, Tenpaku-chou, Toyohashi, 441-8580 Japan;

Graduate School of Engineering, Toyohashi University of Technology 1-1 Hibarigaoka, Tenpaku-chou, Toyohashi, 441-8580 Japan;

Graduate School of Engineering, Toyohashi University of Technology 1-1 Hibarigaoka, Tenpaku-chou, Toyohashi, 441-8580 Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
automatic speech recognition; hidden markov model (HMM); articulatory feature (AF); one-model SR and SS;

机译：自动语音识别;隐藏马尔可夫模型（HMM）;关节特征（AF）;单模SR和SS;
入库时间 2022-08-18 00:31:38

相似文献

外文文献
中文文献
专利

1. Phoneme Recognition based on AF-HMMs with Optimal State Configuration [J] . Narpendyah W. ARIWARDHANI, Yurie IRIBE, Kouichi KATSURADA, 電子情報通信学会技術研究報告. 音声. Speech . 2011,第365期

机译：基于具有最佳状态配置的AF-HMM的音素识别
2. [Poster Presentation] Phoneme Recognition based on AF-HMMs with Optimal State Configuration [J] . Narpendyah W. ARIWARDHANI, Yurie IRIBE, Kouichi KATSURADA, 電子情報通信学会技術研究報告 . 2011,第365期

机译：[海报演示]基于具有最佳状态配置的AF-HMM的音素识别
3. Announcer-independent phoneme recognition on the basis of optimal orthogonal expansions [J] . S.N.Kirillov, A.S.Sheludyakov Journal of Computer and Systems Sciences International . 1997,第5期

机译：基于最佳正交展开的独立于播音员的音素识别
4. Error Minimization in Phoneme based Automated Speech Recognition for Similar Sounding Phonemes [C] . Karan Gangaputra International Conference on Communication Technology and System Design . 2013

机译：基于音素的自动语音识别中的最小化最小化误差
5. Phoneme weighting and energy-based weighting for speaker recognition. [D] . Fang, Eric. 2012

机译：用于说话人识别的音素加权和基于能量的加权。
6. Modeling Cryotherapy Ice Ball Dimensions and Isotherms in a Novel Gel-based Model to Determine Optimal Cryo-needle Configurations and Settings for Potential Use in Clinical Practice [O] . Taimur T. Shah, Uri Arbel, Sonja Foss, -1

机译：在基于凝胶的新型模型中对冷冻疗法的冰球尺寸和等温线进行建模以确定最佳的低温针头配置和设置以供临床实践使用
7. Error Minimization in Phoneme based Automated Speech Recognition for Similar Sounding Phonemes [O] . Gangaputra Karan 2012

机译：基于音素的相似语音音素的自动语音识别中的错误最小化
8. A phoneme based speech recognition system for high stress moderate noise environments [R] . Anikst, M. T., Davis, B. M., Meisel, W. S., 1990

机译：基于音素的语音识别系统，适用于高应力中等噪声环境

Phoneme Recognition based on AF-HMMs with Optimal State Configuration

摘要

著录项

相似文献

相关主题

期刊订阅