Speech recognition based on phonetic features and acoustic landmarks.

Abstract

A probabilistic and statistical framework is presented for automatic speech recognition based on a phonetic feature representation of speech sounds. In this acoustic-phonetic approach, the speech recognition problem is formulated as the maximization of the joint posterior probability of a set of phonetic features and the corresponding acoustic landmarks. Binary classifiers of the manner phonetic features (syllabic, sonorant, and continuant) are applied for the probabilistic detection of speech landmarks. The landmarks include stop bursts, vowel onsets, syllabic peaks, syllabic dips, fricative onsets and offsets, and sonorant consonant onsets and offsets. The classifiers use automatically extracted, knowledge-based acoustic parameters (APs) that are acoustic correlates of those phonetic features. For isolated word recognition with a known and limited vocabulary, the landmark sequences are constrained using a manner class pronunciation graph. Probabilistic decisions on place and voicing phonetic features are then made using a separate set of APs extracted using the landmarks.

The framework exploits two properties of the knowledge-based acoustic cues of phonetic features: (1) sufficiency of the acoustic cues of a phonetic feature for a decision on that feature and (2) invariance of the acoustic cues with respect to context. The probabilistic framework makes the acoustic-phonetic approach to speech recognition suitable for practical recognition tasks as well as compatible with probabilistic pronunciation and language models. Support vector machines (SVMs) are applied for the binary classification tasks because of two favorable properties: good generalization and the ability to learn from a relatively small amount of high-dimensional data. Performance comparable to Hidden Markov Model (HMM) based systems is obtained on landmark detection as well as isolated word recognition. Applications to rescoring of lattices from a large vocabulary continuous speech recognizer are also presented.
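
The idea of constraining landmark sequences with a manner class pronunciation graph can be illustrated with a small sketch. It is not the dissertation's implementation. It assumes that each frame already has probabilities for the three binary manner features (made-up numbers here, standing in for the classifier outputs), that the features combine into four manner classes through a simple hypothetical hierarchy (vowel, sonorant consonant, fricative, stop), and that each vocabulary word is a single linear manner-class sequence rather than a full graph. The function names, the toy vocabulary, and the frame values are all illustrative assumptions.

```python
import math

def manner_posteriors(p_son, p_syl, p_cont):
    """Combine binary-feature probabilities into manner-class posteriors.

    Assumed hierarchy (illustrative only): sonorant frames split on
    syllabic, non-sonorant frames split on continuant.
    """
    return {
        "V": p_son * p_syl,                    # vowel
        "SC": p_son * (1.0 - p_syl),           # sonorant consonant
        "Fr": (1.0 - p_son) * p_cont,          # fricative
        "ST": (1.0 - p_son) * (1.0 - p_cont),  # stop
    }

def align(frames, manner_seq):
    """Viterbi-style alignment of frames to an ordered manner-class sequence.

    Every class must cover at least one frame. Returns the best total
    log-posterior and the landmarks, i.e. (frame, new_class) at each change.
    """
    T, J = len(frames), len(manner_seq)
    neg = float("-inf")
    if T < J:
        return neg, []
    score = [[neg] * T for _ in range(J)]
    back = [[0] * T for _ in range(J)]

    def logp(t, j):
        return math.log(max(frames[t][manner_seq[j]], 1e-12))

    score[0][0] = logp(0, 0)
    for t in range(1, T):
        for j in range(J):
            stay = score[j][t - 1]
            move = score[j - 1][t - 1] if j > 0 else neg
            prev = max(stay, move)
            if prev == neg:
                continue  # state j is not reachable at time t
            back[j][t] = j if stay >= move else j - 1
            score[j][t] = prev + logp(t, j)

    # Trace back the best state path from the final class at the last frame.
    path = [J - 1]
    for t in range(T - 1, 0, -1):
        path.append(back[path[-1]][t])
    path.reverse()
    landmarks = [(t, manner_seq[path[t]])
                 for t in range(1, T) if path[t] != path[t - 1]]
    return score[J - 1][T - 1], landmarks

def recognize(frames, vocabulary):
    """Pick the word whose manner sequence aligns with the highest score."""
    results = {w: align(frames, seq) for w, seq in vocabulary.items()}
    best = max(results, key=lambda w: results[w][0])
    return best, results[best][1]

# Made-up per-frame outputs: (p_sonorant, p_syllabic, p_continuant).
raw = [(0.1, 0.2, 0.9)] * 3 + [(0.9, 0.9, 0.5)] * 3
frames = [manner_posteriors(*f) for f in raw]
vocabulary = {"see": ["Fr", "V"], "me": ["SC", "V"]}  # hypothetical lexicon
word, landmarks = recognize(frames, vocabulary)
print(word, landmarks)
```

With these toy values the sketch selects "see" and places a single landmark (a vowel onset) at frame 3. Place and voicing decisions, which the abstract describes as a later step using APs extracted around the landmarks, are not modeled here.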

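Bibliographic record

Author: Juneja, Amit
Author affiliation: University of Maryland, College Park
Degree-granting institution: University of Maryland, College Park
Subject: Engineering, Electronics and Electrical
Degree: Ph.D.
Year: 2004
Pages: 169
Original format: PDF
Language of text: English
Chinese Library Classification: Radio electronics; telecommunications technology
Date added to database: 2022-08-17 11:44:04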

Similar literature

1. Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech [J]. Krerksak Likitsupin, Proadpran Punyabukkana, Chai Wutiwiwatchai. Engineering Journal. 2016, No. 2
2. Incorporating finer acoustic phonetic features in lexicon for Hindi language speech recognition [J]. Journal of Information and Optimization Sciences. 2019, No. 8
3. Canonicalization of Feature Parameters for Robust Speech Recognition Based on Distinctive Phonetic Feature (DPF) Vectors [J]. Mohammad NURUL HUDA, Muhammad GHULAM, Takashi FUKUDA. IEICE Transactions on Information and Systems. 2008, No. 3
4. An acoustic-phonetic feature-based system for automatic phoneme recognition in continuous speech [C]. Abdelatty Ali, A.M., van der Spiegel, . 1999
5. Synergy of acoustic-phonetics and auditory modeling towards robust speech recognition [D]. Deshmukh, Om D. 2006
6. Speech Recognition and Acoustic Features in Combined Electric and Acoustic Stimulation [O]. Yang-soo Yoon, Yongxin Li, Qian-Jie Fu
7. Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech [O]. Krerksak Likitsupin, Proadpran Punyabukkana, Chai Wutiwiwatchai. 2016
8. Simulation and Evaluation of Phonetic Speech Recognition Techniques. Volume III. Acoustical Characteristics of Speech Sounds Systematically Arranged in Form of Tables [R]. Otten, K. W. 1964
