首页> 外文会议> >Utterance normalization using vowel features in a spoken word recognition system for multiple speakers

【24h】

Utterance normalization using vowel features in a spoken word recognition system for multiple speakers

机译：在多个说话者的语音识别系统中使用元音功能进行话语归一化

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The authors propose a novel method of normalization based on linear transformation of acoustic features of input speech using only one isolated utterance each of the five vowels of Japanese by each individual speaker. Experiments on isolated word recognition combining the proposed normalization method and multiple-template DP matching showed a marked improvement in the recognition rate, especially for smaller numbers of templates per word. The proposed method gives consistently higher word recognition scores than the four-dimensional representation on the Karhunen-Loeve transformation, and also gives higher scores than the original 16-dimensional representation of filter-bank outputs, especially when the number of templates is small. Together with the fact that this method reduces the dimension of the feature vector by a factor of four, the results demonstrate the validity of the proposed method.

机译：作者提出了一种新的归一化方法，该方法基于输入语音的声学特征的线性变换，每个说话者仅使用五个日语元音中的每个独立发音即可。将提出的归一化方法与多模板DP匹配相结合的孤立单词识别实验表明，识别率有了显着提高，尤其是对于每个单词的模板数量较少的情况。与Karhunen-Loeve变换上的四维表示相比，该方法给出的单词识别分数始终较高，并且与原始16维表示的滤波器组输出相比，得分更高，尤其是在模板数量较少时。结合该方法将特征向量的维数减小四倍的事实，结果证明了该方法的有效性。

著录项

来源
《》|1993年|P.578-581|共4页
会议地点
作者
Ohno; S.; Hirose; K.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Commentary: Revisiting vocal perception in non-human animals: a review of vowel discrimination, speaker voice recognition, and speaker normalization [J] . Linda Polka, Ocke-Schwen Bohn, Daniel J. Weiss Frontiers in Psychology . 2015,第4期

机译：评论：重新审视非人类动物的声音感知：元音辨别力，说话人语音识别和说话人正常化的综述
2. Revisiting vocal perception in non-human animals: a review of vowel discrimination, speaker voice recognition, and speaker normalization [J] . Buddhamas Kriengwatana, Paola Escudero, Carel ten Cate Frontiers in Psychology . 2014,第4期

机译：重温非人类动物的声音感知：元音辨别力，说话人语音识别和说话人正常化的综述
3. Histogram equalization using a reduced feature set of background speakers’ utterances for speaker recognition [J] . Myung-jae?Kim, Il-ho?Yang, Min-seok?Kim, Frontiers of Information Technology & Electronic Engineering . 2017,第5期

机译：使用减少的背景说话者特征集进行直方图均衡以识别说话者
4. Utterance normalization using vowel features in a spoken word recognition system for multiple speakers [C] . Ohno S., Hirose K., Institute of Electric and Electronic Engineer IEEE international conference on acoustics, speech, and signal processing . 1993

机译：用于多个扬声器的口语识别系统中的元音特征的话语归一化
5. Speech repairs, intonational boundaries and discourse markers: Modeling speakers' utterances in spoken dialog. [D] . Heeman, Peter Anthony. 1997

机译：语音修复，国际边界和话语标记：在语音对话中模拟说话者的话语。
6. Revisiting vocal perception in non-human animals: a review of vowel discrimination speaker voice recognition and speaker normalization [O] . Buddhamas Kriengwatana, Paola Escudero, Carel ten Cate 2014

机译：重温非人类动物的声音感知：元音辨别说话人语音识别和说话人正常化的综述
7. On-Line, Real-Time Spoken Words Recognition System with Learning Capability of Speaker Differences. [O] . Sakai Toshiyuki, Nakagawa Seiichi 1976

机译：具有说话人差异学习能力的在线实时口语单词识别系统。
8. Investigation of Speaker-Independent Word Recognition Using Multiple Features, Decision Mechanisms, and Template Sets [R] . Brusuelas, M. A. 1986

机译：使用多个特征，决策机制和模板集研究与说话者无关的单词识别

Utterance normalization using vowel features in a spoken word recognition system for multiple speakers

摘要

著录项

相似文献

相关主题

期刊订阅