Utterance normalization using vowel features in a spoken word recognition system for multiple speakers


Abstract

The authors propose a novel normalization method based on a linear transformation of the acoustic features of input speech, using only one isolated utterance of each of the five Japanese vowels from each individual speaker. Experiments on isolated word recognition combining the proposed normalization method with multiple-template DP matching showed a marked improvement in the recognition rate, especially for smaller numbers of templates per word. The proposed method gives consistently higher word recognition scores than the four-dimensional representation based on the Karhunen-Loeve transformation, and also gives higher scores than the original 16-dimensional representation of filter-bank outputs, especially when the number of templates is small. Together with the fact that this method reduces the dimension of the feature vector by a factor of four, the results demonstrate the validity of the proposed method.
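The abstract names multiple-template DP matching but gives no details of it. The following is a minimal sketch of that general technique only, under stated assumptions: Euclidean frame distance, a symmetric step pattern, and path-length normalization by the sum of the sequence lengths are all choices made here for illustration, not details confirmed by the paper. The function names and the template layout are hypothetical, and the vowel-based normalization itself is not reproduced.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two feature vectors of equal length (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dp_distance(seq_a, seq_b):
    """DP-matching (dynamic time warping) distance between two frame sequences.

    The accumulated cost is divided by len(seq_a) + len(seq_b) so that
    utterances of different durations are comparable (an assumption).
    """
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    # cost[i][j] = best accumulated cost aligning the first i frames of seq_a
    # with the first j frames of seq_b.
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m] / (n + m)


def recognize(utterance, templates):
    """Return the word whose closest template has the smallest DP distance.

    templates: dict mapping each word to a list of template sequences
    (hypothetical layout; several templates per word).
    """
    best_word, best_score = None, float("inf")
    for word, sequences in templates.items():
        score = min(dp_distance(utterance, t) for t in sequences)
        if score < best_score:
            best_word, best_score = word, score
    return best_word
```

In a setup like the one the abstract describes, each frame would be a four-dimensional normalized feature vector rather than a 16-dimensional filter-bank output, which makes every call to frame_distance cheaper.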


Bibliographic record

Source
IEEE International Conference on Acoustics, Speech, and Signal Processing | 1993 | 4 pages
Authors
Ohno S.; Hirose K.; Institute of Electrical and Electronics Engineers
Original format: PDF
Chinese Library Classification: Communication theory

