AN EFFICIENT KEYWORD SPOTTING TECHNIQUE USING A COMPLEMENTARY LANGUAGE FOR FILLER MODELS TRAINING

机译：使用补充语言进行填充模型训练的有效关键词发现技术

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The task of keyword spotting is to detect a set of keywords in the input continuous speech. In a keyword spotter, not only the keywords, but also the non-keyword intervals must be modeled. For this purpose, filler (or garbage) models are used. To date, most of the keyword spotters have been based on hidden Markov models (HMM). More specifically, a set of HMM is used as garbage models. In this paper, a two-pass keyword spotting technique based on bilingual hidden Markov models is presented. In the first pass, our technique uses phonemic garbage models to represent the non-keyword intervals, and in the second stage the putative hits are verified using normalized scores. The main difference from similar approaches lies in the way the non-keyword intervals are modeled. In this work, the target language is Japanese, and English was chosen as the 'garbage' language for training the phonemic garbage models. Experimental results on both clean and noisy telephone speech data showed higher performance compared with using a common set of acoustic models. Moreover, parameter tuning (e.g. word insertion penalty tuning) does not have a serious effect on the performance. For a vocabulary of 100 keywords and using clean telephone speech test data we achieved a 92.04% recognition rate with only a 7.96% false alarm rate, and without word insertion penalty tuning. Using noisy telephone speech test data we achieved a 87.29% recognition rate with only a 12.71% false alarm rate.

机译：关键字发现的任务是检测输入的连续语音中的一组关键字。在关键字搜索器中，不仅必须对关键字进行建模，而且还必须对非关键字间隔进行建模。为此，使用填充（或垃圾）模型。迄今为止，大多数关键字搜寻器都基于隐马尔可夫模型（HMM）。更具体地说，将一组HMM用作垃圾模型。本文提出了一种基于双语隐马尔可夫模型的两遍关键词发现技术。在第一遍中，我们的技术使用音位垃圾模型来表示非关键字间隔，在第二阶段中，使用归一化分数来验证推定命中。与类似方法的主要区别在于对非关键字间隔进行建模的方式。在这项工作中，目标语言是日语，并且英语被选为用于训练音位垃圾模型的“垃圾”语言。与使用普通的声学模型集相比，在干净和嘈杂的电话语音数据上的实验结果均显示出更高的性能。而且，参数调整（例如单词插入罚分调整）对性能没有严重影响。对于一个包含100个关键字的词汇表以及使用干净的电话语音测试数据，我们实现了92.04％的识别率，而误报率仅为7.96％，并且没有单词插入惩罚调整。使用嘈杂的电话语音测试数据，我们达到了87.29％的识别率，而误报率仅为12.71％。

著录项

来源
《European Conference on Speech Communication and Technology - EUROSPEECH 2003(INTERSPEECH 2003) vol.2; 20030901-04; Geneva(CH)》|2003年|P.921-924|共4页
会议地点 Geneva(CH)
作者
Panikos Heracleous; Tohru Shimizu;
展开▼
作者单位

KDDI RD Laboratories, Inc, Japan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动信息理论;
关键词

相似文献

外文文献
中文文献
专利

1. Feature learning for efficient ASR-free keyword spotting in low-resource languages [J] . Ewald van der Westhuizen, Herman Kamper, Raghav Menon, Computer speech and language . 2022,第Jana期

机译：特征学习以低资源语言的高效无论是无ASR的关键字拍摄
2. Training Set Selection For Building Compact And Efficient Language Models [J] . Keiji YASUDA, Hirofumi YAMAMOTO, Eiichiro SUMITA IEICE Transactions on Information and Systems . 2009,第3期

机译：构建紧凑高效的语言模型的训练集选择
3. Proposed Technique for Efficient Cloud Computing Model in Effective Digital Training Towards Sustainable Livelihoods for Unemployed Youths [J] . Bansal Ritu, Singh Vikash Kumar International journal of cloud applications and computing . 2020,第4期

机译：有效数字培训中有效数字培养的高效云计算模型的提出技术，失业青少年的可持续生计
4. AN EFFICIENT KEYWORD SPOTTING TECHNIQUE USING A COMPLEMENTARY LANGUAGE FOR FILLER MODELS TRAINING [C] . Panikos Heracleous, Tohru Shimizu, International Speech Communication Association(ISCA) European Conference on Speech Communication and Technology . 2003

机译：使用填充模型培训的互补语言的有效关键字发现技术
5. Whisper speech processing: Analysis, modeling, and detection with applications to keyword spotting. [D] . Zhang, Chi. 2012

机译：悄悄话语处理：分析，建模和检测，以及关键词发现的应用。
6. Online keyword searching in three countries and languages reflects different perceptions and behaviors in response to the name of the novel coronavirus disease [O] . Renyu Liu, Jonathan R Gavrin, Lee A Fleisher 2020

机译：在三个国家和语言中搜索的在线关键字反映了不同的看法和行为以回应新型冠状病毒病的名称
7. An Efficient Keyword Spotting Technique Using a Complementary Language for Filler Models Training [O] . Panikos Heracleous, Tohru Shimizu 2003

机译：使用补充语言进行填充模型训练的高效关键词发现技术

AN EFFICIENT KEYWORD SPOTTING TECHNIQUE USING A COMPLEMENTARY LANGUAGE FOR FILLER MODELS TRAINING

摘要

著录项

相似文献

相关主题

期刊订阅