首页> 外文会议>Electrical Communications and Computers (CONIELECOMP), 2012 22nd International Conference on >Keyword word recognition using a fusion of spectral, cepstral and modulation features
【24h】

Keyword word recognition using a fusion of spectral, cepstral and modulation features

机译:结合频谱,倒谱和调制功能的关键词识别

获取原文
获取原文并翻译 | 示例

摘要

We present the results of applying a combination of features for recognizing word utterances extracted from a continuous stream of speech. Three sets of features, namely, spectral energy in Bark bands, mel frequency cepstral coefficients, and parameters from an AM-FM model, were employed for training and testing a set of keywords in the CallHome telephone speech database. A pair-wise comparison between the feature set of an unknown word utterance and that of each of the reference utterances in a dynamic time warping process showed a false negative score of 4 out of 12, and a false positive score of 5 out of 132 for a subset of speech from the database. Long, multisyllabic words were spotted correctly while two short words in the word list contributed to errors.
机译:我们提出了应用特征组合来识别从连续语音流中提取的单词话语的结果。三套功能,即Bark频段的频谱能量,梅尔频率倒谱系数和AM-FM模型的参数,被用来训练和测试CallHome电话语音数据库中的一组关键字。在动态时间规整过程中,未知单词话语的特征集与每个参考话语的特征集之间的成对比较显示,在12分中,假阴性分数为12分中的4分,而假阳性分数为132分中的5分。来自数据库的语音子集。正确地发现了多音节的长单词,而单词列表中的两个短单词则导致了错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号