International Conference on Text, Speech and Dialogue

Phonetic Spoken Term Detection in Large Audio Archive Using the WFST Framework


Abstract

The paper presents a technique for phonetic spoken term detection in large audio archives. It is designed within the framework of weighted finite-state transducers (WFSTs) and uses the recently developed notion of factor automata, which we have enhanced with score normalization and a technique for systematic query expansion. The expansion allows for phone deletions and substitutions and thereby compensates for frequent pronunciation imperfections and for systematic phoneme interchanges that occur during ASR decoding. The experiments presented in the paper show that the new WFST-based method outperforms the baseline system in both search performance and speed. Finally, the paper discusses issues with the proposed techniques that need to be addressed before they are applied to real-life tasks.
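The query expansion mentioned in the abstract can be illustrated with a small sketch. This is not the paper's WFST implementation: it enumerates variants of a phone sequence in plain Python, allowing at most one deletion or substitution per variant. The substitution table, the cost values, and the function name `expand_query` are all assumptions made for illustration only.

```python
from typing import Dict, List, Tuple

# Hypothetical phone confusions and costs (assumed values, not from the paper).
SUBSTITUTIONS: Dict[str, Dict[str, float]] = {
    "s": {"z": 1.0},
    "t": {"d": 1.0},
}
DELETION_COST = 2.0  # assumed penalty for dropping one phone


def expand_query(phones: List[str]) -> Dict[Tuple[str, ...], float]:
    """Map each variant with at most one edit to its lowest cost."""
    variants: Dict[Tuple[str, ...], float] = {tuple(phones): 0.0}

    def add(seq: List[str], cost: float) -> None:
        key = tuple(seq)
        # Skip empty sequences and keep only the cheapest cost per variant.
        if key and cost < variants.get(key, float("inf")):
            variants[key] = cost

    for i, phone in enumerate(phones):
        # Variant with the phone at position i deleted.
        add(phones[:i] + phones[i + 1:], DELETION_COST)
        # Variants with the phone at position i replaced by a confusable phone.
        for other, cost in SUBSTITUTIONS.get(phone, {}).items():
            add(phones[:i] + [other] + phones[i + 1:], cost)
    return variants


if __name__ == "__main__":
    for variant, cost in sorted(expand_query(["t", "e", "s", "t"]).items(),
                                key=lambda item: item[1]):
        print(" ".join(variant), cost)
```

In a WFST setting the same idea would more likely be expressed as an edit transducer composed with the query, so that the variants are never enumerated explicitly; the enumeration here only makes the expanded search space visible.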
