Conference: Spoken Language Technology Workshop (SLT), 2008 IEEE

Open vocabulary spoken document retrieval by subword sequence obtained from speech recognizer

Abstract

We present a method for open-vocabulary retrieval in a spoken document retrieval (SDR) system using subword models. The proposed approach to open-vocabulary SDR uses subword models but does not require subword recognition. Instead, subword sequences are obtained from the phone sequence of the speech recognizer's output. When a spoken document contains an out-of-vocabulary (OOV) word, the speech recognizer outputs a word sequence whose phone sequence is considered to be similar to that of the OOV word. When OOV words are provided in a query, the proposed system is therefore able to retrieve the target section by comparing the phone sequence of the query with the phone sequence of the word sequence generated by the speech recognizer.
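The comparison step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how phone sequences are compared, so the edit-distance window search below is an assumption made for illustration, not the paper's method. The toy lexicon, the phone symbols, and the names `to_phones`, `edit_distance`, and `best_match` are all hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical toy pronunciation lexicon (word -> phones); not taken from the paper.
LEXICON: Dict[str, List[str]] = {
    "the": ["dh", "ah"],
    "talk": ["t", "ao", "k"],
    "about": ["ah", "b", "aw", "t"],
    "sue": ["s", "uw"],
    "nah": ["n", "aa"],
    "me": ["m", "iy"],
    "warning": ["w", "ao", "r", "n", "ih", "ng"],
}


def to_phones(words: List[str], lexicon: Dict[str, List[str]]) -> List[str]:
    """Expand a recognized word sequence into one flat phone sequence."""
    phones: List[str] = []
    for word in words:
        phones.extend(lexicon[word])
    return phones


def edit_distance(a: List[str], b: List[str]) -> int:
    """Levenshtein distance between two phone sequences (unit costs)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        curr = [i]
        for j, y in enumerate(b, 1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def best_match(query: List[str], doc: List[str], slack: int = 1) -> Tuple[float, int, int]:
    """Return (normalized distance, start, end) of the closest document window.

    Windows may be up to `slack` phones shorter or longer than the query.
    Assumed matching scheme for illustration only.
    """
    n = len(query)
    best = (float("inf"), 0, 0)
    for start in range(len(doc)):
        for length in range(max(1, n - slack), n + slack + 1):
            end = start + length
            if end > len(doc):
                break
            dist = edit_distance(query, doc[start:end]) / n
            if dist < best[0]:
                best = (dist, start, end)
    return best


# The recognizer has no entry for "tsunami", so it emits similar-sounding words.
recognized = ["the", "talk", "about", "sue", "nah", "me", "warning"]
doc_phones = to_phones(recognized, LEXICON)

# Phone sequence assumed for the OOV query word "tsunami".
query_phones = ["s", "uw", "n", "aa", "m", "iy"]

result = best_match(query_phones, doc_phones)
print(result)
```

In this toy case the query phones line up exactly with the phones of "sue nah me", so the best window has distance 0 even though the query word never appears in the recognizer output. A real system would need a threshold on the normalized distance and a mapping from phone offsets back to document sections, neither of which the abstract describes.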
