首页> 外文会议>Spoken Language Technology Workshop (SLT), 2008 IEEE >Open vocabulary spoken document retrieval by subword sequence obtained from speech recognizer

【24h】

Open vocabulary spoken document retrieval by subword sequence obtained from speech recognizer

机译：通过从语音识别器获得的子词序列检索词汇量大的口语文档

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a method for open vocabulary retrieval based on a spoken document retrieval (SDR) system using subword models. The present paper proposes a new approach to open vocabulary SDR system using subword models which do not require subword recognition. Instead, subword sequences are obtained from the phone sequence outputted containing an out of vocabulary (OOV) word, a speech recognizer outputs a word sequence whose phone sequence is considered to be similar to the OOV word. When OOV words are provided in a query, the proposed system is able to retrieve the target section by comparing the phone sequences of the query and the word sequence generated by the speech recognizer.

机译：我们提出了一种基于开放式词汇检索的方法，该方法基于使用子词模型的语音文档检索（SDR）系统。本文提出了一种不需要子词识别的使用子词模型的开放式词汇SDR系统的新方法。取而代之的是，从输出的包含单词外词（OOV）的电话序列中获得子单词序列，语音识别器输出一个单词序列，该单词序列的电话序列被认为与OOV单词相似。当在查询中提供OOV单词时，提出的系统能够通过比较查询的电话序列和语音识别器生成的单词序列来检索目标部分。

著录项

来源
《Spoken Language Technology Workshop (SLT), 2008 IEEE 》||P.301-304|共4页
会议地点
作者
Go Kuriki; Yoshiaki; Kazunori Kojima; Masaaki Ishigame; Kazuyo Tanaka; Shi-wook Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类工业技术 ;
关键词
open vocabulary; spoken document retrieval; subword; subword sequence;

机译：开放词汇语音文档检索子词子词序列;

相似文献

外文文献
中文文献
专利

1. Improved open-vocabulary spoken content retrieval with word and subword lattices using acoustic feature similarity [J] . Hung-yi Lee, Po-wei Chou, Lin-shan Lee Computer speech and language . 2014 ,第5期

机译：使用声学特征相似性改进单词和子词格的开放式语音内容检索
2. Sounds of Speech Based Spoken Document Categorization: A Subword Representation Method [J] . Weidong QU, Katsuhiko SHIRAI IEICE Transactions on Information and Systems . 2004 ,第5期

机译：基于语音的语音文档分类：子词表示方法
3. Subword-based approaches for spoken document retrieval [J] . Kenney Ng, Victor W. Zue 20f Speech Communication . 2000 ,第3期

机译：基于子词的语音文档检索方法
4. Open vocabulary spoken document retrieval by subword sequence obtained from speech recognizer [C] . Go Kuriki, Yoshiaki, Kazunori Kojima, Workshop on Spoken Language Technology . 2008

机译：通过语音识别器获得的子字序列检索开放词汇表文档
5. Audio parsing and rapid speaker adaptation in speech recognition for spoken document retrieval. [D] . Zhou, Bowen. 2003

机译：语音识别中的音频解析和快速的说话人自适应，可用于语音文档检索。
6. Subword segmentation--leveling out morphological variations for medical document retrieval. [O] . U. Hahn, M. Honeck, M. Piotrowski, 2001

机译：子词分割-整理出用于医学文档检索的形态变化。
7. The RWTH large vocabulary continuous speech recognition system and spoken document retrieval [O] . Ney Hermann, Welling Lutz, Ortmanns Stefan, 1998

机译：RWTH大词汇量连续语音识别系统和语音文档检索

Open vocabulary spoken document retrieval by subword sequence obtained from speech recognizer

摘要

著录项

相似文献

相关主题

期刊订阅