Evaluation of Fast Spoken Term Detection Using a Suffix Array

机译：使用后缀数组的快速语音术语检测评估

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We previously proposed [1] fast spoken term detection that uses a suffix array as a data structure for searching a large-scale speech documents. In this method, a keyword is divided into sub-keywords, and the phoneme sequences that contain two or more sub-keywords are output as results. Although the search is executed very quickly on a 10,000-h speech database, we only proposed a variety of matching procedures in [1]. In this paper, we compare different varieties of matching procedures in which the number of phonemes in a sub-keyword and the required number of sub-keywords to be contained in a search result are different. We also compare the performance and the process time of our method with typical spoken term detection using an inverted index.

机译：我们先前提出了[1]快速口语术语检测，该方法使用后缀数组作为用于搜索大规模语音文档的数据结构。在该方法中，将关键字划分为子关键字，并且将包含两个或多个子关键字的音素序列作为结果输出。尽管在10,000小时的语音数据库中可以非常快速地执行搜索，但我们在[1]中仅提出了各种匹配过程。在本文中，我们比较了各种匹配过程，其中子关键字中的音素数量与搜索结果中包含的子关键字所需数量不同。我们还将使用倒排索引将我们的方法的性能和处理时间与典型的语音术语检测进行比较。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2011》|2011年|p.916-919|共4页
会议地点
作者
Kouichi Katsurada; Shinta Sawada; Shigeki Teshima; Yurie Iribe; Tsuneo Nitta;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
spoken term detection; large scale speech document; suffix array; keyword division; iterative lengthening search;

机译：语音术语检测;大型语音文件;后缀数组;关键字划分;迭代加长搜索;

相似文献

外文文献
中文文献
专利

1. System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive [J] . Josef Psutka, Jan Svec, Josef V Psutka, EURASIP journal on audio, speech, and music processing . 2011,第1期

机译：捷克文化遗产档案中的快速词汇和语音口语检测系统
2. System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive [J] . Josef Psutka, Jan ?vec, Josef V Psutka, EURASIP journal on audio, speech, and music processing . 2011,第1期

机译：捷克文化遗产档案中的快速词汇和语音口语检测系统
3. Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations [J] . Javier Tejedor, Doroteo T. Toledano, Paula Lopez-Otero, EURASIP journal on audio, speech, and music processing . 2016,第1期

机译：以示例查询的ALBAYZIN口语检测2012年和2014年评估的比较
4. Acceleration of Spoken Term Detection Using a Suffix Array by Assigning Optimal Threshold Values to Sub-Keywords [C] . Kouichi Katsurada, Seiichi Miura, Kheang Seng, Conference of the International Speech Communication Association . 2013

机译：通过将最佳阈值分配给子关键字，使用后缀阵列加速口头术语检测
5. Fast Parallel Suffix Array on the GPU. [D] . Wang, Leyuan. 2015

机译：GPU上的快速并行后缀数组。
6. RNA-Seq Mapping and Detection of Gene Fusions with a Suffix Array Algorithm [O] . Onur Sakarya, Heinz Breu, Milan Radovich, 2012

机译：后缀阵列算法的RNA-Seq定位和基因融合的检测
7. Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation [O] . Javier Tejedor, Doroteo T. Toledano, Paula Lopez-Otero, 2019

机译：从口语查询中搜索：多域国际Albayzin 2018逐个语言检测评估

Evaluation of Fast Spoken Term Detection Using a Suffix Array

摘要

著录项

相似文献

相关主题

期刊订阅