Spoken Term Detection Using SVM-Based Classifier Trained with Pre-Indexed Keywords

Kentaro DOMOTO; Takehito UTSURO; Naoki SAWADA; Hiromitsu NISHIZAKI

Abstract

This study presents a two-stage spoken term detection (STD) method that uses the same STD engine twice and a support vector machine (SVM)-based classifier to verify detected terms from the STD engine's output. In a front-end process, the STD engine is used to pre-index target spoken documents from a keyword list built from an automatic speech recognition result. The STD result includes a set of keywords and their detection intervals (positions) in the spoken documents. For keywords having competitive intervals, we rank them based on the STD matching cost and select the one having the longest duration among competitive detections. The selected keywords are registered in the pre-index. They are then used to train an SVM-based classifier. In a query term search process, a query term is searched by the same STD engine, and the output candidates are verified by the SVM-based classifier. Our proposed two-stage STD method with pre-indexing was evaluated using the NTCIR-10 SpokenDoc-2 STD task and it drastically outperformed the traditional STD method based on dynamic time warping and a confusion network-based index.
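The pre-index selection step and the second-stage verification described above can be sketched as follows. This is only one reading of the abstract, not the authors' implementation: the `Detection` record, the `max_cost` threshold, the tie-breaking rule, and the `accept` callable (a stand-in for the trained SVM-based classifier, whose features the abstract does not specify) are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Detection:
    """One STD hit: a keyword and the interval where it was detected."""
    keyword: str
    start: float  # seconds
    end: float    # seconds
    cost: float   # STD matching cost; lower is assumed to mean a better match

    @property
    def duration(self) -> float:
        return self.end - self.start


def build_pre_index(detections: List[Detection], max_cost: float) -> List[Detection]:
    """Keep one detection per group of overlapping (competitive) intervals."""
    # Assumption: detections above a cost threshold are discarded first.
    kept = sorted((d for d in detections if d.cost <= max_cost),
                  key=lambda d: d.start)

    groups: List[List[Detection]] = []
    group_end = float("-inf")
    for d in kept:
        if d.start < group_end:
            groups[-1].append(d)               # overlaps the current group
            group_end = max(group_end, d.end)
        else:
            groups.append([d])                 # starts a new group
            group_end = d.end

    # Longest duration wins; ties go to the lower matching cost (assumption).
    return [max(g, key=lambda d: (d.duration, -d.cost)) for g in groups]


def verify(candidates: List[Detection],
           accept: Callable[[Detection], bool]) -> List[Detection]:
    """Second stage: keep only candidates the classifier accepts."""
    return [c for c in candidates if accept(c)]


# Hypothetical detections, invented purely for this example.
sample = [
    Detection("kyoto", 1.0, 1.5, 0.20),
    Detection("kyotofu", 1.0, 1.9, 0.30),  # competes with "kyoto", longer
    Detection("tokyo", 3.0, 3.6, 0.10),
    Detection("noise", 5.0, 5.2, 0.90),    # above the cost threshold
]
pre_index = build_pre_index(sample, max_cost=0.5)
# Stand-in rule in place of the SVM-based classifier.
verified = verify(pre_index, accept=lambda d: d.cost < 0.25)
```

In this toy input, "kyoto" and "kyotofu" occupy competing intervals, so only the longer one is registered in the pre-index. In the method described in the abstract, the registered keywords would then serve as training data for the SVM-based classifier rather than being filtered by a fixed rule.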

Bibliographic details

Source
IEICE Transactions on Information and Systems, 2016, No. 10, 11 pages
Authors
Kentaro DOMOTO; Takehito UTSURO; Naoki SAWADA; Hiromitsu NISHIZAKI
Original format: PDF
CLC classification: Radio electronics, telecommunications technology

