Zero-resource audio-only spoken term detection based on a combination of template matching techniques

Abstract

Spoken term detection is a well-known information retrieval task that seeks to extract contentful information from audio by locating occurrences of known query words of interest. This paper describes a zero-resource approach to this task based on pattern matching of spoken term queries at the acoustic level. The template matching module comprises the cascade of a segmental variant of dynamic time warping and a self-similarity matrix comparison to further improve robustness to speech variability. This solution notably differs from more traditional train-and-test methods that, while shown to be very accurate, rely upon the availability of large amounts of linguistic resources. We evaluate our framework on different parameterizations of the speech templates: raw MFCC features, Gaussian posteriorgrams, and French and English phonetic posteriorgrams output by two different state-of-the-art phoneme recognizers.
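
The cascade described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes plain dynamic time warping over fixed-length sliding windows as a stand-in for the segmental variant, cosine distance between feature frames, and a mean absolute difference between self-similarity matrices as the second-stage check. All function names, the hop size, and the thresholds are hypothetical choices made for illustration.

```python
import numpy as np

def cosine_dist(a, b):
    """Pairwise cosine distance between rows of a (n, d) and rows of b (m, d)."""
    a = a / (np.linalg.norm(a, axis=1, keepdims=True) + 1e-12)
    b = b / (np.linalg.norm(b, axis=1, keepdims=True) + 1e-12)
    return np.clip(1.0 - a @ b.T, 0.0, 2.0)

def dtw_cost(query, segment):
    """Classic DTW; returns the accumulated cost divided by n + m."""
    d = cosine_dist(query, segment)
    n, m = d.shape
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best_prev = min(acc[i - 1, j], acc[i, j - 1], acc[i - 1, j - 1])
            acc[i, j] = d[i - 1, j - 1] + best_prev
    return acc[n, m] / (n + m)

def resample(x, length):
    """Nearest-frame resampling of a feature sequence to a fixed length."""
    idx = np.linspace(0, len(x) - 1, length).round().astype(int)
    return x[idx]

def ssm_distance(query, segment, size=20):
    """Mean absolute difference between the two self-similarity matrices."""
    q = resample(query, size)
    s = resample(segment, size)
    return float(np.mean(np.abs(cosine_dist(q, q) - cosine_dist(s, s))))

def detect(query, utterance, hop=5, dtw_thresh=0.1, ssm_thresh=0.1):
    """Slide a query-length window over the utterance (an assumed
    simplification) and keep windows that pass both stages of the cascade."""
    n = len(query)
    hits = []
    for start in range(0, len(utterance) - n + 1, hop):
        seg = utterance[start:start + n]
        cost = dtw_cost(query, seg)
        # Second stage runs only on candidates that survive the DTW stage.
        if cost <= dtw_thresh and ssm_distance(query, seg) <= ssm_thresh:
            hits.append((start, cost))
    return hits
```

In this sketch the inputs are arrays of shape (frames, dimensions), such as MFCC vectors or posteriorgrams. The thresholds are placeholders and would need tuning on real data.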

Bibliographic record

Source
Annual Conference of the International Speech Communication Association (INTERSPEECH 2011) | 2011 | pp. 928-931 | 4 pages
Authors
Armando Muscariello; Guillaume Gravier; Frederic Bimbot
Original format: PDF
Chinese Library Classification: Communications
Keywords
spoken term detection; template matching; unsupervised learning; posterior features

Similar documents

1. Detection of densely dispersed spherical bubbles in digital images based on a template matching technique - Application to wet foams [J]. Zabulis X, Papara M, Chatziargyriou A. Colloids and Surfaces, A. Physicochemical and Engineering Aspects. 2007, Issue 1a3
2. Model-Based Unsupervised Spoken Term Detection with Spoken Queries [J]. Chan C.-A., Lee L.-S. Audio, Speech, and Language Processing, IEEE Transactions on. 2013, Issue 7
3. Defect detection method using deep convolutional neural network, support vector machine and template matching techniques [J]. Fusaomi Nagata, Kenta Tokuno, Kazuki Mitarai. Artificial life and robotics. 2019, Issue 4
4. Spoken Term Detection Based on Acoustic Models Trained in Multiple Languages for Zero-Resource Language [C]. Satoru Mizuochi, Yuya Chiba, Takashi Nose. IEEE Global Conference on Consumer Electronics. 2020
5. Discriminative Articulatory Feature-based Pronunciation Models with Application to Spoken Term Detection [D]. Prabhavalkar, Rohit. 2013
6. A Model-Based 3D Template Matching Technique for Pose Acquisition of an Uncooperative Space Object [O]. Roberto Opromolla, Giancarmine Fasano, Giancarlo Rufino. 2015
7. EFFICIENT SYSTEM COMBINATION FOR SYLLABLE-CONFUSION-NETWORK-BASED CHINESE SPOKEN TERM DETECTION [O]. Jie Gao, Qingwei Zhao, Yonghong Yan. 2013
8. Data Fusion and Correlation Technique Testbed (DFACTT): Analysis Tools for Emitter Fix Clustering and Doctrinal Template Matching [R]. Mikulin, L., Elsaesser, D. 1994
