Multitask Feature Learning for Low-Resource Query-by-Example Spoken Term Detection

Hongjie Chen; Cheung-Chi Leung; Lei Xie; Bin Ma; Haizhou Li

Abstract

We propose a novel technique that learns a low-dimensional feature representation from unlabeled data of a target language, and labeled data from a nontarget language. The technique is studied as a solution to query-by-example spoken term detection (QbE-STD) for a low-resource language. We extract low-dimensional features from a bottle-neck layer of a multitask deep neural network, which is jointly trained with speech data from the low-resource target language and resource-rich nontarget language. The proposed feature learning technique aims to extract acoustic features that offer phonetic discriminability. It explores a new way of leveraging cross-lingual speech data to overcome the resource limitation in the target language. We conduct QbE-STD experiments using the dynamic time warping distance of the multitask bottle-neck features between the query and the search database. The QbE-STD process does not rely on an automatic speech recognition pipeline of the target language. We validate the effectiveness of multitask feature learning through a series of comparative experiments.
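The matching step described in the abstract, comparing query features against a search utterance by dynamic time warping (DTW), can be illustrated with a minimal sketch. The abstract does not specify the frame-level distance, the DTW variant, or any normalization, so everything below is an assumption: the sketch uses cosine distance, a subsequence DTW that lets the query start and end anywhere in the utterance, and division by query length. The names `cosine_distance` and `subsequence_dtw` are hypothetical, and plain lists of floats stand in for the multitask bottle-neck features.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 1.0  # assumption: treat an all-zero frame as dissimilar
    return 1.0 - dot / (norm_u * norm_v)


def subsequence_dtw(query, utterance, dist=cosine_distance):
    """Align the whole query against any contiguous region of the utterance.

    Returns (cost, end_frame): the accumulated frame distance of the best
    alignment divided by the query length, and the index of the utterance
    frame where that alignment ends. Lower cost means a better match.
    """
    n, m = len(query), len(utterance)
    if n == 0 or m == 0:
        raise ValueError("query and utterance must be non-empty")

    # First query frame may match any utterance frame at no extra cost
    # (free start), which is what makes this a subsequence search.
    prev = [dist(query[0], utterance[j]) for j in range(m)]

    for i in range(1, n):
        curr = [0.0] * m
        curr[0] = prev[0] + dist(query[i], utterance[0])
        for j in range(1, m):
            # Allowed steps: vertical, diagonal, horizontal.
            best = min(prev[j], prev[j - 1], curr[j - 1])
            curr[j] = best + dist(query[i], utterance[j])
        prev = curr

    # Free end: pick the utterance frame with the cheapest full alignment.
    end = min(range(m), key=prev.__getitem__)
    return prev[end] / n, end


if __name__ == "__main__":
    # Toy 2-dimensional "features"; real bottle-neck features would come
    # from the trained network, which is not reproduced here.
    query = [[1.0, 0.0], [0.0, 1.0]]
    utterance = [[1.0, 1.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    cost, end = subsequence_dtw(query, utterance)
    print(cost, end)
```

In a detection setting, each utterance in the search database would be scored this way and ranked by cost. Because the comparison works directly on frame-level features, it needs no speech recognizer for the target language, which is consistent with the abstract's description.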

Bibliographic details

Source
IEEE Journal of Selected Topics in Signal Processing | 2017, Issue 8 | pp. 1329-1339 | 11 pages
Authors
Hongjie Chen; Cheung-Chi Leung; Lei Xie; Bin Ma; Haizhou Li
Original format: PDF
Language: English
Keywords
Speech; Feature extraction; Neural networks; Speech processing; Gaussian mixture model

Similar literature

1. Multilingual query-by-example spoken term detection in Indian languages [J]. Abhimanyu Popli, Arun Kumar. International Journal of Speech Technology. 2019, No. 1.
2. Vocal Tract Length Normalization using a Gaussian mixture model framework for query-by-example spoken term detection [J]. Madhavi Maulik C., Patil Hemant A. Computer Speech and Language. 2019, Nov. issue.
3. Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection [C]. Yougen Yuan, Cheung-Chi Leung, Lei Xie. IEEE International Conference on Acoustics, Speech and Signal Processing. 2017.
4. Discriminative Articulatory Feature-based Pronunciation Models with Application to Spoken Term Detection [D]. Rohit Prabhavalkar. 2013.
5. Multitask feature learning approach for knowledge graph enhanced recommendations with RippleNet [O]. YueQun Wang, LiYan Dong, YongLi Li. 2021.
6. Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation [O]. Javier Tejedor, Doroteo T. Toledano, Paula Lopez-Otero. 2019.
