首页> 外文会议>IEEE Automatic Speech Recognition and Understanding Workshop >Multilingual Bottleneck Features for Query by Example Spoken Term Detection
【24h】

Multilingual Bottleneck Features for Query by Example Spoken Term Detection

机译:通过示例口语检测查询的多语言瓶颈功能

获取原文

摘要

State of the art solutions to query by example spoken term detection (QbE-STD) rely on bottleneck feature representation of the query and audio document. Here, we present a study on QbE-STD performance using several monolingual as well as multilingual bottleneck features extracted from feed forward networks. In contrast to previous works, we use multitask learning to train the multilingual networks which perform significantly better than the concatenated monolingual features. Additionally, we propose to employ residual networks (ResNet) to estimate the bottleneck features and show significant improvements over the corresponding feed forward network based features. The neural networks are trained on GlobalPhone corpus and QbE-STD experiments are performed on a very challenging QUESST 2014 database
机译:通过示例口语检测(QbE-STD)进行查询的最新技术解决方案依赖于查询和音频文档的瓶颈特征表示。在这里,我们使用从前馈网络中提取的几种单语言和多语言瓶颈功能,对QbE-STD性能进行了研究。与以前的工作相比,我们使用多任务学习来训练比连接的单语功能明显更好的多语网络。此外,我们建议采用残差网络(ResNet)来评估瓶颈功能,并显示相对于基于相应前馈网络的功能的显着改进。在GlobalPhone语料库上训练神经网络,并在一个极富挑战性的QUESST 2014数据库上进行QbE-STD实验

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号