首页> 外文会议> >Acoustic Modeling in the STC Keyword Search System for OpenKWS 2016 Evaluation
【24h】

Acoustic Modeling in the STC Keyword Search System for OpenKWS 2016 Evaluation

机译:用于OpenKWS 2016评估的STC关键字搜索系统中的声学建模

获取原文
获取外文期刊封面目录资料

摘要

This paper describes in detail the acoustic modeling part of the keyword search system developed in the Speech Technology Center (STC) for the OpenKWS 2016 evaluation. The key idea was to utilize diversity of both sound representations and acoustic model architectures in the system. For the former, we extended speaker-dependent bottleneck (SDBN) approach to the multilingual case, which is the main contribution of the paper. Two types of multilingual SDBN features were applied in addition to conventional spectral and cepstral features. The acoustic model architectures employed in the final system are based on deep feedforward and recurrent neural networks. We also applied speaker adaptation of acoustic models using multilingual i-vectors, speed perturbation based data augmentation and semi-supervised training. Final STC system comprised 9 acoustic models, which allowed it to achieve strong performance and to be among the top three systems in the evaluation.
机译:本文详细介绍了语音技术中心(STC)为OpenKWS 2016评估开发的关键字搜索系统的声学建模部分。关键思想是利用系统中声音表示和声学模型架构的多样性。对于前者,我们将说话人相关瓶颈(SDBN)方法扩展到了多语言案例,这是本文的主要贡献。除了常规的频谱和倒谱特征外,还应用了两种类型的多语言SDBN特征。最终系统中采用的声学模型架构基于深度前馈和递归神经网络。我们还使用多语言i向量,基于速度扰动的数据增强和半监督训练对声学模型进行了说话人自适应。最终的STC系统包括9个声学模型,这使其具有强大的性能,并成为评估中排名前三的系统之一。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号