首页> 外文会议>IEEE Workshop on Automatic Speech Recognition and Understanding >The development of the cambridge university alignment systems for the multi-genre broadcast challenge
【24h】

The development of the cambridge university alignment systems for the multi-genre broadcast challenge

机译:剑桥大学对准系统的发展,为多类型的广播挑战

获取原文

摘要

We describe the alignment systems developed both for the preparation of data for the Multi-Genre Broadcast (MGB) challenge and for our participation in the transcription and alignment tasks. Captions of varying quality are aligned with the audio of TV shows that range from few minutes long to more than six hours. Lightly supervised decoding is performed on the audio and the output text is aligned with the original text transcript. Reliable split points are found and the resulting text chunks are force-aligned with the corresponding audio segments. Confidence scores are associated with the aligned data. Multiple refinements ??? including audio segmentation based on deep neural networks (DNNs) and the use of DNN-based acoustic models ??? were used to improve the performance. The final MGB alignment system had the highest F-measure value on the evaluation data.
机译:我们描述了为编写多类型广播(MGB)挑战的数据以及参与转录和对准任务的对准系统。不同质量的标题与电视音频对齐,显示范围从几分钟长到超过六个小时。在音频上执行轻微监督的解码,并且输出文本与原始文本转录物对齐。找到可靠的分割点,并使用相应的音频段强制对齐生成的文本块。置信分数与对齐数据相关联。多种改进???包括基于深神经网络(DNN)的音频分割以及基于DNN的声学模型的使用???用于提高性能。最终MGB对准系统对评估数据的F测量值最高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号