The Vocapia Research ASR Systems for EVALITA 2011

Abstract

This document describes the automatic speech-to-text transcription systems used by Vocapia Research in the EVALITA 2011 evaluation for the open unconstrained automatic speech recognition (ASR) task. The aim of this evaluation was to perform automatic speech recognition of parliament audio sessions in the Italian language. About 30h of untranscribed audio data and one year of minutes from parliament sessions were provided as a training corpus. This corpus was used to carry out an unsupervised adaptation of Vocapia's Italian broadcast speech transcription system. Transcriptions produced by two systems were submitted. The primary system has a single decoding pass and was optimized to run in real time. The contrastive system, developed in collaboration with LIMSI-CNRS, has two decoding passes and runs in about 5×RT. The case-insensitive word error rates (WER) of these systems are 10.2% and 9.3%, respectively, on the EVALITA development data and 6.4% and 5.4% on the evaluation data.
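As an illustration of the metric quoted above, the following is a minimal sketch of a case-insensitive WER computation: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` and the example sentences are hypothetical, and the sketch lowercases and splits on whitespace only; the scoring procedure actually used in the evaluation is not described in the abstract and may apply further text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Case-insensitive WER: word-level edit distance / reference length.

    Assumption: whitespace tokenization and lowercasing are the only
    normalization steps; official scoring tools may do more.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical example: one deleted word out of five reference words.
    print(word_error_rate("Il Senato approva la legge",
                          "il senato approva legge"))
```

Because insertions count as errors, WER computed this way can exceed 100% when the hypothesis is much longer than the reference.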
