Conference: Evaluation of natural language and speech tools for Italian

The Vocapia Research ASR Systems for EVALITA 2011


Abstract

This document describes the automatic speech-to-text transcription systems used by Vocapia Research in the Evalita 2011 evaluation for the open unconstrained automatic speech recognition (ASR) task. The aim of this evaluation was to transcribe audio recordings of Italian parliament sessions. About 30 hours of untranscribed audio data and one year of minutes from parliament sessions were provided as the training corpus. This corpus was used to carry out an unsupervised adaptation of Vocapia's Italian broadcast speech transcription system. Transcriptions produced by two systems were submitted. The primary system has a single decoding pass and was optimized to run in real time. The contrastive system, developed in collaboration with Limsi-CNRS, has two decoding passes and runs in about 5×RT, that is, about five times real time. The case-insensitive word error rates (WER) of the primary and contrastive systems are 10.2% and 9.3%, respectively, on the Evalita development data, and 6.4% and 5.4% on the evaluation data.
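
The two metrics quoted in the abstract, case-insensitive WER and the real-time factor (×RT), can be illustrated with a minimal sketch. This is not the scoring tool used in the evaluation, which the abstract does not name. The function names `word_error_rate` and `real_time_factor` are hypothetical, and the sketch assumes whitespace tokenization and lowercasing as the only text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Case-insensitive WER: word-level edit distance divided by reference length.

    Assumption: words are separated by whitespace and lowercasing is the only
    normalization applied; real scoring tools usually normalize more.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr
    return prev[-1] / len(ref)


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Ratio of processing time to audio duration (1.0 means real time)."""
    if audio_seconds <= 0:
        raise ValueError("audio duration must be positive")
    return processing_seconds / audio_seconds
```

With these definitions, a hypothesis that drops one word from a five-word reference scores a WER of 20%, and a system that needs 300 seconds to decode 60 seconds of audio runs at 5×RT.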
