首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Woefzela - An open-source platform for ASR data collection in the developing world
【24h】

Woefzela - An open-source platform for ASR data collection in the developing world

机译:Woefzela-发展中国家用于ASR数据收集的开源平台

获取原文

摘要

Building transcribed speech corpora for under-resourced languages plays a pivotal role in developing speech technologies for such languages. We have developed an open-source tool for devices running the Android operating system to facilitate the efficient collection of speech data for Automatic Speech Recognition system development. The tool was designed for use in typical developing-world conditions; we present the relevant design choices and analyse the effectiveness of this tool by means of a case study. In particular, we introduce a novel semi-real-time quality monitoring system, which increases the efficiency of the data collection process.
机译:为资源不足的语言建立转录语音语料库在开发此类语言的语音技术中起着至关重要的作用。我们已经为运行Android操作系统的设备开发了一个开源工具,以促进为自动语音识别系统开发高效地收集语音数据。该工具旨在用于典型的发展中国家条件。我们提供了相关的设计选择,并通过案例研究分析了该工具的有效性。特别是,我们引入了一种新颖的半实时质量监控系统,该系统提高了数据收集过程的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号