首页> 外国专利> FRUGAL METHOD AND SYSTEM FOR CREATING SPEECH CORPUS

FRUGAL METHOD AND SYSTEM FOR CREATING SPEECH CORPUS

机译:建立言语语料库的方法和系统

摘要

The present invention provides a frugal method for extraction of speech data and associated transcription from plurality of web resources (internet) for speech corpus creation characterized by an automation of the speech corpus creation and cost reduction. An integration of existing speech corpus with extracted speech data and its transcription from the web resources to build an aggregated rich speech corpus that are effective and easy to adapt for generating acoustic and language models for (Automatic Speech Recognition) ASR systems.
机译:本发明提供了一种节俭方法,用于从多个网络资源(互联网)中提取语音数据和相关的转录以用于语音语料库创建,其特征在于语音语料库创建的自动化和成本降低。现有语音语料库与提取的语音数据的集成及其从Web资源的转录,以构建聚合的丰富语音语料库,这些语料库有效且易于调整,以生成用于(自动语音识别)ASR系统的声学和语言模型。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号