首页> 外国专利> A FRUGAL METHOD AND SYSTEM FOR CREATING SPEECH CORPUS

A FRUGAL METHOD AND SYSTEM FOR CREATING SPEECH CORPUS

机译：一种建立言语语料库的方法和系统

页面导航

摘要
著录项
相似文献

摘要

The present invention provides a frugal method for extraction of speech data and associated transcription from plurality of web resources (internet) for speech corpus creation characterized by an automation of the speech corpus creation and cost reduction. An integration of existing speech corpus with extracted speech data and its transcription from the web resources to build an aggregated rich speech corpus that are effective and easy to adapt for generating acoustic and language models for (Automatic Speech Recognition) ASR systems.

机译：本发明提供了一种节俭方法，用于从多个网络资源（互联网）中提取语音数据和相关的转录以用于语音语料库创建，其特征在于语音语料库创建的自动化和成本降低。现有语音语料库与提取的语音数据的集成及其从Web资源的转录，以构建聚合的丰富语音语料库，这些语料库有效且易于调整，以生成用于（自动语音识别）ASR系统的声学和语言模型。

著录项

公开/公告号IN2011MU02148A

专利类型
公开/公告日2013-02-01

原文格式PDF
申请/专利权人
展开▼

申请/专利号IN2148/MUM/2011
发明设计人 KOPPARAPU SUNILKUMAR;SHEIKH IMRAN AHMED;
展开▼

申请日2011-07-28
分类号G10L15/00;
国家 IN
入库时间 2022-08-21 16:41:11

相似文献

专利
外文文献
中文文献