首页> 外国专利> MULTISTAGE CURRICULUM TRAINING FRAMEWORK FOR ACOUSTIC-TO-WORD SPEECH RECOGNITION

MULTISTAGE CURRICULUM TRAINING FRAMEWORK FOR ACOUSTIC-TO-WORD SPEECH RECOGNITION

机译:语音对语音识别的多阶段课程培训框架

摘要

Methods and apparatuses are provided for performing acoustic to word (A2W) speech recognition training performed by at least one processor. The method includes initializing, by the at least one processor, one or more first layers of a neural network with phone based Connectionist Temporal Classification (CTC), initializing, by the at least one processor, one or more second layers of the neural network with grapheme based CTC, acquiring, by the at least one processor, training data and performing, by the at least one processor, A2W speech recognition training based the initialized one or more first layers and one or more second layers of the neural network using the training data.
机译:提供了用于执行由至少一个处理器执行的语音到单词(A2W)语音识别训练的方法和装置。该方法包括:由所述至少一个处理器利用基于电话的连接器时间分类(CTC)来初始化神经网络的一个或多个第一层;由所述至少一个处理器利用以下步骤来初始化所述神经网络的一个或多个第二层:基于字素的CTC,由至少一个处理器获取训练数据,并由至少一个处理器执行基于神经网络的初始化的一个或多个第一层和一个或多个第二层的A2W语音识别训练数据。

著录项

  • 公开/公告号US2020074983A1

    专利类型

  • 公开/公告日2020-03-05

    原文格式PDF

  • 申请/专利权人 TENCENT AMERICA LLC;

    申请/专利号US201816117373

  • 发明设计人 CHENGZHU YU;CHAO WENG;JIA CUI;DONG YU;

    申请日2018-08-30

  • 分类号G10L15/06;G10L15/16;

  • 国家 US

  • 入库时间 2022-08-21 11:19:14

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号