首页> 外国专利> METHODS, COMPUTING DEVICES, AND STORAGE MEDIA FOR GENERATING TRAINING CORPUS

METHODS, COMPUTING DEVICES, AND STORAGE MEDIA FOR GENERATING TRAINING CORPUS

机译:生成训练语料库的方法,计算设备和存储介质

摘要

The present disclosure provides methods, computing devices, and storage media for generating a training corpus. The method includes: mining out pieces of data from user behavior logs associated with a target application, each piece of data including a first behavior log and a second behavior log, the first behavior log including a user speech and a corresponding speech recognition result, the second behavior log belonging to the same user as the first behavior log and time-dependent with the first behavior log; and determining the user speech and the corresponding speech recognition result in each piece of data as a positive feedback sample or a negative feedback sample, based on the first behavior log and the second behavior log.
机译:本公开提供了用于生成训练语料库的方法,计算设备和存储介质。该方法包括:从与目标应用相关联的用户行为日志中挖掘出数据,每条数据包括第一行为日志和第二行为日志,第一行为日志包括用户语音和相应的语音识别结果,第二行为日志与第一行为日志属于同一用户,并且与第一行为日志在时间上相关;基于第一行为日志和第二行为日志,确定每条数据中的用户语音和相应的语音识别结果为正反馈样本或负反馈样本。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号