首页> 外国专利> Data augmentation method based on stochastic feature mapping for automatic speech recognition

Data augmentation method based on stochastic feature mapping for automatic speech recognition

机译：基于随机特征映射的数据增强自动语音识别方法

页面导航

摘要
著录项
相似文献

摘要

A method of augmenting training data includes converting a feature sequence of a source speaker determined from a plurality of utterances within a transcript to a feature sequence of a target speaker under the same transcript, training a speaker-dependent acoustic model for the target speaker for corresponding speaker-specific acoustic characteristics, estimating a mapping function between the feature sequence of the source speaker and the speaker-dependent acoustic model of the target speaker, and mapping each utterance from each speaker in a training set using the mapping function to multiple selected target speakers in the training set.

机译：一种增强训练数据的方法，包括将从转录本中的多个发声确定的源说话者的特征序列转换为相同转录本下的目标说话者的特征序列，为目标说话者训练与说话者相关的声学模型，以便进行相应的调整。特定于说话者的声学特征，估计源说话者的特征序列与目标说话者的与说话者相关的声学模型之间的映射函数，并使用该映射函数将训练集中来自每个说话者的每个发音映射到多个选定的目标说话者在训练集中。

著录项

公开/公告号US9721559B2

专利类型
公开/公告日2017-08-01

原文格式PDF
申请/专利权人 INTERNATIONAL BUSINESS MACHINES CORPORATION;
展开▼

申请/专利号US201514689730
发明设计人 XIAODONG CUI;BRIAN E. D. KINGSBURY;VAIBHAVA GOEL;
展开▼

申请日2015-04-17
分类号G10L15/16;G10L15/06;G10L15/02;
国家 US
入库时间 2022-08-21 13:43:37

相似文献

专利
外文文献
中文文献