首页> 外国专利> Data shredding for speech recognition acoustic model training under data retention restrictions

Data shredding for speech recognition acoustic model training under data retention restrictions

机译:在数据保留限制下用于语音识别声学模型训练的数据粉碎

摘要

Training speech recognizers, e.g., their language or acoustic models, using actual user data is useful, but retaining personally identifiable information may be restricted in certain environments due to regulations. Accordingly, a method or system is provided for enabling training of an acoustic model which includes dynamically shredding a speech corpus to produce text segments and depersonalized audio features corresponding to the text segments. The method further includes enabling a system to train an acoustic model using the text segments and the depersonalized audio features. Because the data is depersonalized, actual data may be used, enabling speech recognizers to keep up-to-date with user trends in speech and usage, among other benefits.
机译:使用实际用户数据来训练语音识别器(例如其语言或声学模型)是有用的,但是由于法规的原因,保留个人身份信息可能会受到限制。因此,提供了一种用于使得能够训练声学模型的方法或系统,该方法或系统包括动态切碎语音语料库以产生文本片段和与该文本片段相对应的去个性化的音频特征。该方法还包括使系统能够使用文本片段和去个性化的音频特征来训练声学模型。由于数据是非个人化的,因此可以使用实际数据,从而使语音识别器能够及时了解用户的语音和使用趋势,以及其他好处。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号