首页> 外国专利> Data shredding for speech recognition acoustic model training under data retention restrictions

Data shredding for speech recognition acoustic model training under data retention restrictions

机译：在数据保留限制下用于语音识别声学模型训练的数据粉碎

页面导航

摘要
著录项
相似文献

摘要

Training speech recognizers, e.g., their language or acoustic models, using actual user data is useful, but retaining personally identifiable information may be restricted in certain environments due to regulations. Accordingly, a method or system is provided for enabling training of an acoustic model which includes dynamically shredding a speech corpus to produce text segments and depersonalized audio features corresponding to the text segments. The method further includes enabling a system to train an acoustic model using the text segments and the depersonalized audio features. Because the data is depersonalized, actual data may be used, enabling speech recognizers to keep up-to-date with user trends in speech and usage, among other benefits.

机译：使用实际用户数据来训练语音识别器（例如其语言或声学模型）是有用的，但是由于法规的原因，保留个人身份信息可能会受到限制。因此，提供了一种用于使得能够训练声学模型的方法或系统，该方法或系统包括动态切碎语音语料库以产生文本片段和与该文本片段相对应的去个性化的音频特征。该方法还包括使系统能够使用文本片段和去个性化的音频特征来训练声学模型。由于数据是非个人化的，因此可以使用实际数据，从而使语音识别器能够及时了解用户的语音和使用趋势，以及其他好处。

著录项

公开/公告号US9514741B2

专利类型
公开/公告日2016-12-06

原文格式PDF
申请/专利权人 NUANCE COMMUNICATIONS INC.;
展开▼

申请/专利号US201313800764
发明设计人 WILLIAM F. GANONG III;PHILIP CHARLES WOODLAND;UWE HELMUT JOST;SYED RAZA SHAHID;PAUL J. VOZILA;MARCEL KATZ;
展开▼

申请日2013-03-13
分类号G10L15/00;G10L15/06;G06F21/62;G06F21/00;G10L15/187;G10L15/02;
国家 US
入库时间 2022-08-21 13:41:07

相似文献

专利
外文文献
中文文献