首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >An Unsupervised Adaptation Approach to Leveraging Feedback Loop Data by Using i-Vector for Data Clustering and Selection
【24h】

An Unsupervised Adaptation Approach to Leveraging Feedback Loop Data by Using i-Vector for Data Clustering and Selection

机译:通过使用i-Vector进行数据聚类和选择的反馈回路数据的无监督自适应方法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

We present a study of using unsupervised adaptation approaches to improve speech recognition accuracy of a deployed speech service by leveraging large-scale untranscribed speech data collected from a feedback loop (FBL). For a regular user with lots of adaptation utterances, conventional CMLLR-based adaptation can be used for personalization directly. For a casual user with a few adaptation utterances, we propose to use CMLLR-based adaptation by augmenting his / her adaptation utterances with utterances acoustically close to the user, which are selected from the FBL data by an i-vector based approach. For a new user, we propose to perform a CMLLR-based recognition of an unknown utterance by selecting a set of CMLLR transforms from the most similar cluster, which are pre-trained by using the utterances from the corresponding cluster generated by an i-vector based utterance clustering method from the FBL data. The effectiveness of the above approaches are confirmed by our experiments on a short message dictation task on smart phones.
机译:我们提出了一项利用无监督自适应方法,以通过利用从反馈环路(FBL)收集的大规模未转录语音数据来提高已部署语音服务的语音识别准确性的研究。对于具有大量适配话语的常规用户,常规的基于CMLLR的适配可以直接用于个性化。对于具有少量适应性话语的休闲用户,我们建议使用基于CMLLR的适应性,方法是通过在听觉上接近用户的话语来增强他/她的适应性话语,这些话语是通过基于i向量的方法从FBL数据中选择的。对于新用户,我们建议通过从最相似的簇中选择一组CMLLR变换来执行基于CMLLR的未知话语识别,这些变换通过使用i向量生成的相应簇中的话语进行预训练FBL数据基于语音的聚类方法。通过我们在智能手机上的短信命令任务的实验,证实了上述方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号