首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >An auto-encoder based approach to unsupervised learning of subword units
【24h】

An auto-encoder based approach to unsupervised learning of subword units

机译:基于自动编码器的子词单元无监督学习方法

获取原文

摘要

In this paper we propose an autoencoder-based method for the unsupervised identification of subword units. We experiment with different types and architectures of autoencoders to asses what autoencoder properties are most important for this task. We first show that the encoded representation of speech produced by standard autencoders is more effective than Gaussian posteriorgrams in a spoken query classification task. Finally we evaluate the subword inventories produced by the proposed method both in terms of classification accuracy in a word classification task (with lexicon size up to 263 words) and in terms of consistency between subword transcription of different word examples of a same word type. The evaluation is carried out on Italian and American English datasets.
机译:在本文中,我们提出了一种基于自动编码器的方法,用于子词单元的无监督识别。我们尝试使用不同类型和体系结构的自动编码器,以评估哪些自动编码器属性对于此任务最重要。我们首先显示在语音查询分类任务中,由标准autencoders产生的语音的编码表示比高斯后验图更有效。最后,我们根据单词分类任务(词典大小最大为263个单词)中的分类准确度以及相同单词类型的不同单词示例的子单词转录之间的一致性,来评估通过该方法产生的子单词清单。评估是在意大利和美国英语数据集上进行的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号