首页> 外文会议>IEEE International Workshop on Machine Learning for Signal Processing >Vowel duration measurement using deep neural networks
【24h】

Vowel duration measurement using deep neural networks

机译:使用深神经网络的元音持续时间测量

获取原文

摘要

Vowel durations are most often utilized in studies addressing specific issues in phonetics. Thus far this has been hampered by a reliance on subjective, labor-intensive manual annotation. Our goal is to build an algorithm for automatic accurate measurement of vowel duration, where the input to the algorithm is a speech segment contains one vowel preceded and followed by consonants (CVC). Our algorithm is based on a deep neural network trained at the frame level on manually annotated data from a phonetic study. Specifically, we try two deep-network architectures: convolutional neural network (CNN), and deep belief network (DBN), and compare their accuracy to an HMM-based forced aligner. Results suggest that CNN is better than DBN, and both CNN and HMM-based forced aligner are comparable in their results, but neither of them yielded the same predictions as models fit to manually annotated data.
机译:在解决语音学中的特定问题的研究中,最常用的元音持续时间。因此,这一直受到主观,劳动密集型手动注释的依赖受到阻碍。我们的目标是建立一种用于自动精确测量元音持续时间的算法,其中输入到算法的输入是语音段,其中包含一个先前的一个元音,然后是辅音(CVC)。我们的算法基于从语音研究的手动注释数据上训练的深神经网络。具体而言,我们尝试两个深网络架构:卷积神经网络(CNN)和深度信仰网络(DBN),并将其准确性与基于HMM的强制对准器进行比较。结果表明CNN优于DBN,CNN和基于HMM的强制对准器的结果相当,但是它们均未产生与模型适合手动注释数据的相同预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号