首页> 外文会议>European Signal Processing Conference >An unsupervised approach to glottal inverse filtering
【24h】

An unsupervised approach to glottal inverse filtering

机译:声门逆滤波的无监督方法

获取原文
获取外文期刊封面目录资料

摘要

The extraction of the glottal volume velocity waveform from voiced speech is a well-known example of a sparse signal recovery problem. Prior approaches have mostly used well-engineered speech processing or convex L1-optimization methods to solve the inverse filtering problem. In this paper, we describe a novel approach to modeling the human vocal tract using an unsupervised dictionary learning framework. We make the assumption of an all-pole model of the vocal tract, and derive an L1 regularized least squares loss function for the all-pole approximation. To evaluate the quality of the extracted glottal volume velocity waveform, we conduct experiments on real-life speech datasets, which include vowels and multi-speaker phonetically balanced utterances. We find that the the unsupervised model learns meaningful dictionaries of vocal tracts, and the proposed data-driven unsupervised framework achieves a performance comparable to the IAIF (Iterative Adaptive Inverse Filtering) glottal flow extraction approach.
机译:从浊音中提取声门体积速度波形是稀疏信号恢复问题的一个众所周知的例子。现有方法大多使用精心设计的语音处理或凸L1优化方法来解决逆滤波问题。在本文中,我们描述了一种使用无监督词典学习框架对人类声道建模的新颖方法。我们假设声道的全极点模型,并为全极点近似推导一个L1正则化最小二乘损失函数。为了评估提取的声门体积速度波形的质量,我们对真实的语音数据集进行了实验,其中包括元音和多说话者语音平衡的发声。我们发现无监督模型学习了有意义的声道词典,并且所提出的数据驱动无监督框架实现了与IAIF(迭代自适应逆滤波)声门流提取方法相当的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号