首页> 外文会议>2017 International Conference on Control, Artificial Intelligence, Robotics amp; Optimization >Voiced-Unvoiced Classification of Speech Using a Neural Network Trained with LPC Coefficients
【24h】

Voiced-Unvoiced Classification of Speech Using a Neural Network Trained with LPC Coefficients

机译:使用经过LPC系数训练的神经网络进行语音的浊音分类

获取原文
获取原文并翻译 | 示例

摘要

Voiced-Unvoiced classification (V-UV) is a well understood but still not perfectly solved problem. It tackles the problem of determining whether a signal frame contains harmonic content or not. This paper presents a new approach to this problem using a conventional multi-layer perceptron neural network trained with linear predictive coding (LPC) coefficients. LPC is a method that results in a number of coefficients that can be transformed to the envelope of the spectrum of the input frame. As a spectrum is suitable for determining the harmonic content, so are the LPC-coefficients. The proposed neural network works reasonably well compared to other approaches and has been evaluated on a small dataset of 4 different speakers.
机译:语音清音分类(V-UV)是一个很好理解的问题,但仍不能完全解决。它解决了确定信号帧是否包含谐波含量的问题。本文提出了一种使用线性预测编码(LPC)系数训练的常规多层感知器神经网络解决此问题的新方法。 LPC是一种产生许多系数的方法,这些系数可以转换为输入帧频谱的包络。由于频谱适合确定谐波含量,因此LPC系数也是如此。与其他方法相比,拟议的神经网络工作得相当好,并且已在由4个不同说话者组成的小型数据集中进行了评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号