首页> 外文会议>International Work Conference on Bio-inspired Intelligence >Voice Pathology Detection Using Deep Learning: a Preliminary Study
【24h】

Voice Pathology Detection Using Deep Learning: a Preliminary Study

机译:使用深度学习的语音病理检测:初步研究

获取原文

摘要

This paper describes a preliminary investigation of Voice Pathology Detection using Deep Neural Networks (DNN). We used voice recordings of sustained vowel /a/ produced at normal pitch from German corpus Saarbruecken Voice Database (SVD). This corpus contains voice recordings and electroglottograph signals of more than 2 000 speakers. The idea behind this experiment is the use of convolutional layers in combination with recurrent Long-Short-Term-Memory (LSTM) layers on raw audio signal. Each recording was split into 64 ms Hamming windowed segments with 30 ms overlap. Our trained model achieved 71.36% accuracy with 65.04% sensitivity and 77.67% specificity on 206 validation files and 68.08% accuracy with 66.75% sensitivity and 77.89% specificity on 874 testing files. This is a promising result in favor of this approach because it is comparable to similar previously published experiment that used different methodology. Further investigation is needed to achieve the state-of-the-art results.
机译:本文介绍了使用深神经网络(DNN)的语音病理检测的初步调查。我们在德国语料库萨尔布鲁伯肯语音数据库(SVD)的正常间距时使用持续元音/ A /产生的持续元音/ A /产生的录音。该语料库包含2 000多个扬声器的录音和电凝块信号。该实验背后的想法是在原始音频信号上使用卷积层与经常性的长短期记忆(LSTM)层组合使用。每个录音都分为64毫秒的垂直窗口段,30毫秒重叠。我们训练有素的模型可实现71.36 %的精度,灵敏度为65.04 %,对206个验证文件的特异性和77.67 %的特异性,66.75 %的精度,灵敏度为66.75 %,在874个测试文件上具有77.89 %特异性。这是有利于这种方法的有希望的结果,因为它与使用不同方法的类似先前公布的实验相当。需要进一步调查来实现最先进的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号