Voice Pathology Detection Using Deep Learning: a Preliminary Study


Abstract

This paper describes a preliminary investigation of voice pathology detection using deep neural networks (DNNs). We used voice recordings of the sustained vowel /a/ produced at normal pitch, taken from the German corpus Saarbruecken Voice Database (SVD). This corpus contains voice recordings and electroglottograph signals of more than 2,000 speakers. The idea behind the experiment is to combine convolutional layers with recurrent Long Short-Term Memory (LSTM) layers applied to the raw audio signal. Each recording was split into 64 ms Hamming-windowed segments with 30 ms overlap. Our trained model achieved 71.36% accuracy with 65.04% sensitivity and 77.67% specificity on 206 validation files, and 68.08% accuracy with 66.75% sensitivity and 77.89% specificity on 874 testing files. This is a promising result in favor of the approach because it is comparable to a similar previously published experiment that used a different methodology. Further investigation is needed to achieve state-of-the-art results.
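The segmentation step mentioned in the abstract (64 ms Hamming-windowed segments with 30 ms overlap) can be sketched as below. This is a minimal illustration under stated assumptions, not the authors' code: the function name `frame_signal`, the 50 kHz sample rate, the synthetic test tone, and the choice to drop a trailing partial frame are all hypothetical.

```python
import numpy as np


def frame_signal(signal, sample_rate, frame_ms=64.0, overlap_ms=30.0):
    """Split a 1-D signal into overlapping, Hamming-windowed frames.

    Returns an array of shape (n_frames, frame_len). A trailing partial
    frame is dropped (an assumption; the abstract does not say how the
    end of a recording is handled).
    """
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    overlap = int(round(sample_rate * overlap_ms / 1000.0))
    hop = frame_len - overlap  # step between consecutive frame starts
    if hop <= 0:
        raise ValueError("overlap must be shorter than the frame length")
    if len(signal) < frame_len:
        return np.empty((0, frame_len))

    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hamming(frame_len)
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] for i in range(n_frames)]
    )
    return frames * window  # apply the window to every frame


# Hypothetical input: one second of a 120 Hz tone at an assumed 50 kHz rate.
sample_rate = 50_000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 120.0 * t)

frames = frame_signal(tone, sample_rate)
print(frames.shape)  # (28, 3200)
```

With these assumed values each frame holds 3,200 samples and consecutive frames start 1,700 samples apart, so a one-second signal yields 28 frames. Each row could then serve as one raw-audio input segment for a convolutional and LSTM model of the kind the abstract describes.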


Bibliographic details

Source
International Work Conference on Bio-inspired Intelligence | 2017 | 165p | 4 pages in total
Authors
Pavol Harar; Jesus B. Alonso-Hernandez; Jiri Mekyska; Zoltan Galaz; Radim Burget; Zdenek Smekal
Original format
PDF
Chinese Library Classification
TP18-53
Keywords
Pathology; Convolution; Databases; Testing; Feature extraction; Support vector machines; Training

Similar literature


1. Towards robust voice pathology detection: Investigation of supervised deep learning, gradient boosting, and anomaly detection approaches across four databases [J]. Harar Pavol, Galaz Zoltan, Alonso-Hernandez Jesus B., Neural Computing & Applications. 2020, issue 20
2. Towards robust voice pathology detection: Investigation of supervised deep learning, gradient boosting, and anomaly detection approaches across four databases (vol 32, pg 15747, 2018) [J]. Harar Pavol, Galaz Zoltan, Alonso-Hernandez Jesus B., Neural Computing & Applications. 2020, issue 20
3. Detection of dementia on voice recordings using deep learning: a Framingham Heart Study [J]. Xue Chonghua, Karjadi Cody, Paschalidis Ioannis Ch., Alzheimer's Research & Therapy. 2021, issue 1
4. Voice Pathology Detection Using Deep Learning: a Preliminary Study [C]. Pavol Harar, Jesus B. Alonso-Hernandez, Jiri Mekyska, International Conference and Workshop on Bioinspired Intelligence. 2017
5. A Comparison of Vocal Health, Hygiene, and Perceptions in Student Teachers, Voice Music Majors, and Speech-Language Pathology Majors: A Preliminary Study [D]. Brown, Kenreah LaVaughn. 2017
6. A Preliminary Experience of Implementing Deep-Learning Based Auto-Segmentation in Head and Neck Cancer: A Study on Real-World Clinical Cases [O]. Yang Zhong, Yanju Yang, Yingtao Fang, 2021
7. A novel deep learning algorithm for the automatic detection of high-grade gliomas on T2-weighted magnetic resonance images. A preliminary machine learning study [O]. Mehmet Ali Atici, Seref Sagiroglu, Pinar Celtikci, 2019
