Circuits, Systems, and Signal Processing

Automatic Hypernasality Detection in Cleft Palate Speech Using CNN



Abstract

Automatic hypernasality detection in cleft palate speech can facilitate diagnosis by speech-language pathologists. This paper describes a feature-independent, end-to-end algorithm that uses a convolutional neural network (CNN) to detect hypernasality in cleft palate speech, taking a speech spectrogram as input. The average F1-scores for the hypernasality detection task are 0.9485 on a dataset spoken by children and 0.9746 on a dataset spoken by adults. The experiments explore the influence of spectral resolution on hypernasality detection performance in cleft palate speech: higher spectral resolution better highlights the vocal tract parameters associated with hypernasality, such as formants and spectral zeros. The CNN learns efficient features via two-dimensional filtering, whereas the feature extraction capability of shallow classifiers is limited; compared with a deep neural network and shallow classifiers, the CNN achieves the highest F1-score of 0.9485. Among the network architectures compared, a convolutional filter of size 1x8 achieves the highest F1-score in the hypernasality detection task; the 1x8 filter captures more frequency information and is better suited to hypernasality detection than filters of size 3x3, 4x4, 5x5, and 6x6. An analysis of hypernasality-sensitive vowels indicates that the vowel /i/ is the most sensitive to hypernasality. Compared with the state of the art, the proposed CNN-based system achieves better detection performance, and an experiment conducted on a heterogeneous corpus demonstrates that the CNN handles speech variability better than the shallow classifiers.
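The abstract reports that a 1x8 convolutional filter outperforms square filters because it covers more frequency information per response. A minimal numpy sketch of this idea (the spectrogram orientation, filter weights, and shapes here are illustrative assumptions, not the paper's architecture):

```python
import numpy as np

def conv2d_valid(x, kernel):
    """Plain 2-D valid cross-correlation (no padding, stride 1)."""
    kh, kw = kernel.shape
    out = np.empty((x.shape[0] - kh + 1, x.shape[1] - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(x[i:i + kh, j:j + kw] * kernel)
    return out

# Toy "spectrogram": rows = time frames, columns = frequency bins
spec = np.random.default_rng(0).random((40, 128))

# A 1x8 filter spans 8 adjacent frequency bins within one time frame,
# so a single response sees a band wide enough to cover a formant region,
# whereas a 3x3 filter sees only 3 bins at a time.
filt_1x8 = np.ones((1, 8)) / 8.0      # e.g. a band-averaging filter
resp = conv2d_valid(spec, filt_1x8)
print(resp.shape)  # (40, 121)
```

In a trained CNN the filter weights are learned rather than fixed averages; the sketch only illustrates why the 1x8 shape trades spatial (time) context for frequency context.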
