首页> 外文期刊>Biomedical signal processing and control >An efficient approach using HOS-based parameters in the LPC residual domain to classify breathy and rough voices
【24h】

An efficient approach using HOS-based parameters in the LPC residual domain to classify breathy and rough voices

机译:在LPC残差域中使用基于HOS的参数进行分类的有效方法

获取原文
获取原文并翻译 | 示例
           

摘要

Although a considerable number of studies have been focused on the analysis of pathological voices using conventional parameters such as jitter, shimmer, and signal-to-noise ratio (SNR), these parameters have been found to be sensitive to variations in pitch extraction algorithm and cannot analyze severely disordered voice signals which exhibit irregular or aperiodic waveforms. In this paper, higher-order statistics (HOSs) analysis, which is independent of pitch period, is derived from linear predictive coding (LPC) residuals to describe breathy and rough voices. Recordings of a sustained /a/ from 23 individuals with breathy voices and 30 individuals with rough voices were collected from the disordered voice database distributed by the Japanese Society of Logopedics and Phoniatrics. We extracted conventional parameters as well as the HOS-based parameters such as the normalized skewness and the normalized kurtosis. On the other hand, we calculated HOS-based parameters from the LPC residual domain. The results showed that the HOS-based parameters and the HOS-based parameters estimated from the LPC residual are different for rough and breathy voices. Conventional parameters were not distinctive for these voices. Classification and regression tree (CART) was used to combine multiple parameters and to classify breathy and rough voices. Using the HOS-based parameters, the CART achieved an accuracy of 85.0% with the optimal decision tree generated by means of the normalized skewness and kurtosis. When the HOS-based parameters using LPC residual were used, the optimal decision tree was 88.7% accurate and the variances of the normalized skewness and kurtosis were included.
机译:尽管大量研究集中在使用常规参数(如抖动,闪光和信噪比(SNR))对病理语音进行分析,但已发现这些参数对音高提取算法和无法分析表现出不规则或非周期性波形的严重混乱的语音信号。本文从线性预测编码(LPC)残差推导了高阶统计(HOS)分析,该分析与音高周期无关,用于描述呼吸音和粗声。从由日本语和语音科学学会分发的无序语音数据库中收集了来自23个具有呼吸声音的个体和30个具有粗糙声音的个体的持续/ a /的记录。我们提取了常规参数以及基于居屋的参数,例如归一化偏度和归一化峰度。另一方面,我们从LPC残差域计算了基于HOS的参数。结果表明,粗糙和呼吸声的基于HOS的参数和根据LPC残差估计的基于HOS的参数是不同的。常规参数对于这些声音并没有区别。分类和回归树(CART)用于组合多个参数,并对呼吸声和粗略声音进行分类。使用基于HOS的参数,CART通过归一化偏度和峰度生成的最佳决策树达到了85.0%的精度。当使用基于LPC残差的基于HOS的参数时,最佳决策树的准确度为88.7%,并且包括归一化偏度和峰度的方差。

著录项

  • 来源
    《Biomedical signal processing and control》 |2011年第2期|p.186-196|共11页
  • 作者单位

    Division of Otolaryngology—Head and Neck Surgery, University of Wisconsin Medical School, 5745a Medical Sciences Center, 1300 University Avenue, Madison, WI 53706, United States,Division of Multimedia Communications and Processing, School of Engineering, Korea Advanced Institute of Science and Technology, 335 Gwahangno, Yuseong-gu, Daejeon 305-701, South Korea;

    Division of Electrical Engineering, School of Electrical and Electronic Engineering, Gyeongsang National University, 900 Gajwa-dongjinju 660-701, South Korea;

    Division of Multimedia Communications and Processing, School of Engineering, Korea Advanced Institute of Science and Technology, 335 Gwahangno, Yuseong-gu, Daejeon 305-701, South Korea;

    Division of Otolaryngology—Head and Neck Surgery, University of Wisconsin Medical School, 5745a Medical Sciences Center, 1300 University Avenue, Madison, WI 53706, United States;

    Division of Otolaryngology—Head and Neck Surgery, University of Wisconsin Medical School, 5745a Medical Sciences Center, 1300 University Avenue, Madison, WI 53706, United States;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    pathological voice quality; higher-order statistics; linear predictive coding; classification and regression tree;

    机译:病理语音质量;高阶统计量;线性预测编码;分类回归树;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号