...
首页> 外文期刊>Biomedical and Health Informatics, IEEE Journal of >Collection and Analysis of a Parkinson Speech Dataset With Multiple Types of Sound Recordings
【24h】

Collection and Analysis of a Parkinson Speech Dataset With Multiple Types of Sound Recordings

机译:具有多种录音类型的帕金森语音数据集的收集和分析

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

There has been an increased interest in speech pattern analysis applications of Parkinsonism for building predictive telediagnosis and telemonitoring models. For this purpose, we have collected a wide variety of voice samples, including sustained vowels, words, and sentences compiled from a set of speaking exercises for people with Parkinson's disease. There are two main issues in learning from such a dataset that consists of multiple speech recordings per subject: 1) How predictive these various types, e.g., sustained vowels versus words, of voice samples are in Parkinson's disease (PD) diagnosis? 2) How well the central tendency and dispersion metrics serve as representatives of all sample recordings of a subject? In this paper, investigating our Parkinson dataset using well-known machine learning tools, as reported in the literature, sustained vowels are found to carry more PD-discriminative information. We have also found that rather than using each voice recording of each subject as an independent data sample, representing the samples of a subject with central tendency and dispersion metrics improves generalization of the predictive model.
机译:在建立预测性远程诊断和远程监控模型方面,人们对帕金森氏病的语音模式分析应用越来越感兴趣。为此,我们收集了各种语音样本,包括针对帕金森氏病患者的一系列口语练习汇编的持续元音,单词和句子。从这样的数据集中学习存在两个主要问题,该数据集包含每个主题的多个语音记录:1)在帕金森氏病(PD)诊断中,语音样本的这些各种类型(例如,持续元音与单词)的预测性如何? 2)集中趋势和分散度指标可以很好地代表一个主题的所有样本记录吗?在本文中,如文献所报道,使用著名的机器学习工具对我们的帕金森数据集进行调查,发现持续元音可以承载更多的PD区分信息。我们还发现,与其将每个主题的每个语音记录用作独立的数据样本,不如使用具有集中趋势和离散度的指标来表示一个主题的样本,可以改善预测模型的通用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号