首页> 外文会议>International conference on healthcare science and engineering >Lung Sound Diagnosis with Deep Convolutional Neural Network and Two-Stage Pipeline Model
【24h】

Lung Sound Diagnosis with Deep Convolutional Neural Network and Two-Stage Pipeline Model

机译:深度卷积神经网络和两阶段管道模型的肺部声音诊断

获取原文

摘要

Lung sounds are very critical in the diagnosis of pulmonary disease clinically. The study of their recognition using computers is considered to be meaningful for doctors. In this paper, we proposed two methods for identifying wheeze, crackle, and normal lung sounds. We first formulate the lung sound identification problem mathematically. And then, we propose a deep convolutional neural network (CNN) model which is consisted of 9 layers (6 conv layers, 3 pooling layers, and 3 fully connected layers). Lung sound segments are extracted to obtain feature bands from log-scaled mel-frequency spectral (LMFS) and are constructed into feature maps byways of bands by frames. The second method is a two-stage pipeline model (TSPM) which is the extension of Gaussian mixture model. Forty-six traditional features are extracted and selected for our TSPM. Testing on our lung sound database recorded from a local hospital, we find that chroma features are the most important to TSPM and the F1 scores of 46 features on three types show an obvious improvement when it is compared with 24 MFCC which are shown to be the optimal features for wheeze recognition in the previous literature. And we finally find that CNN model has better recognition performance than TSPM, because F1 scores about CNN model's testing are 0.8516 for wheeze, 0.8471 for crackle, and 0.8571 for normal sound.
机译:肺部声音在临床上诊断肺病时非常关键。他们使用计算机的认可研究被认为对医生有意义。在本文中,我们提出了两种识别喘息,噼啪声和正常肺部声音的方法。我们首先在数学上制定肺部声音识别问题。然后,我们提出了一个深度卷积神经网络(CNN)模型,由9层组成(6个Conv层,3个池层和3个完全连接的层)。提取肺部声音段以获得从记录较小的熔体频谱(LMF)的特征频带,并且由帧构造成带有频带的特征映射。第二种方法是一种两级流水线模型(TSPM),其是高斯混合模型的延伸。为我们的TSPM提取和选择四十六种传统功能。在从当地医院录制的肺部声音数据库测试中,我们发现色度特征是TSPM最重要的,并且在三种类型的46个功能中的F1分数显示出明显的改进,与24 MFCC相比,这是如此以前文学中喘息识别的最佳特征。我们终于发现CNN模型具有比TSPM更好的识别性能,因为关于CNN模型的测试的F1分数为0.8516,用于喘息,0.8471,裂纹0.8571,正常声音为0.8571。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号