首页> 外文期刊>International Journal of Agricultural and Biological Engineering >DNN-HMM based acoustic model for continuous pig cough sound recognition
【24h】

DNN-HMM based acoustic model for continuous pig cough sound recognition

机译:基于DNN-HMM基于连续猪咳声声音识别的声学模型

获取原文
       

摘要

To detect the respiratory disease through pig cough sound in the early stage, a novel method based on Deep Neural Networks-Hidden Markov Model (DNN-HMM) was proposed to construct an acoustic model for continuous pig cough sound recognition. Noises in the continuous pig sounds were eliminated by the Wiener algorithm based on wavelet thresholding the multitaper spectrum, and the experimental corpus was obtained from the denoised continuous pig sounds. The 39-dimensional Mel Frequency Cepstral Coefficients (MFCC) extracted from the corpus were considered as feature vectors. Sounds in pig farms were divided into pig coughs, non-pig coughs, and silence segments. In the HMM, the number of hidden states of pig cough, non-pig cough and silence segments were 5, 5 and 3 respectively, and the observation states represented the feature vectors of the continuous pig sound signal. Based on experiments and empirical theory, the DNN model with 3 hidden layers and 100 nodes per layer was used to describe the correspondence between hidden states and observation serials. Through experiments, the context frames of DNN input were set to 5. Under the condition of optimal parameter setting, the traditional acoustic model Gaussian Mixture Model-Hidden Markov Model (GMM-HMM) was compared with DNN-HMM through a 5-fold cross-validation experiment. It was found that the Word Error Rate (WER) of each group in DNN-HMM was lower than that in GMM-HMM, and the average WER was 3.45% lower. At the same time, the best result of the DNN-HMM model was obtained with the lowest WER of 7.54%, and the average WER was 8.03%. The results showed that the method of DNN-HMM based acoustic model for continuous pig cough sound recognition was stable and reliable.
机译:为了在早期阶段通过猪咳声检测呼吸系统疾病,提出了一种基于深神经网络 - 隐马尔可夫模型(DNN-HMM)的新方法,以构建用于连续猪咳声识别的声学模型。通过基于小波阈值的维纳算法消除了连续猪声音中的噪声,从多扶手光谱扫描,实验性胼emply in of of obered连续猪声。从语料库中提取的39维膜频率焦度系数(MFCC)被认为是特征向量。猪场的声音分为猪咳,非猪咳和沉默段。在HMM中,分别分别为5,5和3的猪咳,非猪咳嗽和沉默段的隐藏状态,观察状态表示连续猪声信号的特征载体。基于实验和经验理论,使用具有3个隐藏层的DNN模型和每层100个节点来描述隐藏状态和观察序列之间的对应关系。通过实验,DNN输入的上下文帧被设置为5.在最佳参数设置的条件下,将传统的声学模型高斯混合模型 - 隐藏式马铃草模型(GMM-HMM)与DNN-HMM通过5倍交叉进行比较 - 过验证实验。发现DNN-HMM中每组的单词错误率(WER)低于GMM-HMM中的错误率(WER),平均WER为3.45%。同时,获得DNN-HMM模型的最佳效果,最低的效率为7.54%,平均WER为8.03%。结果表明,连续猪咳声声音识别的DNN-HMM基声学模型是稳定可靠的。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号