首页> 外文会议>IEEE Annual Information Technology, Electronics and Mobile Communication Conference >A Mel-Filterbank and MFCC-based Neural Network Approach to Train the Houston Toad Call Detection System Design
【24h】

A Mel-Filterbank and MFCC-based Neural Network Approach to Train the Houston Toad Call Detection System Design

机译:基于Mel-Filterbank和MFCC的神经网络方法训练休斯顿蟾蜍呼叫检测系统设计

获取原文

摘要

Speaker recognition or voice detection is a state-of-art in the field of signal processing which includes human as well as animal. This paper proposes a naive approach to build a predictor model to detect the Houston Toad mating call signature in an audio file which can be paraphrased as toad voice activity detection. To accomplish that, several ideal toad call voice frames of unique characteristics in audio files have been experienced. The audio file is bandpass filtered, and then preprocessed by multiplying every frame with the hamming window to break into segments. Next, the Mel-Filterbank and Mel-Frequency Spectral Coefficient (MFCC) are used for feature extraction, while the Support Vector Machine (SVM) and Multi-layer Perceptron (MLP) neural networks are utilized as classifiers to determine the best fit. This experimental result reflects the higher accuracy of the MLP neural network over SVM showing the best potential of classification.
机译:说话人识别或语音检测是信号处理领域的最新技术,包括人类和动物。本文提出了一种幼稚的方法来构建预测器模型,以检测音频文件中的Houston Toad交配呼叫签名,可以将其解释为蟾蜍语音活动检测。为此,已经体验了音频文件中具有独特特征的几个理想的蟾蜍呼叫语音帧。音频文件经过带通滤波,然后通过将每个帧乘以汉明窗进行预处理,以分成多个部分。接下来,将梅尔滤波器组和梅尔频率谱系数(MFCC)用于特征提取,而将支持向量机(SVM)和多层感知器(MLP)神经网络用作分类器以确定最佳拟合。该实验结果反映了MLP神经网络在SVM上具有更高的准确性,显示出了最佳的分类潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号