首页> 外文会议>IEEE Annual Information Technology ,Electronics and Mobile Communication Conference >A Mel-Filterbank and MFCC-based Neural Network Approach to Train the Houston Toad Call Detection System Design
【24h】

A Mel-Filterbank and MFCC-based Neural Network Approach to Train the Houston Toad Call Detection System Design

机译:梅尔过滤器和基于MFCC的神经网络方法培训休斯顿蟾蜍呼叫检测系统设计

获取原文

摘要

Speaker recognition or voice detection is a state-of-art in the field of signal processing which includes human as well as animal. This paper proposes a naive approach to build a predictor model to detect the Houston Toad mating call signature in an audio file which can be paraphrased as toad voice activity detection. To accomplish that, several ideal toad call voice frames of unique characteristics in audio files have been experienced. The audio file is bandpass filtered, and then preprocessed by multiplying every frame with the hamming window to break into segments. Next, the Mel-Filterbank and Mel-Frequency Spectral Coefficient (MFCC) are used for feature extraction, while the Support Vector Machine (SVM) and Multi-layer Perceptron (MLP) neural networks are utilized as classifiers to determine the best fit. This experimental result reflects the higher accuracy of the MLP neural network over SVM showing the best potential of classification.
机译:扬声器识别或语音检测是在信号处理领域的最先进的,其包括人类以及动物。本文提出了一种天真的方法来构建预测模型,可以在音频文件中检测休斯顿蟾蜍交配呼叫签名,该音频文件可以被解释为蟾蜍语音活动检测。为实现这一目标,已经遇到了几种理想的蟾蜍呼叫音频文件中独特特征的语音帧。音频文件是带通滤波,然后通过将每个帧与汉明窗口乘以分解到段来预处理。接下来,将Mel-FilterBank和熔融频谱系数(MFCC)用于特征提取,而支持向量机(SVM)和多层的Perceptron(MLP)神经网络被用作分类器以确定最合适的分类器。该实验结果反映了MLP神经网络通过SVM的更高精度,显示出最佳分类潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号