首页> 外文会议>International Conference on Computing for Sustainable Global Development >A stacked sparse autoencoder based architecture for Punjabi and English spoken language classification using MFCC features
【24h】

A stacked sparse autoencoder based architecture for Punjabi and English spoken language classification using MFCC features

机译:基于叠稀疏的AutoEncoder基于Punjabi和英语口语语言分类的基于MFCC功能的架构

获取原文

摘要

Spoken language classification is an important task in speech processing. It can serve as a preprocessing step in the pipeline of automated understanding of the semantics of human speech. The paper proposes a Sparse Autoencoder based architecture for Punjabi and English spoken language Classification. A number of shallow architectures namely Soft-max classifier, SVM and deep architectures namely Artificial Neural Networks, SVM with Sparse Auto encoder and Softmax with sparse auto encoder (with and without fine tuning) have been compared on the same task. For this purpose noisy speech samples of 1 second using frame based MFCC coefficients of speech samples have been considered. Comparison of Principal Component Analysis and Sparse autoencoder based features has also been done on the dataset.
机译:口语语言分类是语音处理中的重要任务。它可以作为对人类语音语义的自动理解管道的预处理步骤。本文提出了一种基于稀疏的自动级别基于旁遮普和英语口语语言分类的架构。许多浅架构即Soft-Max分类器,SVM和深度架构即人工神经网络,使用稀疏自动编码器和带有稀疏自动编码器(带有且无需微调)的SoftMax的SVM已经进行了比较。对于此目的,已经考虑了使用基于帧的MFCC系数的语音样本的1秒的嘈杂语音样本。在数据集上也已经在基于主成分分析和基于稀疏的AutoEncoder的特征的比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号