首页> 外文会议>International Workshop on Multimedia Signal Processing >Spectrogram-Based Classification Of Spoken Foul Language Using Deep CNN
【24h】

Spectrogram-Based Classification Of Spoken Foul Language Using Deep CNN

机译:基于谱图的口语犯规分类使用深CNN

获取原文

摘要

Excessive content of profanity in audio and video files has proven to shape one’s character and behavior. Currently, conventional methods of manual detection and censorship are being used. Manual censorship method is time consuming and prone to misdetection of foul language. This paper proposed an intelligent model for foul language censorship through automated and robust detection by deep convolutional neural networks (CNNs). A dataset of foul language was collected and processed for the computation of audio spectrogram images that serve as an input to evaluate the classification of foul language. The proposed model was first tested for 2-class (Foul vs Normal) classification problem, the foul class is then further decomposed into a 10-class classification problem for exact detection of profanity. Experimental results show the viability of proposed system by demonstrating high performance of curse words classification with 1.24-2.71 Error Rate (ER) for 2-class and 5.49-8.30 F1- score. Proposed Resnet50 architecture outperforms other models in terms of accuracy, sensitivity, specificity, F1-score.
机译:音频和视频文件中亵渎的过度内容已被证明是塑造一个人的性格和行为。目前,正在使用常规的手动检测和审查方法。手动审查方法是耗时和易于误导的误导。本文提出了通过深卷积神经网络(CNNS)的自动化和强大检测来智能语言审查智能模型。收集并处理犯规语言的数据集,用于计算音频频谱图图像,该图像用作评估犯规语言分类的输入。该模型最初是为2级(犯规与规范)分类问题测试时,犯规类然后进一步分解成亵渎的精确检测有10类分类问题。实验结果通过展示骂人的话分类的高性能与1.24-2.71错误率(ER)2级和5.49-8.30 F1-评分表明拟议系统的可行性。建议Reset50架构在准确性,灵敏度,特异性,F1分数方面优于其他模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号