首页> 外文会议>International Workshop on Multimedia Signal Processing >Spectrogram-Based Classification Of Spoken Foul Language Using Deep CNN

【24h】

Spectrogram-Based Classification Of Spoken Foul Language Using Deep CNN

机译：基于谱图的口语犯规分类使用深CNN

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Excessive content of profanity in audio and video files has proven to shape one’s character and behavior. Currently, conventional methods of manual detection and censorship are being used. Manual censorship method is time consuming and prone to misdetection of foul language. This paper proposed an intelligent model for foul language censorship through automated and robust detection by deep convolutional neural networks (CNNs). A dataset of foul language was collected and processed for the computation of audio spectrogram images that serve as an input to evaluate the classification of foul language. The proposed model was first tested for 2-class (Foul vs Normal) classification problem, the foul class is then further decomposed into a 10-class classification problem for exact detection of profanity. Experimental results show the viability of proposed system by demonstrating high performance of curse words classification with 1.24-2.71 Error Rate (ER) for 2-class and 5.49-8.30 F1- score. Proposed Resnet50 architecture outperforms other models in terms of accuracy, sensitivity, specificity, F1-score.

机译：音频和视频文件中亵渎的过度内容已被证明是塑造一个人的性格和行为。目前，正在使用常规的手动检测和审查方法。手动审查方法是耗时和易于误导的误导。本文提出了通过深卷积神经网络（CNNS）的自动化和强大检测来智能语言审查智能模型。收集并处理犯规语言的数据集，用于计算音频频谱图图像，该图像用作评估犯规语言分类的输入。该模型最初是为2级（犯规与规范）分类问题测试时，犯规类然后进一步分解成亵渎的精确检测有10类分类问题。实验结果通过展示骂人的话分类的高性能与1.24-2.71错误率（ER）2级和5.49-8.30 F1-评分表明拟议系统的可行性。建议Reset50架构在准确性，灵敏度，特异性，F1分数方面优于其他模型。

著录项

来源
《International Workshop on Multimedia Signal Processing 》|2020年|1-6|共6页
会议地点
作者
Abdulaziz Saleh Ba Wazir; Hezerul Abdul Karim; Mohd Haris Lye Abdullah; Sarina Mansor; Nouar AlDahoul; Mohammad Faizal Ahmad Fauzi; John See;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Visualization; Image recognition; Speech recognition; Manuals; Censorship; Convolutional neural networks; Erbium;

机译：可视化;图像识别;语音识别;手册;审查;卷积神经网络;erbium;

相似文献

外文文献
中文文献
专利

1. OCR with the Deep CNN Model for Ligature Script-Based Languages like Manchu [J] . Diandian Zhang, Yan Liu, Zhuowei Wang, Scientific programming . 2021 ,第a期

机译：OCR与深入的CNN模型，即用于满族的结扎脚本语言
2. Classification and similarity analysis of fundamental frequency patterns in infant spoken language acquisition [J] . Hiroko Kato Solvang, Masanobu Taniguchi, Tomohiro Nakatani, Statistical Methodology . 2008 ,第3期

机译：婴儿口语习得中基本频率模式的分类和相似性分析
3. An elitist approach to automatic articulatory-acoustic feature classification for phonetic characterization of spoken language [J] . Shuangyu Chang, Mirjam Wester, Steven Greenberg Speech Communication . 2005 ,第3期

机译：语音自动语音特征的精英主义方法
4. Power Signal Classification with Combinational Spectrogram-based CNN for Embedded System Health Management [C] . Heoncheol Lee, Yongsung Kwon, Kipyo Kim International Conference on Control, Automation and Systems . 2018

机译：基于组合频谱图的CNN进行功率信号分类，用于嵌入式系统健康管理
5. Deep Neural Language Model for Text Classification Based on Convolutional and Recurrent Neural Networks [D] . Hassan, Abdalraouf. 2018

机译：基于卷积神经网络和递归神经网络的深度神经语言文本分类模型
6. Design and Implementation of Fast Spoken Foul Language Recognition with Different End-to-End Deep Neural Network Architectures [O] . Abdulaziz Saleh Ba Wazir, Hezerul Abdul Karim, Mohd Haris Lye Abdullah, 2021

机译：不同端到端深神经网络架构的快速口语臭语识别的设计与实现
7. Deep Collaborative Attention Network for Hyperspectral Image Classification by Combining 2-D CNN and 3-D CNN [O] . Hao Guo, Jianjun Liu, Jinlong Yang, 2020

机译：通过组合2-D CNN和3-D CNN，深度协作关注网络进行高光谱图像分类

Spectrogram-Based Classification Of Spoken Foul Language Using Deep CNN

摘要

著录项

相似文献

相关主题

期刊订阅