首页> 外文会议>Workshop on e-Commerce and NLP >Detect Profane Language in Streaming Services to Protect Young Audiences
【24h】

Detect Profane Language in Streaming Services to Protect Young Audiences

机译:检测流媒体服务中的亵渎语言以保护年轻观众

获取原文

摘要

With the rapid growth of online video streaming, recent years have seen increasing concerns about profane language in their content. Detecting profane language in streaming services is challenging due to the long sentences appeared in a video. While recent research on handling long sentences has focused on developing deep learning modeling techniques, little work has focused on techniques on improving data pipelines. In this work, we develop a data collection pipeline to address long sequence of texts and integrate this pipeline with a multi-head self-attention model. With this pipeline, our experiments show the self-attention model offers 12.5% relative accuracy improvement over state-of-the-art distilBERT model on profane language detection while requiring only 3% of parameters. This research designs a better system for informing users of profane language in video streaming services.
机译:随着在线视频流的快速增长,近年来人们越来越担心视频内容中的亵渎语言。由于视频中出现了很长的句子,在流媒体服务中检测亵渎语言是一项挑战。虽然最近关于处理长句的研究主要集中在开发深度学习建模技术上,但很少有研究集中在改进数据管道的技术上。在这项工作中,我们开发了一个数据收集管道来处理长序列的文本,并将该管道与一个多头自我注意模型相结合。通过这条管道,我们的实验表明,与最先进的distilBERT模型相比,自我注意模型在亵渎语言检测方面的相对准确率提高了12.5%,而只需要3%的参数。本研究设计了一个更好的系统,用于在视频流服务中告知用户亵渎语言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号