首页> 外文期刊>Journal of computer sciences >A Content Filtering from Spam Posts on Social Media using Weighted Multimodal Approach
【24h】

A Content Filtering from Spam Posts on Social Media using Weighted Multimodal Approach

机译:使用加权多模式方法从垃圾邮件帖子的内容过滤

获取原文
           

摘要

The system for filtering spam posts on social media is preferred to obtain the relevant content and expected by users. The previous works on spam detection have been done to filter irrelevant content on email and social media based on text or image separately. Due to the social media posts are commonly in the form of image, text, or both, the multimodal data is preferred to improve the capability of system in handling filtering content on social media. In addition, a spam post containing multimodal data sometimes does not indicate spam in both data but only one. To improve the performance of system, we propose a weighted multimodal approach for filtering content from spam posts in social media using Convolutional Neural Network (CNN). The mechanism of weighted multimodal is by weighting of spam prediction results from image and text data. We also investigate the performance of CNN architectures for spam post detection that are 3-layer, 5-layer, AlexNet and VGG16. The performance of each architectures is evaluated by 8000 Indonesian posts in the form of image and text taken from Instagram posts. The results show that the highest accuracy achieves 0.9850 based on the combination of image and text by using a 5-layer architecture. The average accuracy of all CNN architectures using multimodal data is higher than only using image and text data separately.
机译:用于在社交媒体上过滤垃圾邮件帖子的系统是首选获取相关内容和用户的预期。已经完成了以前的垃圾邮件检测的工作,以便根据文本或图像分别过滤在电子邮件和社交媒体上的无关内容。由于社交媒体帖通常是图像,文本或两者的形式,多模式数据是优选提高系统在社交媒体上处理过滤内容的能力。此外,包含多模式数据的垃圾邮件帖子有时不会在两个数据中表示垃圾邮件但只有一个。为了提高系统的性能,我们提出了一种使用卷积神经网络(CNN)在社交媒体中从垃圾邮件帖子中过滤内容的加权多模态方法。加权多模式的机制是通过图像和文本数据加权垃圾邮件预测结果。我们还研究了CNN架构对垃圾邮件底层检测的性能,这是3层,5层,AlexNet和VGG16。每个架构的性能由8000个印度尼西亚帖子以Instagram帖子的图像和文本的形式评估。结果表明,通过使用5层架构,基于图像和文本的组合,最高精度达到0.9850。所有使用多模式数据的CNN架构的平均准确性高于单独使用图像和文本数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号