...
首页> 外文期刊>Expert systems with applications >Detecting clickbaits using two-phase hybrid CNN-LSTM biterm model
【24h】

Detecting clickbaits using two-phase hybrid CNN-LSTM biterm model

机译:使用两相混合CNN-LSTM比特频模型检测ClickBaits

获取原文
获取原文并翻译 | 示例

摘要

Clickbait indicates the type of content with an intending goal to attract the attention of readers. It has grown to become a nuisance to social media users. The purpose of clickbait is to bring an appealing link in front of users. Clickbaits seen in the form of headlines influence people to get attracted and curious to read the inside content. The content seen in the form of text on clickbait posts is very short to identify its features as clickbait. In this paper, a novel approach (two-phase hybrid CNN-LSTM Biterm model) has been proposed for modeling short topic content. The hybrid CNN-LSTM model when implemented with pre-trained GloVe embedding yields the best results based on accuracy, recall, precision, and F1-score performance metrics. The proposed model achieves 91.24%, 95.64%, 95.87% precision values for Dataset 1, Dataset 2 and Dataset 3, respectively. Eight types of clickbait such as Reasoning, Number, Reaction, Revealing, Shocking/Unbelievable, Hypothesis/Guess, Questionable, Forward referencing are classified in this work using the Biterm Topic Model (BTM). It has been shown that the clickbaits such as Shocking/Unbelievable, Hypothesis/Guess and Reaction are the highest in numbers among rest of the clickbait headlines published online. Also, a ground dataset of non-textual (image-based) data using multiple social media platforms has been created in this paper. The textual information has been retrieved from the images with the help of OCR tool. A comparative study is performed to show the effectiveness of our proposed model which helps to identify the various categories of clickbait headlines that are spread on social media platforms. (C) 2020 Elsevier Ltd. All rights reserved.
机译:ClickBait表示具有打算目标的内容类型,以吸引读者的注意。它已经发展成为社交媒体用户的滋扰。 ClickBait的目的是在用户面前带来一个吸引人的链接。单击标题形式的ClickBAITS影响人们吸引和好奇地阅读内部内容。单击“ClickBait帖子”文本形式的内容非常短暂,无法识别其作为ClickBait的功能。在本文中,已经提出了一种新的方法(两相混合CNN-LSTM比特形模型,用于建模短主题内容。使用预先训练的手套嵌入实施时,混合CNN-LSTM模型会产生基于精度,回忆,精度和F1分数性能指标的最佳结果。所提出的模型分别实现了数据集1,DataSet 2和DataSet 3的91.24%,95.64%,95.87%95.87%的精度值。八种类型的点击性,如推理,数量,反应,揭示,震惊/令人难以置信,假设/猜测,可疑,前进参考使用Biterm主题模型(BTM)在这项工作中分类。已经表明,诸如令人震惊/令人难以置信的,假设/猜测和反应的ClickBaits是在线发布的剩余点击条件的剩余标题之间的数量最高。此外,已经在本文中创建了使用多个社交媒体平台的非文本(基于图像)数据的地图数据集。在OCR工具的帮助下,已从图像中检索文本信息。进行比较研究以显示我们提出的模型的有效性,有助于识别在社交媒体平台上传播的各类点击条标题类别。 (c)2020 elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号