首页> 外文会议>International conference on security management;SAM 2011 >Twitter on Drugs: Pharmaceutical Spam in Tweets
【24h】

Twitter on Drugs: Pharmaceutical Spam in Tweets

机译:Twitter上的毒品:推文中的药品垃圾邮件

获取原文

摘要

Twitter presents a new forum for spammers to facilitate illegal pharmaceutical scams. We present a classification scheme using decision strategy and data mining techniques taking into account the unbalanced nature of the data set. Four classifiers are used to identify pharmaceutical spam tweets. Classifiers J48 and Random Tree (RT) are generated by Weka tools, and classifiers DL(J48) and DL(RT) are based on the combination of J48 and RT with the decision matrix. The classifiers were tested using manually labeled data sets collected at different time spans. Experimental results suggest that the combination of RT with the decision matrix provides a stable performance improvement over using standalone tree-based classifiers.
机译:Twitter为垃圾邮件发送者提供了一个新的论坛,以促进非法药物诈骗。考虑到数据集的不平衡性,我们提出了一种使用决策策略和数据挖掘技术的分类方案。四个分类器用于识别垃圾邮件推文。分类器J48和随机树(RT)由Weka工具生成,分类器DL(J48)和DL(RT)基于J48和RT与决策矩阵的组合。使用在不同时间跨度收集的手动标记的数据集对分类器进行了测试。实验结果表明,与使用独立的基于树的分类器相比,RT与决策矩阵的组合可提供稳定的性能提升。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号