Source: Springer Open Choice (US National Institutes of Health literature collection)

Automatically identifying the function and intent of posts in underground forums


Abstract

The automatic classification of posts from hacking-related online forums is of potential value for understanding user behaviour in social networks relating to cybercrime. We designed an annotation schema to label forum posts for three properties: post type, author intent, and addressee. The post type indicates whether the text is a question, a comment, and so on. The author’s intent in writing the post could be positive, negative, moderating discussion, showing gratitude to another user, etc. The addressee of a post tends to be either a general audience (e.g. other forum users) or individual users who have already contributed to a threaded discussion. We manually annotated a sample of posts and obtained substantial agreement for post type and addressee, and fair agreement for author intent. We trained rule-based (logical) and machine learning (statistical) classification models to predict these labels automatically, and found that a hybrid logical–statistical model performs best for post type and author intent, whereas a purely statistical model is best for addressee. We discuss potential applications for this data, including the analysis of thread conversations in forum data and the identification of key actors within social networks.
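
As an illustration of what a hybrid logical–statistical classifier for post type might look like, here is a minimal sketch in Python. It is not the authors' implementation: the single rule (a trailing question mark), the label names `question` and `comment`, the naive Bayes fallback, and every function and class name are assumptions made for the example. The abstract does not say which rules or statistical models were used.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def rule_label(text):
    """Logical layer (hypothetical rule): a post ending in '?' is a question.

    Returns a label when the rule fires, otherwise None.
    """
    if text.strip().endswith("?"):
        return "question"
    return None


class NaiveBayes:
    """Statistical layer: multinomial naive Bayes with add-one smoothing."""

    def fit(self, texts, labels):
        self.label_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        total = sum(self.label_counts.values())
        tokens = tokenize(text)
        best_label, best_score = None, -math.inf
        for label, n in self.label_counts.items():
            # Log prior plus smoothed log likelihood of each token.
            score = math.log(n / total)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for w in tokens:
                score += math.log((self.word_counts[label][w] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def hybrid_predict(text, model):
    """Use the rule's label if a rule fires; otherwise ask the model."""
    return rule_label(text) or model.predict(text)
```

In this sketch the rule layer takes precedence and the statistical model handles only posts that no rule covers. That is one of several plausible ways to combine the two; the rule output could instead be fed to the model as a feature.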
