首页> 外文期刊>International Journal of Applied Engineering Research >A Novel Collapsed GIBBS Sampling Methodology for Various Categories of Corpus in Collaborative Cybercriminal Network Discovery
【24h】

A Novel Collapsed GIBBS Sampling Methodology for Various Categories of Corpus in Collaborative Cybercriminal Network Discovery

机译:协作性计算机犯罪网络发现中各种语料库的新型折叠GIBBS抽样方法

获取原文
获取原文并翻译 | 示例
           

摘要

In the recent past, there has been a continuous increase in the amount of cybercrimes that source for considerably financial loss to several organizations. Current investigations expose that cybercriminals are likely to work together or even transact cyber-attack tools through the "dark markets" established in online social media and don't mine syntactic and semantic terms for online social media messages in collaborative cybercrime activities. The foremost role of the work is to solve this problem by means of proposing a weakly supervised cybercriminal network mining called Collapsed Gibbs Sampling method for the extensively utilized Latent Dirichlet Allocation (CGSLDA) model in order to facilitate cybercrime forensics. Here, at the beginning the syntactic and semantically text corpus is extracted from the online social media text documents by means of the shallow-parsed corpus and lexico-syntactic associations. This method results in considerable speedups on real world text corpora. Subsequently, Kernel based Support Vector Machine (KSVM) classification is carried out on the results of CGSLDA in order to categorize the cybercriminal activities of the association labels of messages into cybercriminal and regular activities. CGSLDA is evaluated in accordance with two social media corpora, it reveals that proposed CGSLDA scheme drastically outperforms the Context sensitive Latent Dirichlet Allocation (CSLDA) based scheme and other schemes in terms of Precision, Recall, F-measure, Accuracy and Region Of Convergence (ROC) Curve respectively.
机译:在最近的过去,网络犯罪的数量一直在不断增加,这些犯罪为许多组织带来了可观的经济损失。当前的调查表明,网络犯罪分子很可能会通过在线社交媒体中建立的“黑暗市场”协同工作,甚至交易网络攻击工具,并且不会在协作网络犯罪活动中挖掘在线社交媒体消息的句法和语义术语。该工作的首要作用是通过为广泛使用的潜在狄利克雷特分配(CGSLDA)模型提出一种名为“崩溃的吉布斯抽样”方法的弱监督的网络犯罪网络挖掘来解决此问题,以促进网络犯罪取证。在这里,首先,借助浅层分析的语料库和词汇-句法关联从在线社交媒体文本文档中提取句法和语义上的文本语料库。这种方法导致现实世界中的文本语料库大大提高了速度。随后,对CGSLDA的结果进行基于内核的支持向量机(KSVM)分类,以便将消息的关联标签的网络犯罪活动分类为网络犯罪和常规活动。 CGSLDA是根据两个社交媒体语料库进行评估的,它揭示了拟议的CGSLDA方案在精度,召回率,F度量,准确性和收敛区域( ROC)曲线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号