...
首页> 外文期刊>Knowledge-Based Systems >A biology-inspired, data mining framework for extracting patterns in sexual cyberbullying data
【24h】

A biology-inspired, data mining framework for extracting patterns in sexual cyberbullying data

机译:生物启发的数据挖掘框架,用于提取性网络欺凌数据中的模式

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

With the rapid growth of social media, users, especially adolescents, are spending significant amount of time on various social networking sites to connect with others, to share information, and to pursue common interests. However, as social networking has become widespread, certain people are finding illegal and unethical ways to use these communities as means for opening the door of inappropriate online activities. Thus, they are providing an open way for cybercrimes such as cyberbullying. In this paper, we deal with the aforementioned issue as a time series modelling methodology, aiming at the recognition of bullying patterns within the questions posed by a predator to his victims. Given a set of real world transcripts (i.e. the whole set of predator's questions), in which each question is numerically labelled in terms of severity, we first model each set of predator's questions as a time series. The next step is the main contribution of this paper, in terms of changing the representation scheme from time series data into symbolic representation. More specifically, inspired by the Multiple Sequence Alignment (MSA) method, commonly used in computational biology for identifying conserved regions of similarity among raw molecular data, we represent the set of signals according to a SAX (Symbolic Aggregate approXimation) symbolic representation, transforming each signal into a symbol string. The main rationale behind this adoption lies to the fact that the collected cyberbullying data can be converted to string sequences via SAX conversion, which in turn can be aligned, thus revealing conserved temporal patterns or slight variations in the attacking strategies of the predators. Experimental results, based on the clustering improvement of the aforementioned data using the extracted patterns instead of the time series data, justify our claims. (C) 2015 Elsevier B.V. All rights reserved.
机译:随着社交媒体的迅速发展,用户(尤其是青少年)正在各种社交网站上花费大量时间与他人建立联系,共享信息并追求共同的兴趣。但是,随着社交网络的普及,某些人正在寻找非法和不道德的方式来使用这些社区作为打开不适当的在线活动之门的手段。因此,它们为诸如网络欺凌之类的网络犯罪提供了开放的方式。在本文中,我们将上述问题作为时间序列建模方法进行处理,旨在识别捕食者对其受害者提出的问题内的欺凌模式。给定一组现实世界的成绩单(即整套捕食者问题),其中每个问题都按照严重程度进行数字标记,我们首先将每组捕食者问题建模为一个时间序列。下一步是将表示方案从时间序列数据更改为符号表示,这是本文的主要贡献。更具体地说,受多序列比对(MSA)方法的启发,该方法通常用于计算生物学中,用于识别原始分子数据之间的保守性相似区域,我们根据SAX(符号集合近似)符号表示来表示信号集,并对每个信号进行转换信号成符号串。这种采用背后的主要原理是,可以通过SAX转换将收集到的网络欺凌数据转换为字符串序列,进而可以进行对齐,从而揭示出保守的时间模式或捕食者的攻击策略略有变化。基于使用提取的模式而不是时间序列数据对上述数据进行聚类改进的实验结果证明了我们的主张。 (C)2015 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号