...
首页> 外文期刊>Neural computing & applications >Feature selection generating directed rough-spanning tree for crime pattern analysis
【24h】

Feature selection generating directed rough-spanning tree for crime pattern analysis

机译:特征选择生成针对犯罪模式分析的定向粗糙生成树

获取原文
获取原文并翻译 | 示例
           

摘要

Nowadays, crime is a major threat to the society that affects the normal life of human beings all over the world. It is very important to make the world free from all aspects of crime activities. The main motivation of this work is to understand various crime patterns for avoiding and preventing the crime events to occur in future and save the world from such curse. Though research is going on for solving such problems, no work is noticed to handle the roughness or ambiguity that exists in the crime reports. The present work extracts all possible crime features from the crime reports and selects only the important features required for crime pattern analysis. For this purpose, it develops a purely supervised feature selection model integrating rough set theory and graph theory (spanning tree of a directed weighted graph). The crime reports are preprocessed, and crime features are extracted to represent each report as a feature vector (i.e., a set of distinct crime features). For crime pattern analysis, the main objective of our work, all extracted features are not necessarily essential, rather a minimal subset of relevant features are sufficient. Thus, feature selection is the main contribution in the paper that not only enhances the efficiency of subsequent mining process but also increases its correctness. The rough set theory-based relative indiscernibility relation is defined to measure the similarity between two features relative to the crime type. Based on the similarity score, a weighted and directed graph has been constructed that comprises the features as nodes and the inverse of the similarity score representing the similarity of featurevtouas the weight of the corresponding edge. Then, a minimal spanning tree (termed as rough-spanning tree) is generated using Edmond/Chu-Liu algorithm from the constructed directed graph and the importance of the nodes in the spanning tree is measured using the weights of the edges and the degrees (in-degrees and out-degrees) of the nodes in the spanning tree. Finally, a feature selection algorithm has been proposed that selects the most important node and remove it from the spanning tree iteratively until the modified graph (not necessarily a tree) becomes a null graph. The selected nodes are considered as the important feature subset sufficient for crime pattern analysis. The method is evaluated using various statistical measures and compared with related state-of-the-art methods to express its effectiveness in crime pattern analysis. The Wilcoxon rank-sum test, a popular nonparametric version of the two-samplettest, is done to express that the proposed supervised model is statistically significant.
机译:如今,犯罪是对社会的重大威胁,影响世界各地人类的正常生活。使世界摆脱犯罪活动的各个方面非常重要。这项工作的主要动力是了解避免和预防未来犯罪事件的各种犯罪模式,并将世界拯救出世界。虽然研究正在进行解决此类问题,但没有注意到犯罪报告中存在的粗糙度或歧义的工作。本工作提取犯罪报告中的所有可能犯罪特征,并仅选择犯罪模式分析所需的重要特征。为此目的,它开发了集成粗糙集理论和图论的纯粹监督特征选择模型(指向加权图的生成树)。犯罪报告是预处理的,提取犯罪特征以将每份报告代表为特征向量(即,一组不同的犯罪特征)。对于犯罪模式分析,我们工作的主要目标,所有提取的特征都不一定必不可少,而是最小的相关特征子集足够。因此,特征选择是本文中的主要贡献,不仅提高了后续采矿过程的效率,而且增加了其正确性。粗糙集理论的基于理论的相对难以辨别关系定义为测量相对于犯罪类型的两个特征之间的相似性。基于相似性分数,已经构建了一种加权和定向图,其包括节点作为节点的特征和表示特征vtouas的相似性的相似度分数的相反。然后,使用来自构造的指向图的Edmond / Chu-Liu算法生成最小生成树(称为粗糙度树),并且使用边缘的权重和度数(生成树中的节点的程度和高度。最后,已经提出了一种特征选择算法,其选择最重要的节点并迭代地从生成树中删除它,直到修改的图形(不一定是树)变为空图。所选节点被认为是足以用于犯罪模式分析的重要特征子集。通过各种统计测量评估该方法,并与相关最新方法进行比较,以表达其犯罪模式分析的有效性。 Wilcoxon Rank-Sum测试,一个流行的非参数版本的双Samplettest,是为了表达所提出的监督模型是统计上显着的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号