...
首页> 外文期刊>Applied Soft Computing >Rough set based incremental crime report labelling in dynamic environment
【24h】

Rough set based incremental crime report labelling in dynamic environment

机译:基于粗糙的集合犯罪报告标记在动态环境中

获取原文
获取原文并翻译 | 示例
           

摘要

The proposed work describes a rough set based incremental crime report labelling technique. The named entities are recognized from the available crime reports to identify the phrases existing between entity pairs. The phrases are vectorized considering intervening context words and a quick reduct generation algorithm is applied to minimize their dimensions. A graph based clustering algorithm has been applied to cluster the phrases and later they have been labelled based on some centrality measure techniques. Each current report, represented by a set of phrases, is labelled by the labels of clusters in which its phrases reside. Thus, a current report may be labelled as a multi-class report. Each new report is labelled incrementally using existing labelled clusters with the help of rough set theory. The phrases of the new report are partitioned into equivalence classes using indiscernibility relation and the lower approximations of the clusters of phrases are obtained considering these equivalence classes as target sets. If the lower approximation of a cluster contains all the phrases of the new report, then the report is labelled by the label of that cluster; otherwise, probability based membership values have been assigned to the report to represent the degree of its belongingness into different classes. The existing clusters and equivalence classes of phrases generate a new set of labelled clusters, which are further used for labelling upcoming reports. Thus, the proposed method labels crime reports in an incremental way. The work is validated by various indices and compared with many state-of-the-art clustering and classification algorithms. Experimental results show the effectiveness of the method in the crime report labelling. (C) 2019 Elsevier B.V. All rights reserved.
机译:所提出的工作描述了一种基于粗糙的增量犯罪报告标签技术。命名实体从可用的犯罪报告中识别,以确定实体对之间存在的短语。认为,考虑干预上下文单词的短语,并应用快速减换生成算法以最小化其尺寸。基于图形的聚类算法已应用于群集短语,并以稍后的标记基于一些中心测量技术标记。由一组短语表示的每个当前报告由其短语驻留的群集标签标记。因此,目前的报告可以标记为多级报告。在粗糙集理论的帮助下,使用现有标记的群集逐步标记每个新报告。使用毫无疑问的关系将新报告的短语分为等效类,并且将这些等效类作为目标集的这些等价类获得了较低的短语簇的较低近似。如果群集的较低近似包含新报告的所有短语,则该群集的标签标记报告;否则,已将基于概率的成员资格值分配给报告以将其归属度的程度代表到不同的类中。现有的群集和等效类别的短语类生成一组新的标记群集,这进一步用于标记即将推出的报告。因此,提出的方法以增量方式标记犯罪报告。该工作由各种指标验证,并与许多最先进的聚类和分类算法进行比较。实验结果表明该方法在犯罪报告标签中的有效性。 (c)2019年Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号