首页> 外文会议>International Conference on Enterprise Information Systems >INFORMED k-MEANS: A CLUSTERING PROCESS BIASED BY PRIOR KNOWLEDGE A case study in the dactyloscopic domain
【24h】

INFORMED k-MEANS: A CLUSTERING PROCESS BIASED BY PRIOR KNOWLEDGE A case study in the dactyloscopic domain

机译:知情K-Means:通过先前知识偏置的聚类过程是在Dactyloscopic结构域中的案例研究

获取原文

摘要

Knowledge Discovery in Databases (KDD) is the process by which unknown and useful knowledge and information are extracted, by automatic or semi-automatic methods, from large amounts of data. Along the evolution of Information Technology and the rapid growth in the number and size of databases, the development of methodologies, techniques, and tools for data mining has become a major concern for researchers, and has led, in turn, to the development of applications in a variety of areas of human activity. About 1997, the processes and techniques associated with cluster analysis had begun to be researched with increasing intensity by the KDD community. Within the context of a model intended to support decisions based on cluster analysis, prior knowledge about the data structure and the application domain can be used as important constraints that lead to better results in the clusters' configurations. This paper presents an application of cluster analysis in the area of public safety using a schema that takes into account the burden of prior knowledge acquired from statistical analysis on the data. Such an information was used as a bias for the k-means algorithm that was applied to identify the dactyloscopic (fingerprint) profile of criminals in the Brazilian capital, also known as Federal District. These results was then compared with a similar analysis that disregarded the prior knowledge. It is possible to observe that the analysis using prior knowledge generated clusters that are more coherent with the expert knowledge.
机译:数据库中的知识发现(KDD)是通过大量数据通过自动或半自动方法提取未知和有用的知识和信息的过程。沿着信息技术的演变和数据库的数量和大小的快速增长,数据挖掘的方法,技术和工具的开发已成为研究人员的主要关注点,并转过身来发展应用程序在各种人类活动领域。大约1997年,与集群分析相关的过程和技术已开始随着KDD社区的增加而研究。在旨在基于集群分析的基于群集分析的决策的模型的上下文中,关于数据结构和应用程序域的先验知识可以用作重要的约束,从而导致群集配置更好的结果。本文介绍了使用架构在公共安全领域进行集群分析,考虑到从数据统计分析中获取的先验知识的负担。这样的信息被用作K-Means算法的偏置,用于识别巴西资本中的犯罪分子的Dayyloscopic(指纹)轮廓,也称为联邦区。然后将这些结果与类似的分析进行比较,以忽视先前的知识。可以观察到使用现有知识产生的分析,这些集群与专家知识更加连贯。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号