...
首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Fast Algorithms for Semantic Association Search and Pattern Mining
【24h】

Fast Algorithms for Semantic Association Search and Pattern Mining

机译:语义关联搜索和模式挖掘的快速算法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Given a large graph representing relations between entities, searching for complex relationships (called semantic associations, or SAs for short) between a set of entities is a common type of information needs in many domains. Further, numerous SAs are often abstracted into a few frequent high-level conceptual graph patterns (called SA patterns, or SAPs for short), which organize SAs into interpretable subgroups. Whereas the quality and usefulness of SAs and SAPs have been extensively studied in the literature, in this article we aim to develop faster algorithms for SA search and frequent SAP mining. For the former problem, we leverage distances to prune the search space, and implement a distance oracle to balance the time and space for distance calculation. For the latter problem, we exploit both graph structure and labels to induce fine-grained skeleton-based partitions of SAs, which may be pruned to reduce SAP enumeration. Besides, we generate canonical codes for SAs, which not only enable result deduplication but also are reused in SAP mining to improve the overall performance. We extensively evaluate the efficiency of our algorithms on four large graphs, using both random queries and simulated queries which reproduce the extreme case of finding numerous SAs.
机译:给定代表实体之间的关系的大图,在一组实体之间搜索复杂的关系(称为语义关联或SAS的SAS,是许多域中的常用信息类型。此外,许多SAS通常被抽象成一些频繁的高级概念图模式(称为SA模式,或短暂的SAP),其将SAS组织成可解释的子组。虽然SAS和SAP的质量和有用性在文献中进行了广泛研究,但在本文中,我们的目标是开发更快的SA搜索和频繁SAP采矿的算法。对于前一个问题,我们利用距离来修剪搜索空间,并实现距离oracle以平衡距离计算的时间和空间。对于后一种问题,我们利用图形结构和标签来诱导SAS的细粒度基于骨架的分区,这可能被修剪以减少SAP枚举。此外,我们为SAS生成规范代码,这不仅能够使结果重复数据删除,而且在SAP挖掘中重复使用以提高整体性能。我们通过随机查询和模拟查询广泛地评估了四个大图中的算法的效率,这些查询可重现查找众多SA的极端情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号