首页> 外文会议>Asia Pacific Web and Web-Age Information Management >Meta Paths and Meta Structures: Analysing Large Heterogeneous Information Networks
【24h】

Meta Paths and Meta Structures: Analysing Large Heterogeneous Information Networks

机译:元路径和元结构:分析大型异构信息网络

获取原文

摘要

A heterogeneous information network (HIN) is a graph model in which objects and edges are annotated with types. Large and complex databases, such as YAGO and DBLP, can be modeled as HINs. A fundamental problem in HINs is the computation of closeness, or relevance, between two HIN objects. Relevance measures, such as PCRW, PathSim, and HeteSim, can be used in various applications, including information retrieval, entity resolution, and product recommendation. These metrics are based on the use of meta-paths, essentially a sequence of node classes and edge types between two nodes in a HIN. In this tutorial, we will give a detailed review of meta-paths, as well as how they are used to define relevance. In a large and complex HIN, retrieving meta paths manually can be complex, expensive, and error-prone. Hence, we will explore systematic methods for finding meta paths. In particular, we will study a solution based on the Query-by-Example (QBE) paradigm, which allows us to discover meta-paths in an effective and efficient manner. We further generalise the notion of meta path to "meta structure", which is a directed acyclic graph of object types with edge types connecting them. Meta structure, which is more expressive than the meta path, can describe complex relationship between two HIN objects (e.g., two papers in DBLP share the same authors and topics). We will discuss three relevance measures based on meta structure. Due to the computational complexity of these measures, we also study an algorithm with data structures proposed to support their evaluation. Finally, we will examine solutions for performing query recommendation based on metapaths. We will also discuss future research directions.
机译:异构信息网络(HIN)是一个图形模型,其中对象和边缘被用类型注释。大型和复杂的数据库,如yago和dblp,可以被建模为帖子。关环中的一个基本问题是两个HIN物体之间的近距离或相关性的计算。相关性措施,例如PCRW,Pathsim和Hetesim,可用于各种应用,包括信息检索,实体分辨率和产品推荐。这些指标基于使用元路径的使用,基本上是HIN中的两个节点之间的节点类和边缘类型的序列。在本教程中,我们将详细审查元路径,以及它们如何用于定义相关性。在一个大而复杂的HIN中,手动检索元路径可以复杂,昂贵和容易出错。因此,我们将探讨找到元路径的系统方法。特别是,我们将基于逐个示例(Qbe)范式的解决方案来研究一个解决方案,这使我们能够以有效且有效的方式发现元路径。我们进一步概括了元路径的概念到“元结构”,这是具有连接它们的边缘类型的对象类型的定向非循环图。比元路径更具表现力的元结构可以描述两个HIN物体之间的复杂关系(例如,DBLP中的两篇论文共享同一作者和主题)。我们将讨论基于元结构的三个相关措施。由于这些措施的计算复杂性,我们还研究了提出数据结构的算法,提出支持其评估。最后,我们将研究基于Metapaths执行查询推荐的解决方案。我们还将讨论未来的研究方向。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号