HeteRank: A general similarity measure in heterogeneous information networks by integrating multi-type relationships

Zhang Mingxi; Wang Jinhua; Wang Wei


Abstract

As heterogeneous information networks become ubiquitous and increasingly complex, many data mining tasks have been explored on them, including clustering, collaborative filtering, and link prediction. Similarity computation is a fundamental task underlying many data mining problems. Although many similarity measures have been developed for heterogeneous networks, they usually depend on the network schema and lack a general way to integrate the different kinds of relationships between objects. In this paper, we propose a similarity measure, HeteRank, for computing similarities in heterogeneous information networks in a general manner. The relationships between objects of different types are represented by a general relationship matrix (GRM), which is built from the scales of the different object types. Based on the GRM, HeteRank fully integrates the multi-type relationships into similarity computation by utilizing all the meetings between objects. The HeteRank equation is further transformed into a simple binomial expression form that takes the restart probability into account. To compute HeteRank similarities efficiently, we divide the computation into two steps: the first computes intermediate values, and the second computes the similarities from those intermediate values. We then approximate the HeteRank equation by setting thresholds so that low intermediate values and low similarity scores are skipped. A pruning algorithm is developed to reduce the unnecessary visits, multiplications, and additions that contribute little to the similarity computation. Extensive experiments on real datasets demonstrate the effectiveness and efficiency of HeteRank in comparison with state-of-the-art similarity measures. (C) 2018 Elsevier Inc. All rights reserved.
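
The abstract does not reproduce the HeteRank equation or the construction of the GRM, so the following is only a rough illustration of the two-step, threshold-pruned computation pattern it describes. It swaps in a generic SimRank-style recurrence over an untyped adjacency structure; it is not the authors' formula. The function name `simrank_style_two_step`, the decay factor `c` (standing in for the restart-related constant), and the thresholds `eps_mid` and `eps_sim` are all assumptions made for this sketch.

```python
from collections import defaultdict

def simrank_style_two_step(edges, c=0.8, iterations=5, eps_mid=1e-4, eps_sim=1e-4):
    """SimRank-style similarity with a two-step update and threshold pruning.

    NOT the HeteRank equation: a generic stand-in used to show the pattern of
    (1) computing intermediate values, then (2) computing similarities from them.
    """
    # Undirected adjacency over all object types at once (hypothetical input format).
    nbrs = defaultdict(set)
    for u, v in edges:
        nbrs[u].add(v)
        nbrs[v].add(u)
    nodes = sorted(nbrs)

    # Sparse similarity store: a missing entry means similarity 0.
    sim = {a: {a: 1.0} for a in nodes}

    for _ in range(iterations):
        # Step 1: intermediate values.
        # partial[a][j] = average over neighbors i of a of sim[i][j].
        partial = {}
        for a in nodes:
            acc = defaultdict(float)
            for i in nbrs[a]:
                for j, s in sim[i].items():
                    acc[j] += s / len(nbrs[a])
            # Pruning: skip intermediate values below the threshold.
            partial[a] = {j: v for j, v in acc.items() if v >= eps_mid}

        # Step 2: similarities from intermediate values.
        # new_sim[a][b] = c * average over neighbors j of b of partial[a][j].
        new_sim = {}
        for a in nodes:
            acc = defaultdict(float)
            for j, p in partial[a].items():
                for b in nbrs[j]:
                    acc[b] += c * p / len(nbrs[b])
            # Pruning: skip similarity scores below the threshold.
            row = {b: v for b, v in acc.items() if v >= eps_sim and b != a}
            row[a] = 1.0  # every object is fully similar to itself
            new_sim[a] = row
        sim = new_sim
    return sim

# Toy author/paper/venue network (made-up data for illustration only).
edges = [("a1", "p1"), ("a2", "p1"), ("a2", "p2"), ("p1", "v1"), ("p2", "v1")]
sim = simrank_style_two_step(edges)
print(sim["a1"].get("a2", 0.0), sim["p1"].get("p2", 0.0))
```

Reusing the step-1 values across all pairs that share a neighbor is what makes the split worthwhile, and dropping small entries at both steps keeps the stored rows sparse at the cost of an approximate result.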

Bibliographic details

Source
Information Sciences: An International Journal | 2018, issue 2018 | 19 pages
Authors
Zhang Mingxi; Wang Jinhua; Wang Wei
Affiliations
Univ Shanghai Sci & Technol, Shanghai, Peoples R China;
Fudan Univ, Shanghai, Peoples R China;
Fudan Univ, Shanghai, Peoples R China
Original format
PDF
Language
English
Chinese Library Classification
Automatic information theory; computer applications; information and knowledge dissemination; automation technology, computer technology
Keywords
Similarity computation; HeteRank; Information network

Similar literature

1. HeteRank: A general similarity measure in heterogeneous information networks by integrating multi-type relationships [J]. Zhang Mingxi, Wang Jinhua, Wang Wei. Information Sciences: An International Journal. 2018
2. TSS: Temporal similarity search measure for heterogeneous information networks [J]. Nikmehr Golnaz, Salehi Mostafa, Jalili Mandi. Physica A: Statistical Mechanics and its Applications. 2019
3. A semantic-rich similarity measure in heterogeneous information networks [J]. Zhou Yu, Huang Jianbin, Li He. Knowledge-Based Systems. 2018 (Aug. 15)
4. Effectively integrating information content and structural relationship to improve the GO-based similarity measure between proteins [C]. Bo Li, Feng Luo, James Z. Wang. International Conference on Bioinformatics and Computational Biology. 2010
5. User profile relationships using a generalized string similarity metric in social networks [D]. Dabeeru, Vasavi Akhila. 2014
6. KnowSim: A Document Similarity Measure on Structured Heterogeneous Information Networks [O]. Chenguang Wang, Yangqiu Song, Haoran Li
7. Recurrent Meta-Structure for Robust Similarity Measure in Heterogeneous Information Networks [O]. Yu Zhou, Jianbin Huang, Heli Sun. 2019
