大数据下基于异步累积更新的高效P-Rank计算方法

王旭丛; 李翠平; 陈红

首页> 中文期刊> 《软件学报》 >大数据下基于异步累积更新的高效P-Rank计算方法

大数据下基于异步累积更新的高效P-Rank计算方法

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

P-Rank是SimRank的扩展形式,也是一种相似度度量方法,被用来计算网络中任意两个结点的相似性。不同于SimRank只考虑结点的入度信息,P-Rank还加入了结点的出度信息,从而更加客观准确地评价结点间的相似程度。随着大数据时代的到来,P-Rank需要处理的数据日益增大。使用MapReduce等分布式模型实现大规模P-Rank迭代计算的方法,本质上是一种同步迭代方法,不可避免地具有同步迭代方法的缺点：迭代时间(尤其是迭代过程中处理器等待的时间)长,计算速度慢,因此效率低下。为了解决这一问题,采用了一种迭代计算方法--异步累积更新算法。这个算法实现了异步计算,减少了计算过程处理器结点的等待时间,提高了计算速度,节省了时间开销。从异步的角度实现了P-Rank算法,将异步累积更新算法应用在了P-Rank上,并进行了对比实验。实验结果表明该算法有效地提高了计算收敛速度。%P-Rank enriches the traditional similarity measure, SimRank. It is also a method to measure the similarity between two objects in graph model. Different from SimRank which only considers the in-link information, P-Rank also takes the out-link information into consideration. Consequently, P-Rank could effectively and comprehensively measure“how similar two nodes are”. P-Rank is applied widely in graph mining. With the arrival of big-data era, the data scale which P-Rank processes is increasing. The existing methods which implement P-Rank, such as the MapReduce model, are essentially synchronous iterative methods. These methods have some shortcomings in common: the iterative time, especially the waiting time of processors during iterative computing, is long, thus leading to very low efficiency. To solve this problem, this paper uses a new iterative method-the Asynchronous Accumulative Update method. Different from the traditional synchronous methods, this method successfully implementes asynchronous computations and as a result reduces the waiting time of processors during computing. This paper implements P-Rank using the asynchronous accumulative update method, and the experiment results indicate that this method can effectively improve the computation speed.

著录项

来源
《软件学报》 |2014年第9期|2136-2148|共13页
作者
王旭丛; 李翠平; 陈红;
展开▼
作者单位

中国人民大学信息学院计算机系;

北京 100872;

中国人民大学信息学院数据仓库与商务智能实验室;

北京 100872;

中国人民大学信息学院数据仓库与商务智能实验室;

北京 100872;

展开▼
原文格式 PDF
正文语种 chi
中图分类程序设计、软件工程;
关键词
异步累积更新; 大数据; 相似度; P-Rank; 大规模计算;

相似文献

中文文献
外文文献
专利

1. Spark环境下基于子图的异步迭代更新方法 [J] . 李超 ,董新华 ,陈建峡 . 计算机工程与应用 . 2020,第007期
2. 基于大数据背景下的城市更新节点设计方法研究 [J] . 文巍 ,王晶 ,李春研 . 河南建材 . 2017,第001期
3. 大数据视角下名录库更新维护——基于互联网异源异构数据整合的探讨 [J] . 傅德印 ,黄恒君 ,陶然 . 统计研究 . 2015,第001期
4. 基于精简四阶累积量MUSIC与混合遗传算法的笼型异步电动机转子断条故障检测新方法 [J] . 许伯强 ,朱明飞 . 电机与控制应用 . 2016,第007期
5. 基于SVD滤波技术与快速四阶累积量ESPRIT算法的异步电动机转子断条故障检测新方法 [J] . 孙丽玲 ,王续 ,许伯强 . 电工技术学报 . 2015,第010期
6. 有机更新背景下老工业区集约高效利用模式及规划研究——以沈阳老工业区用地更新为例 [C] . 于路 ,王丽丹 ,张年国 . 2017中国城市规划年会 . 2017
7. 开放大数据支持下的城市更新改造潜力评价研究——基于城市功能完善度驱动力 [A] . 王景丽 . 2017

大数据下基于异步累积更新的高效P-Rank计算方法

摘要

著录项

相似文献

相关主题

期刊订阅