NScale: Neighborhood-centric Analytics on Large Graphs

机译：NScale：大图上以邻域为中心的分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

There is an increasing interest in executing rich and complex analysis tasks over large-scale graphs, many of which require processing and reasoning about a large number of multi-hop neighborhoods or subgraphs in the graph. Examples of such tasks include ego network analysis, motif counting in biological networks, finding social circles, personalized recommendations, link prediction, anomaly detection, analyzing influence cascades, and so on. These tasks are not well served by existing vertex-centric graph processing frameworks whose computation and execution models limit the user program to directly access the state of a single vertex, resulting in high communication, scheduling, and memory overheads in executing such tasks. Further, most existing graph processing frameworks also typically ignore the challenges in extracting the relevant portions of the graph that an analysis task is interested in, and loading it onto distributed memory. In this demonstration proposal, we describe NS_(CALE), a novel end-to-end graph processing framework that enables the distributed execution of complex neighborhood-centric analytics over large-scale graphs in the cloud. NS_(CALE) enables users to write programs at the level of neighborhoods or subgraphs. NS_(CALE) uses Apache YARN for efficient and fault-tolerant distribution of data and computation; it features GEL, a novel graph extraction and loading phase, that extracts the relevant portions of the graph and loads them into distributed memory using as few machines as possible. NS_(CALE) utilizes novel techniques for the distributed execution of user computation that minimize memory consumption by exploiting overlap among the neighborhoods of interest. A comprehensive experimental evaluation shows orders-of-magnitude improvements in performance and total cost over vertex-centric approaches.

机译：人们对在大型图上执行丰富而复杂的分析任务越来越感兴趣，其中许多任务需要处理和推理图中的大量多跳邻域或子图。此类任务的示例包括自我网络分析，生物网络中的主题计数，寻找社交圈，个性化推荐，链接预测，异常检测，分析影响级联等等。现有的以顶点为中心的图形处理框架无法很好地完成这些任务，其计算和执行模型限制了用户程序直接访问单个顶点的状态，从而导致执行此类任务时的通信，调度和内存开销较高。此外，大多数现有的图处理框架通常还忽略了以下挑战：提取分析任务感兴趣的图的相关部分并将其加载到分布式内存中。在此演示建议中，我们描述了NS_（CALE），这是一种新颖的端到端图处理框架，该框架能够在云中的大型图上分布式执行复杂的以邻域为中心的分析。 NS_（CALE）使用户可以在邻域或子图级别上编写程序。 NS_（CALE）使用Apache YARN进行数据和计算的高效且容错的分配;它具有GEL（一种新颖的图形提取和加载阶段）功能，可以提取图形的相关部分，并使用尽可能少的机器将它们加载到分布式内存中。 NS_（CALE）利用新颖的技术进行用户计算的分布式执行，该技术通过利用感兴趣的邻域之间的重叠来最大程度地减少内存消耗。全面的实验评估显示，与以顶点为中心的方法相比，性能和总成本得到了数量级的提高。

著录项

来源
《International conference on very large data bases》|2014年|1673-1676|共4页
会议地点
作者
Abdul Quamar; Amol Deshpande; Jimmy Lin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Graph analytics; Cloud computing; Ego-centric analysis; Subgraph extraction; Graph Partitioning; Data Placement; Social networks;

机译：图分析;云计算;以自我为中心的分析;子图提取;图分区数据放置;社交网络;

相似文献

外文文献
中文文献
专利

1. NScale: neighborhood-centric large-scale graph analytics in the cloud [J] . Quamar Abdul, Deshpande Amol, Lin Jimmy The VLDB journal . 2016,第2期

机译：NScale：云中以社区为中心的大规模图形分析
2. Analytical method for the identification and assay of 12 phthalates in cosmetic products: Application of the ISO 12787 international standard " Cosmetics-Analytical methods-Validation criteria for analytical results using chromatographic techniques" [J] . Gimeno P., Maggio A.-F., Bousquet C., Journal of chromatography, A: Including electrophoresis and other separation methods . 2012,第Null期

机译：化妆品中12种邻苯二甲酸酯的鉴定和分析的分析方法：ISO 12787国际标准“化妆品-分析方法-使用色谱技术的分析结果验证标准”的应用
3. Analytical Treatment of Higher-Order Graphs: A Path Ordinal Method for Solving Graphs [J] . Hala Kamal, Eusebio Bernabeu, Alicia Larena Symmetry . 2017,第11期

机译：高阶图的解析处理：图的路径序数法
4. NScale: Neighborhood-centric Analytics on Large Graphs [C] . Abdul Quamar, Amol Deshpande, Jimmy Lin International conference on very large data bases . 2014

机译：nscale：大型图形上以邻居为中心的分析
5. Two new instruments for analytical chemistry: A. Constant potential pulse polarography (CPPP) and differential CPPP (DCPPP) for determination of metals in the presence of oxygen in flowing systems; B. Versatile laser-based analytical instrument for detection of jet-cooled molecular species. [D] . Isaac, Bryan J. 1989

机译：两种用于分析化学的新仪器：A.用于在流动系统中存在氧气的情况下测定金属的恒定电势脉冲极谱法（CPPP）和差分CPPP（DCPPP）； B.基于激光的多功能分析仪器，用于检测喷射冷却的分子种类。
6. Application of 2-Trichloromethylbenzimidazole in Analytical Chemistry: A Highly Selective Chromogenic Reagent for Thin-Layer Chromatography and Some Other Analytical Uses [O] . Leszek Konopski, Anna Kiełczewska 2012

机译：2-三氯甲基苯并咪唑在分析化学中的应用：一种用于薄层色谱和其他分析用途的高选择性生色试剂
7. NScale: Neighborhood-centric Large-Scale Graph Analytics in the Cloud [O] . Quamar, Abdul, Deshpande, Amol, Lin, Jimmy 2015

机译：Nscale：云中以社区为中心的大规模图表分析

NScale: Neighborhood-centric Analytics on Large Graphs

摘要

著录项

相似文献

相关主题

期刊订阅