Estimating Descriptors for Large Graphs

机译：估计大图的描述符

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Embedding networks into a fixed dimensional feature space, while preserving its essential structural properties is a fundamental task in graph analytics. These feature vectors (graph descriptors) are used to measure the pairwise similarity between graphs. This enables applying data mining algorithms (e.g classification, clustering, or anomaly detection) on graph-structured data which have numerous applications in multiple domains. State-of-the-art algorithms for computing descriptors require the entire graph to be in memory, entailing a huge memory footprint, and thus do not scale well to increasing sizes of real-world networks. In this work, we propose streaming algorithms to efficiently approximate descriptors by estimating counts of sub-graphs of order k ≤ 4, and thereby devise extensions of two existing graph comparison paradigms: the Graphlet Kernel and NetSimile. Our algorithms require a single scan over the edge stream, have space complexity that is a fraction of the input size, and approximate embeddings via a simple sampling scheme. Our design exploits the trade-off between available memory and estimation accuracy to provide a method that works well for limited memory requirements. We perform extensive experiments on real-world networks and demonstrate that our algorithms scale well to massive graphs.

机译：将网络嵌入到固定尺寸的特征空间中，同时保留其基本的结构属性是图形分析的基本任务。这些特征向量（图形描述符）用于测量图形之间的成对相似性。这使得可以对图结构化数据应用数据挖掘算法（例如分类，聚类或异常检测），而图结构化数据在多个领域中都有大量应用。用于计算描述符的最新算法要求整个图形都在内存中，这会占用巨大的内存空间，因此无法很好地扩展以适应实际网络的规模。在这项工作中，我们提出了流算法，通过估计k≤4的子图计数来有效地近似描述符，从而设计了两个现有图比较范式的扩展：Graphlet Kernel和NetSimile。我们的算法需要对边缘流进行一次扫描，其空间复杂度仅为输入大小的一小部分，并通过简单的采样方案进行近似嵌入。我们的设计利用了可用内存和估计精度之间的权衡，以提供一种适用于有限内存需求的方法。我们在现实世界的网络上进行了广泛的实验，并证明了我们的算法可以很好地扩展到大量图形。

著录项

来源
《Pacific-Asia Conference on Knowledge Discovery and Data Mining》|2020年|779-791|共13页
会议地点
作者
Zohair Raza Hassan; Mudassir Shabbir; Imdadullah Khan; Waseem Abbas;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Graph descriptor; Edge stream; Graph classification;

机译：图描述符;边缘流;图分类;

相似文献

外文文献
中文文献
专利

1. Using field topographic descriptors to estimate soil water retention [J] . Rawls WJ, Pachepsky YA Soil Science . 2002,第7期

机译：使用野外地形描述符估算土壤保水量
2. Component evolution analysis in descriptor graphs for descriptor ranking [J] . Levente Kovács, Anita Keszler, Tamás Szirányi Digital Signal Processing . 2014,第Null期

机译：描述符图中的成分演化分析用于描述符排名
3. Alternative methods for estimating common descriptors for QSAR studies of dyes and fluorescent probes using molecular modeling software. 2. Correlations between log P and the hydrophilic/lipophilic index, and new methods for estimating degrees of amphiphilicity [J] . Dapson Richard W., Horobin Richard W. Biotechnic and Histochemistry . 2013,第1a8期

机译：使用分子建模软件来估计染料和荧光探针的QSAR研究通用描述符的替代方法。 2. log P与亲水/亲脂指数之间的相关性，以及估计两亲性的新方法
4. The Appearance of the Giant Component in Descriptor Graphs and Its Application for Descriptor Selection [C] . Anita Keszler, Levente Kovacs, Tamas Sziranyi International conference of the CLEF initiative . 2012

机译：描述符图中巨分量的出现及其在描述符选择中的应用。
5. A novel multistage image registration technique with graph-based region descriptors. [D] . Bowen, Francis. 2013

机译：一种新颖的具有基于图的区域描述符的多级图像配准技术。
6. Estimating the Instantaneous Screw Axis and the Screw Axis Invariant Descriptor of Motion by Means of Inertial Sensors: An Experimental Study with a Mechanical Hinge Joint and Comparison to the Optoelectronic System [O] . Andrea Ancillao, Maxim Vochten, Erwin Aertbeliën, 2020

机译：借助惯性传感器估算运动的瞬时丝杠轴和丝杠轴不变描述子：带有机械铰链接头的实验研究以及与光电系统的比较
7. Estimating Descriptors for Large Graphs [O] . Zohair Raza Hassan, Mudassir Shabbir, Imdadullah Khan, 2020

机译：估计大图表的描述符

Estimating Descriptors for Large Graphs

摘要

著录项

相似文献

相关主题

期刊订阅