RASMA: a reverse search algorithm for mining maximal frequent subgraphs

Saeed Salem; Mohammed Alokshiya; Mohammad Al Hasan

首页> 外文期刊>BioData Mining >RASMA: a reverse search algorithm for mining maximal frequent subgraphs

【24h】

RASMA: a reverse search algorithm for mining maximal frequent subgraphs

机译：RASMA：用于采矿最大频繁子图的反向搜索算法

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Given a collection of coexpression networks over a set of genes, identifying subnetworks that appear frequently is an important research problem known as mining frequent subgraphs. Maximal frequent subgraphs are a representative set of frequent subgraphs; A frequent subgraph is maximal if it does not have a super-graph that is frequent. In the bioinformatics discipline, methodologies for mining frequent and/or maximal frequent subgraphs can be used to discover interesting network motifs that elucidate complex interactions among genes, reflected through the edges of the frequent subnetworks. Further study of frequent coexpression subnetworks enhances the discovery of biological modules and biological signatures for gene expression and disease classification. We propose a reverse search algorithm, called RASMA, for mining frequent and maximal frequent subgraphs in a given collection of graphs. A key innovation in RASMA is a connected subgraph enumerator that uses a reverse-search strategy to enumerate connected subgraphs of an undirected graph. Using this enumeration strategy, RASMA obtains all maximal frequent subgraphs very efficiently. To overcome the computationally prohibitive task of enumerating all frequent subgraphs while mining for the maximal frequent subgraphs, RASMA employs several pruning strategies that substantially improve its overall runtime performance. Experimental results show that on large gene coexpression networks, the proposed algorithm efficiently mines biologically relevant maximal frequent subgraphs. Extracting recurrent gene coexpression subnetworks from multiple gene expression experiments enables the discovery of functional modules and subnetwork biomarkers. We have proposed a reverse search algorithm for mining maximal frequent subnetworks. Enrichment analysis of the extracted maximal frequent subnetworks reveals that subnetworks that are frequent are highly enriched with known biological ontologies.

机译：鉴于一组基因上的一系列共表达网络，识别出频繁出现的子网是一个重要的研究问题，称为挖掘频繁子图。最大频繁的子图是一种代表性的频繁子图;如果它没有频繁的超级图，则频繁的子图是最大的。在生物信息学学科中，频繁和/或最大频繁子图的采矿方法可用于发现通过频繁子网边缘反射的基因之间的复杂相互作用的有趣网络图案。进一步研究频繁的共抑制子网增强了生物模块的发现和基因表达和疾病分类的生物签名。我们提出了一种反向搜索算法，称为RASMA，用于在给定的图形集合中挖掘频繁和最大频繁的子图。 RASMA的关键创新是一个连接的子图枚举器，它使用反向搜索策略来列举无向图的连接子图。使用此枚举策略，RASMA非常有效地获得所有最大频繁子图。为了克服枚举所有频繁子图的计算禁止任务，同时挖掘最大频繁子图，RASMA采用了几种大幅提高其整体运行时性能的修剪策略。实验结果表明，在大型基因共存网络上，所提出的算法有效地挖掘生物学相关的最大频繁子图。从多基因表达实验中提取复发基因共抑制子网可以发现功能模块和子网生物标志物。我们提出了一种用于挖掘最大频繁子网的反向搜索算法。提取的最大频繁子网的浓缩分析表明，频繁的子网是高度丰富的已知生物本体。

著录项

来源
《BioData Mining》 |2021年第1期|共23页
作者
Saeed Salem; Mohammed Alokshiya; Mohammad Al Hasan;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类生物科学;
关键词
Biological networksSubgraph enumerationFrequent subgraphsMaximal subgraphsReverse search;

机译：生物网络库枚举罚款小型子画面亚太山脉浏览器搜索;

相似文献

外文文献
中文文献
专利

1. Uncertain maximal frequent subgraph mining algorithm based on adjacency matrix and weight [J] . Di Wu, Jiadong Ren, Long Sheng International journal of machine learning and cybernetics . 2018,第9期

机译：基于邻接矩阵和权重的不确定最大频繁子图挖掘算法
2. EFFICIENT SUBSET-LATTICE ALGORITHMS FOR MINING CLOSED FREQUENT ITEMSETS AND MAXIMAL FREQUENT ITEMSETS IN DATA STREAMS [J] . Ye-In Chang, Chia-En Li, Wei-Hau Peng, International Journal of Electrical Engineering: Transactions of the Chinese Institute of Engineers, Series E . 2013,第2期

机译：高效的子格算法，用于挖掘数据流中的封闭频率项和最大频率项
3. EFFICIENT SOFTWARE FAULT LOCALIZATION BY HIERARCHICAL INSTRUMENTATION AND MAXIMAL FREQUENT SUBGRAPH MINING [J] . Jiadong Ren, Huifang Wang, Yue Ma, International Journal of Innovative Computing Information and Control . 2015,第6期

机译：通过分层仪器和最大频率子图挖掘进行有效的软件故障定位
4. A parallel algorithm for mining maximal frequent subgraphs [C] . Eihab El Radie, Saeed Salem IEEE International Conference on Bioinformatics and Biomedicine . 2017

机译：挖掘最大频繁子图的并行算法
5. Adaptation of Frequent Subgraph Mining Algorithms to Noncoding RNA Topology Alignment and Function Prediction [D] . Liu, Muyi. 2017

机译：频繁的子图挖掘算法适应非编码RNA拓扑比对和功能预测
6. RASMA: a reverse search algorithm for mining maximal frequent subgraphs [O] . Saeed Salem, Mohammed Alokshiya, Mohammad Al Hasan 2021

机译：RASMA：用于采矿最大频繁子图的反向搜索算法
7. MFC: Mining Maximal Frequent Dense Subgraphs without Candidate Maintenance in Imbalanced PPI Networks [O] . Miao Wang, Xuequn Shang, Zhanhuai Li 2014

机译：mFC：在不平衡ppI网络中挖掘没有候选维护的最大频繁密集子图
8. O(/V/+/E/) Algorithm for finding an Edge-Maximal Subgraph with a TR-Formative Coloring [R] . Balas, E. 1985

机译：用于形成TR形成着色的边 - 极大子图的O（/ V / + / E /）算法

RASMA: a reverse search algorithm for mining maximal frequent subgraphs

摘要

著录项

相似文献

相关主题

期刊订阅