Subgroup discovery on Big Data: exhaustive methodologies using Map-Reduce

机译：大数据的子组发现：使用地图减少的详尽方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Subgroup Discovery is a flexible supervised local pattern mining method whose aim is to discover interesting subgroups with respect to one property of interest. Although many efficient algorithms have been developed in this field, the growing interest in data storage has provoked that the datasets are larger and larger hampering their performance. In this paper, two new algorithms to discover subgroups on Big Data have been proposed. In this regard, the MapReduce paradigm has been considered and in concrete Apache Spark was used to face up the Big Data requirements. The experimental study considers more than 40 high dimensional datasets and a set of efficient algorithms on the subgroup discovery field. Search spaces bigger than 3.3·10~(13) available subgroups are used. The experimental analysis demonstrates that the proposed algorithms obtain excellent results in efficiency, demonstrating the usefulness of using Apache Spark in the field.

机译：亚组发现是一种灵活的监督本地模式挖掘方法，其目的是发现关于感兴趣的一个属性的有趣子组。虽然在该领域开发了许多有效的算法，但是在数据存储中越来越感兴趣地激发了数据集更大，更大妨碍了它们的性能。在本文中，已经提出了两个用于在大数据上发现子组的新算法。在这方面，MapReduce范式已被考虑，并且在混凝土中，使用Apache Spark用于面对大数据要求。实验研究考虑了40多维数据集和亚组发现场上的一组高效算法。使用比3.3·10〜（13）的搜索空间使用可用子组。实验分析表明，所提出的算法的效率优异地获得了优异的结果，展示了在现场中使用Apache火花的有用性。

著录项

来源
《IEEE International Conference on Trust, Security and Privacy in Computing and Communications;IEEE International Conference on Big Data Science and Engineering;IEEE International Symposium on Parallel and Distributed Processing with Applications》|2016年|1516-2272p|共8页
会议地点
作者
F. Padillo; J. M. Luna; S. Ventura;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP393.08-53;
关键词

相似文献

外文文献
中文文献
专利

1. Induction of comprehensible models for gene expression datasets by subgroup discovery methodology. [J] . Gamberger D, Lavrac N, Zelezny F, Journal of biomedical informatics. . 2004,第4期

机译：通过亚组发现方法归纳出用于基因表达数据集的可理解模型。
2. Description-oriented community detection using exhaustive subgroup discovery [J] . Atzmueller Martin, Doerfel Stephan, Mitzlaff Folke Information Sciences: An International Journal . 2016,第Null期

机译：使用详尽的亚组发现进行面向描述的社区检测
3. Fast exhaustive subgroup discovery with numerical target concepts [J] . Lemmerich Florian, Atzmueller Martin, Puppe Frank Data mining and knowledge discovery . 2016,第3期

机译：利用数值目标概念快速穷举发现亚组
4. Subgroup Discovery on Big Data: Exhaustive Methodologies Using Map-Reduce [C] . F. Padillo, J.M. Luna, S. Ventura IEEE International Conference on Big Data Science and Engineering;IEEE International Conference on Trust, Security and Privacy in Computing and Communications;IEEE International Symposium on Parallel and Distributed Processing with Applications . 2016

机译：大数据的子组发现：使用Map-Reduce的详尽方法
5. A Methodology for Rapid Deployment of Shared Cohort Discovery System with Integrated EHR to Enable Federated Health Data Governance [D] . Khan, Muhammad Asim 2018

机译：快速部署具有集成电子病历的共享队列发现系统以实现联合健康数据治理的方法
6. Q-Finder: An Algorithm for Credible Subgroup Discovery in Clinical Data Analysis — An Application to the International Diabetes Management Practice Study [O] . Cyril Esnault, May-Line Gadonna, Maxence Queyrel, 2020

机译：Q-Finder：临床数据分析中可信子群发现的一种算法 - 应用于国际糖尿病管理实践研究的应用
7. Induction of comprehensible models for gene expression datasets by subgroup discovery methodology [O] . Gamberger Dragan, Lavrač Nada, Železný Filip, 2004

机译：亚组发现方法归纳基因表达数据集的可理解模型

Subgroup discovery on Big Data: exhaustive methodologies using Map-Reduce

摘要

著录项

相似文献

相关主题

期刊订阅