Ensemble-Based Relationship Discovery in Relational Databases

机译：基于组合的关系数据库关系发现

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We performed an investigation of how several data relationship discovery algorithms can be combined to improve performance. We investigated eight relationship discovery algorithms like Cosine similarity, Soundex similarity, Name similarity, Value range similarity, etc., to identify potential links between database tables in different ways using different categories of database information. We proposed voting system and hierarchical clustering ensemble methods to reduce the generalization error of each algorithm. Voting scheme uses a given weighting metric to combine the predictions of each algorithm. Hierarchical clustering groups predictions into clusters based on similarities and then combine a member from each cluster together. We run experiments to validate the performance of each algorithm and compare performance with our ensemble methods and the state-of-the-art algorithms (FaskFK, Randomness and HoPF) using Precision, Recall and F-Measure evaluation metrics over TPCH and AdvWork datasets. Results show that performance of each algorithm is limited, indicating the importance of combining them to consolidate their strengths.

机译：我们对如何将几个数据关系发现算法组合以改善性能进行调查。我们调查了八个关系发现算法，如余弦相似度，Soundex相似性，名称相似度，值范围相似度等，以使用不同类别的数据库信息以不同的方式识别数据库表之间的潜在链接。我们提出了投票系统和分层群集集合方法，以减少每种算法的泛化误差。投票方案使用给定的加权度量来组合每种算法的预测。分层群集组基于相似性进入群集，然后将来自每个集群的成员组合在一起。我们运行实验以验证每种算法的性能，并使用Precion，Recall和F-Measure评估指标与我们的集合方法和最先进的算法（FASKFK，随机性和HOPF）进行比较，并使用TPCH和Advwork数据集进行比较。结果表明，每种算法的性能都是有限的，表明将它们组合巩固其优势的重要性。

著录项

来源
《SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence》|2020年|286-300|共15页
会议地点
作者
Akinola Ogunsemi; John McCall; Mathias Kern; Benjamin Lacroix; David Corsar; Gilbert Owusu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Semantic relationship; Primary/Foreign key relationship; Data discovery; Database management; Ensemble-based discovery;

机译：语义关系;主要/外国关键关系;数据发现;数据库管理;基于合奏的发现;
入库时间 2022-08-26 13:58:22

相似文献

外文文献
中文文献
专利

1. SQL or Third Manifesto Compliant Object-Relational Database Management Systems as the Platforms for Maintaining the Whole-Part Relationships in a Database [J] . ERKI EESSAAR WSEAS Transactions on Computers . 2006,第10期

机译：SQL或符合第三宣言的对象关系数据库管理系统，作为维护数据库中整体关系的平台
2. Design Methodology for Relational Databases : Issues Related to Ternary Relationships in Entitiy-Relationship Model and Higher Normal Forms [J] . Vimala, H Khanna Nehemiah, R S Bhuvaneswaran, International Journal of Database Management Systems . 2013,第3期

机译：关系数据库的设计方法论：实体关系模型和高范式中的三元关系问题
3. Estimating Null Values In Relational Database Systems Having Negative Dependency Relationships Between Attributes [J] . SHYI-MING CHEN, SHU-TING CHANG Cybernetics and Systems . 2009,第2期

机译：在属性之间具有负相关关系的关系数据库系统中估计空值
4. Ensemble-Based Relationship Discovery in Relational Databases [C] . Akinola Ogunsemi, John McCall, Mathias Kern, SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence . 2020

机译：基于组合的关系数据库关系发现
5. Semi-Automatic Discovery of Meaningful Ontology from a Relational Database [D] . Witherspoon, David B. 2011

机译：从关系数据库中半自动发现有意义的本体
6. A Relational Database for the Discovery of Genes Encoding AminoAcid Biosynthetic Enzymes in Pathogenic Fungi [O] . Peter F. Giles, Darren M. Soanes, Nicholas J. Talbot 2003

机译：用于发现编码氨基酸的基因的关系数据库病原真菌中的酸性生物合成酶
7. Ensemble-Based Relationship Discovery in Relational Databases [O] . Akinola Ogunsemi, John McCall, Mathias Kern, 2020

机译：基于合奏的关系数据库关系发现

Ensemble-Based Relationship Discovery in Relational Databases

摘要

著录项

相似文献

相关主题

期刊订阅