Multi-source Data Modelling: Integrating Related Data to Improve Model Performance


Abstract

Traditional methods in Data Mining cannot be applied to all types of data with equal success. Innovative methods for model creation are needed to address the lack of model performance for data from which it is difficult to extract relationships. This paper proposes a set of algorithms that allow the integration of data from multiple datasets that are related, as well as results from the implementation of these techniques using data from the field of Predictive Toxicology. The results show significant improvements when related data is used to aid in the model creation process, both overall and in specific data ranges. The proposed algorithms have potential for use within any field where multiple datasets exist, particularly in fields combining computing, chemistry and biology.


Bibliographic details

Source
Machine Learning and Data Mining in Pattern Recognition (MLDM 2007); 18-20 July 2007; Leipzig (DE) | 2007 | pp. 32-46 | 15 pages
Conference location: Leipzig (DE)
Authors
Paul R. Trundle; Daniel C. Neagu; Qasim Chaudhry
Author affiliations

University of Bradford, Richmond Road, Bradford, West Yorkshire, BD7 1DP, UK;

Central Science Laboratory, Sand Hutton, York, YO41 1LZ, UK;

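Conference organizer: not listed
Original format: PDF
Language of text: English
Chinese Library Classification: Applications of computers
Keywords
data integration; data mining; machine learning; multi-species modelling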


Similar literature


1. The integration of multi-source remote sensing data for the modelling of shoreline change rates in a Mediterranean coastal sector [J]. Aguilar Fernando J., Fernandez-Luque Ismael, Aguilar Manuel A. International Journal of Remote Sensing. 2019, Issue 3a4
2. Research on Multi-Source Data Integration Based on Ontology and Karma Modeling [J]. Hongyan Yun, Ying He, Li Lin. International Journal of Intelligent Information Technologies. 2019, Issue 2
3. Coal seam surface modeling and updating with multi-source data integration using Bayesian Geostatistics [J]. Xiaojun Li, Peinan Li, Hehua Zhu. Engineering Geology. 2013
4. Multi-source Data Modelling: Integrating Related Data to Improve Model Performance [C]. Paul R. Trundle, Daniel C. Neagu, Qasim Chaudhry. Machine Learning and Data Mining in Pattern Recognition International Conference. 2007
5. Spatio-Temporal Information Extraction Under Uncertainty Using Multi-Source Data Integration and Machine Learning: Applications to Human Settlement Modelling [D]. Uhl, Johannes Hermann. 2019
6. Improving the Performance of Risk-adjusted Mortality Modeling for Colorectal Cancer Surgery by Combining Claims Data and Clinical Data [O]. Won Mo Jang, Jae-Hyun Park, Jong-Hyock Park. 2013
7. Physical Network Models and Multi-source Data Integration [O]. Chen-Hsiang Yeang, Tommi Jaakkola. 2003
