首页> 外文会议>Machine Learning and Data Mining in Pattern Recognition(MLDM 2007); 20070718-20; Leipzig(DE) >Multi-source Data Modelling: Integrating Related Data to Improve Model Performance
【24h】

Multi-source Data Modelling: Integrating Related Data to Improve Model Performance

机译:多源数据建模:集成相关数据以提高模型性能

获取原文
获取原文并翻译 | 示例

摘要

Traditional methods in Data Mining cannot be applied to all types of data with equal success. Innovative methods for model creation are needed to address the lack of model performance for data from which it is difficult to extract relationships. This paper proposes a set of algorithms that allow the integration of data from multiple datasets that are related, as well as results from the implementation of these techniques using data from the field of Predictive Toxicology. The results show significant improvements when related data is used to aid in the model creation process, both overall and in specific data ranges. The proposed algorithms have potential for use within any field where multiple datasets exist, particularly in fields combining computing, chemistry and biology.
机译:数据挖掘中的传统方法无法成功应用于所有类型的数据。需要创新的模型创建方法来解决难以从中提取关系的数据缺乏模型性能的问题。本文提出了一组算法,这些算法允许集成来自多个相关数据集的数据,以及使用预测毒理学领域的数据实施这些技术的结果。当使用相关数据辅助模型创建过程时,无论是整体数据还是特定数据范围,结果均显示出显着改善。所提出的算法具有在存在多个数据集的任何领域中使用的潜力,特别是在结合了计算,化学和生物学的领域中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号