首页> 外文会议>National Conferences on Aritificial Intelligence >Methods for Linking and Mining Massive Heterogeneous Databases
【24h】

Methods for Linking and Mining Massive Heterogeneous Databases

机译:链接和挖掘大规模异构数据库的方法

获取原文

摘要

Many real-world KDD expeditions involve investigation of relationships between variables in different, heterogeneous databases. We present a dynamic programming technique for linking records in multiple heterogeneous databases using loosely defined fields that allow free-style verbatim entries. We develop an interestingness measure based on non-parametric randomization tests, which can be used for mining potentially useful relationships among variables. This mea- sure uses distributional characteristics of historical events, hence accommodating variable-length records in a natural way. As an illustration, we include a successful application of the proposed methodology to a real-world data mining problem in Lucent Technologies.
机译:许多现实世界KDD探险队涉及调查不同异构数据库中变量之间的关系。我们提出了一种动态编程技术,用于使用松散定义的字段链接多个异构数据库中的记录,该字段允许自由样式的逐字条目。我们基于非参数随机化测试开发了一个有趣的措施,可用于挖掘变量之间的潜在有用的关系。这种MEA肯定使用历史事件的分布特征,因此以自然的方式容纳可变长度的记录。作为一个插图,我们包括在朗讯技术的真实数据挖掘问题中成功地应用了拟议的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号