【24h】

KNOWLEDGE DISCOVERY FROM NUMERICAL DATA

机译:从数值数据中发现知识发现

获取原文

摘要

Since many real data consist of numerical data, the discovery of understandable propositions from numerical data is important, and can be regarded as inductive learning with continuous classes. One of the authors previously presented an al-gorithm for discovering understandable propositions from numerical data. The algorithm consists of normalization, multiple regression analysis and the approx-imation of multilinear functions by continuous Boolean functions. Continuous Boolean functions are included in the space of multilinear functions. The space of multilinear functions can be corresponded to the space of probability distributions using the principle of indifference. The distance between two probability distri-butions is described by Kullback-Leibler (KL) information. Thus, the distance between multilinear functions and continuous Boolean functions is described using KL information. Therefore multilinear functions can be approximated to continu-ous Boolean functions. However, since the approximation algorithm is exponential in computational complexity, it can hardly be applied to real databases. This pa-per presents a polynomial approximation algorithm and applies the algorithm to bodyfat data to discover understandable propositions.
机译:由于许多真实数据由数值数据组成,因此可以从数值数据中发现可理解的命题是重要的,并且可以被视为与连续类的感应学习。其中一位作者以前呈现了一种用于发现来自数值数据的可理解命题的AL-Gorithm。该算法由归一化,多元回归分析和通过连续布尔函数的多线函数的大约仿真组成。连续布尔函数包含在多线性函数的空间中。多线性函数的空间可以使用漠不关量的原理对应于概率分布的空间。两个概率分布之间的距离由Kullback-Leibler(KL)信息描述。因此,使用KL信息描述多线性函数与连续布尔函数之间的距离。因此,多线性函数可以近似于连续的布尔函数。然而,由于近似算法在计算复杂度中是指数的,因此它几乎不能应用于真实数据库。该PA-PER呈现多项式近似算法,并将算法应用于BodyFAT数据以发现可理解的命题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号