【24h】

Heuristic measures of interestingness

机译:趣味性的启发式度量

获取原文
获取原文并翻译 | 示例

摘要

The tuples in a generalized relation (i.e.,a summary generated from a database) are unique,and therefore,can be considered to be a population with a structure that can be described by some probability distribution.In this paper,we present and empirically compare sixteen heuristic measures that evaluate the structure of a summary to assign a single real-valued index that represents its interestingness relative to other summaries generated from the same database.The heuristics are based upon well-known measures of diversity,dispersion,dominance,and inequality used in everal areas of the physical,social,ecological,management,information,and computer sciences.Their use for ranking summaries generated from databases is a new application area.All sixteen heuristics rank less ocmplex summariex (i.e.,those with few tuples and/or few non-ANY attributes) as most interesting.We demonstrate that for sample data sets,the order in which some of the measures rank summaries is highly correlated.
机译:广义关系中的元组(即从数据库生成的摘要)是唯一的,因此可以认为是具有可以用某种概率分布描述的结构的总体。在本文中,我们进行实证比较十六种启发式度量,用于评估摘要的结构,以分配单个实值索引,该索引表示相对于从同一数据库生成的其他摘要而言的有趣程度。启发式方法基于众所周知的多样性,分散性,支配性和不平等性度量用于物理,社会,生态,管理,信息和计算机科学的各个领域。它们对从数据库生成的摘要进行排名的用途是一个新的应用领域。所有十六种启发式算法对osumplex summariex的排名都较少(即那些元组和/或元组和/或最少的非ANY属性)。我们证明,对于样本数据集,某些度量对摘要进行排序的顺序高度相关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号