Heuristic Measures of Interestingness

Abstract

The tuples in a generalized relation (i.e., a summary generated from a database) are unique and can therefore be considered a population whose structure can be described by a probability distribution. In this paper, we present and empirically compare sixteen heuristic measures that evaluate the structure of a summary and assign it a single real-valued index representing its interestingness relative to other summaries generated from the same database. The heuristics are based upon well-known measures of diversity, dispersion, dominance, and inequality used in several areas of the physical, social, ecological, management, information, and computer sciences. Their use for ranking summaries generated from databases is a new application area. All sixteen heuristics rank less complex summaries (i.e., those with few tuples and/or few non-ANY attributes) as most interesting. We demonstrate that, for sample data sets, the orders in which some of the measures rank summaries are highly correlated.
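
As a rough illustration of the idea, the sketch below treats the tuple counts of a summary as a probability distribution and scores it with two standard diversity indices, Shannon entropy and Simpson's concentration index. The abstract does not name the sixteen measures, so these two are chosen here only as familiar examples. The function names, the sample summaries, and the choice of which direction counts as "more interesting" are all assumptions made for illustration.

```python
import math


def probabilities(counts):
    """Turn per-tuple counts of a summary into a probability distribution."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("a summary needs a positive total count")
    return [c / total for c in counts]


def shannon_entropy(counts):
    """Shannon entropy in bits; larger values mean a more even distribution."""
    return -sum(p * math.log2(p) for p in probabilities(counts) if p > 0)


def simpson_index(counts):
    """Simpson's concentration index; larger values mean more dominance."""
    return sum(p * p for p in probabilities(counts))


def rank_summaries(summaries, measure, reverse=True):
    """Order summary names by a measure (assumed: highest score first)."""
    return sorted(summaries, key=lambda name: measure(summaries[name]),
                  reverse=reverse)


# Hypothetical summaries: each maps a name to the counts of its tuples.
summaries = {
    "even": [5, 5],
    "skewed": [9, 1],
    "many": [1, 1, 1, 1],
}

if __name__ == "__main__":
    for name in rank_summaries(summaries, simpson_index):
        print(name, round(simpson_index(summaries[name]), 3))
```

Passing a different `measure` (or `reverse=False`) to `rank_summaries` gives another ordering of the same summaries, which is the kind of rank comparison across measures that the abstract reports.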