首页> 外文学位 >Rough set based rule evaluations and their applications.
【24h】

Rough set based rule evaluations and their applications.

机译:基于粗糙集的规则评估及其应用。

获取原文
获取原文并翻译 | 示例

摘要

Knowledge discovery is an important process in data analysis, data mining and machine learning. Typically knowledge is presented in the form of rules. However, knowledge discovery systems often generate a huge amount of rules. One of the challenges we face is how to automatically discover interesting and meaningful knowledge from such discovered rules. It is infeasible for human beings to select important and interesting rules manually. How to provide a measure to evaluate the qualities of rules in order to facilitate the understanding of data mining results becomes our focus. In this thesis, we present a series of rule evaluation techniques for the purpose of facilitating the knowledge understanding process. These evaluation techniques help not only to reduce the number of rules, but also to extract higher quality rules. Empirical studies on both artificial data sets and real world data sets demonstrate how such techniques can contribute to practical systems such as ones for medical diagnosis and web personalization.; In the first part of this thesis, we discuss several rule evaluation techniques that are proposed towards rule postprocessing. We show how properly defined rule templates can be used as a rule evaluation approach. We propose two rough set based measures, a Rule Importance Measure, and a Rules-As-Attributes Measure, to rank the important and interesting rules. In the second part of this thesis, we show how data preprocessing can help with rule evaluation. Because well preprocessed data is essential for important rule generation, we propose a new approach for processing missing attribute values for enhancing the generated rules. In the third part of this thesis, a rough set based rule evaluation system is demonstrated to show the effectiveness of the measures proposed in this thesis. Furthermore, a new user-centric web personalization system is used as a case study to demonstrate how the proposed evaluation measures can be used in an actual application.
机译:知识发现是数据分析,数据挖掘和机器学习中的重要过程。通常,知识以规则的形式表示。但是,知识发现系统通常会生成大量规则。我们面临的挑战之一是如何从发现的规则中自动发现有趣和有意义的知识。人类手动选择重要且有趣的规则是不可行的。如何提供一种评估规则质量的措施以促进对数据挖掘结果的理解成为我们关注的重点。在本文中,我们提出了一系列规则评估技术,以促进知识理解过程。这些评估技术不仅有助于减少规则数量,而且有助于提取更高质量的规则。对人工数据集和现实世界数据集的实证研究表明,这些技术如何对诸如医学诊断和网络个性化的系统等实用系统做出贡献。在本文的第一部分,我们讨论了针对规则后处理提出的几种规则评估技术。我们展示了如何正确定义规则模板可以用作规则评估方法。我们提出了两个基于粗糙集的度量,一个规则重要性度量和一个作为属性的规则度量,以对重要而有趣的规则进行排名。在本文的第二部分,我们展示了数据预处理如何帮助规则评估。由于预处理良好的数据对于重要的规则生成至关重要,因此我们提出了一种新的方法来处理缺失的属性值,以增强生成的规则。在论文的第三部分中,展示了一种基于粗糙集的规则评估系统,以证明本文提出的措施的有效性。此外,以新的以用户为中心的Web个性化系统被用作案例研究,以演示所提出的评估措施如何在实际应用中使用。

著录项

  • 作者

    Li, Jiye.;

  • 作者单位

    University of Waterloo (Canada).;

  • 授予单位 University of Waterloo (Canada).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2007
  • 页码 205 p.
  • 总页数 205
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号