首页> 外文学位 >A usefulness metric and its application to decision tree-based classification.
【24h】

A usefulness metric and its application to decision tree-based classification.

机译:有用性度量标准及其在基于决策树的分类中的应用。

获取原文
获取原文并翻译 | 示例

摘要

Data mining, a new area of research in computer science, combines various algorithms and techniques used in the established fields of database systems, statistics and artificial intelligence. Data mining is concerned with the extraction of information from generally large volumes of data. Data mining has grown out of industry's need to make better use of the vast amounts of stored data accumulated over the years. Users are demanding software that is more sophisticated and able to answer queries that are unable to be answered by more traditional query methods. In order to meet this demand, researchers are focusing their attention on improving data mining extraction methods. Extracting information using the traditional methods of SQL or statistical analysis are not powerful enough to answer a general query, such as, "What determines whether a graduate student will complete the program?" The goal of data mining extraction methods is to answer these general types of queries.; In the past, the primary focus has been to improve the correctness of extraction algorithms. The more correct the results, the more successful the algorithm. Although this is important for industry, measuring correctness is no longer sufficient for measuring success. Industry has placed an additional demand on the information; the information must be useful. The intent of data mining is to provide a user with information that will help them do their job better. The information or results of the data mining algorithm must be analyzed to determine if this goal has been achieved. This analysis cannot be limited to measuring correctness; it must also measure the information's usefulness. This is not an easy task; that which is useful to one user may not be useful to another.; A usefulness metric will be presented, which will incorporate both objective and subjective measures in order to measure usefulness. It will be shown that this metric can be tailored for an individual's needs. It will also be shown how a decision tree based classifier, one type of data mining extraction algorithm, can be adapted to use the usefulness metric in order to be more successful. By using the usefulness metric, the algorithm will look for and extract useful information. The metric will also be applied to the resulting rule sets of decision tree based classifiers in order to determine their usefulness.
机译:数据挖掘是计算机科学的新研究领域,结合了数据库系统,统计和人工智能等已建立领域中使用的各种算法和技术。数据挖掘与从通常的大量数据中提取信息有关。数据挖掘已经超出了行业的需求,以便更好地利用多年来积累的大量存储数据。用户要求软件更加复杂,并且能够回答更传统的查询方法无法回答的查询。为了满足这一需求,研究人员将注意力集中在改进数据挖掘提取方法上。使用传统的SQL或统计分析方法提取信息的能力不足以回答一般查询,例如“由什么决定研究生是否会完成该程序?”数据挖掘提取方法的目的是回答这些一般类型的查询。过去,主要重点是提高提取算法的正确性。结果越正确,算法就越成功。尽管这对行业很重要,但是测量正确性已不足以测量成功。工业对信息提出了额外的要求。该信息必须有用。数据挖掘的目的是向用户提供信息,以帮助他们更好地完成工作。必须分析数据挖掘算法的信息或结果,以确定是否已实现该目标。这种分析不能局限于测量正确性。它还必须衡量信息的有用性。这不是一件容易的事。对一个用户有用的内容可能对另一用户无效。将介绍一个有用性度量标准,该度量标准将结合客观和主观措施,以衡量有用性。将显示该度量标准可以针对个人需求进行调整。还将展示如何将基于决策树的分类器(一种类型的数据挖掘提取算法)调整为使用有用性度量,以使其更加成功。通过使用有用性度量,该算法将查找并提取有用的信息。该度量还将应用于基于决策树的分类器的所得规则集,以确定其有用性。

著录项

  • 作者

    St.Clair, Caroline Maria.;

  • 作者单位

    DePaul University, School of Computer Science, Telecommunications, and Information Systems.;

  • 授予单位 DePaul University, School of Computer Science, Telecommunications, and Information Systems.;
  • 学科 Computer Science.; Information Science.; Artificial Intelligence.
  • 学位 Ph.D.
  • 年度 2000
  • 页码 112 p.
  • 总页数 112
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;信息与知识传播;人工智能理论;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号