
Segmenting information records with missing values using multiple partition trees


Abstract

A method and system for predicting the class membership of a record where information for one or more variables in the record is missing. Multiple classification trees are generated. A first classification tree is computed using a substantially complete set of information for all of the variables. Other classification trees are computed for different subsets of the variables. Variables are selected for inclusion in a subset based on how strongly they influence the prediction of class membership. The first classification tree (based on the substantially complete set of information) is applied to a record with missing information. If missing information is needed by this tree in order to classify the record, another classification tree that is not based on the missing variable is selected. The class membership for a record with information missing is predicted more accurately without substantially increasing the complexity of the prediction.
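The fallback scheme in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the tiny Gini-based tree builder, the helper names (`build_tree`, `build_forest`, `predict`), the toy data, and the rule of dropping one variable at a time are all assumptions made for illustration. In particular, "variables used by the full tree" stands in for the abstract's selection of variables by how strongly they influence the prediction.

```python
from collections import Counter


class MissingValue(Exception):
    """Raised when a tree needs a variable that the record does not have."""


def gini(labels):
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def build_tree(rows, labels, features, depth=3):
    """Grow a small binary tree on numeric features; leaves are class labels."""
    majority = Counter(labels).most_common(1)[0][0]
    if depth == 0 or len(set(labels)) == 1:
        return majority
    best = None
    for f in features:
        for t in sorted({r[f] for r in rows}):
            left = [i for i, r in enumerate(rows) if r[f] <= t]
            right = [i for i, r in enumerate(rows) if r[f] > t]
            if not left or not right:
                continue
            score = (len(left) * gini([labels[i] for i in left])
                     + len(right) * gini([labels[i] for i in right]))
            if best is None or score < best[0]:
                best = (score, f, t, left, right)
    if best is None:
        return majority
    _, f, t, left, right = best
    return {
        "feature": f,
        "threshold": t,
        "left": build_tree([rows[i] for i in left],
                           [labels[i] for i in left], features, depth - 1),
        "right": build_tree([rows[i] for i in right],
                            [labels[i] for i in right], features, depth - 1),
    }


def used_features(tree):
    if not isinstance(tree, dict):
        return set()
    return ({tree["feature"]}
            | used_features(tree["left"]) | used_features(tree["right"]))


def classify(tree, record):
    """Walk one tree; fail only if a variable needed on the path is missing."""
    while isinstance(tree, dict):
        value = record.get(tree["feature"])
        if value is None:
            raise MissingValue(tree["feature"])
        tree = tree["left"] if value <= tree["threshold"] else tree["right"]
    return tree


def build_forest(rows, labels, features):
    """Full tree first, then one tree per variable the full tree relies on,
    each trained without that variable (an assumed subset-selection rule)."""
    full = build_tree(rows, labels, features)
    trees = [full]
    for f in sorted(used_features(full)):
        trees.append(build_tree(rows, labels, [g for g in features if g != f]))
    return trees


def predict(trees, record):
    """Apply the full tree; on a needed-but-missing variable, fall back to
    the next tree that can classify the record. Returns None if none can."""
    for tree in trees:
        try:
            return classify(tree, record)
        except MissingValue:
            continue
    return None


# Hypothetical toy data: two variables, two classes.
ROWS = [
    {"age": 25, "income": 30}, {"age": 30, "income": 35},
    {"age": 45, "income": 80}, {"age": 50, "income": 90},
    {"age": 35, "income": 40}, {"age": 32, "income": 85},
]
LABELS = ["no", "no", "yes", "yes", "no", "yes"]
trees = build_forest(ROWS, LABELS, ["age", "income"])

print(predict(trees, {"age": 48, "income": 88}))    # full tree suffices
print(predict(trees, {"age": 48, "income": None}))  # falls back to a tree without income
```

A record is only handed to a fallback tree when the full tree actually reaches a node that tests the missing variable, so a record missing a variable that lies off its path is still classified by the full tree. Prediction cost stays close to a single tree walk, which matches the abstract's claim that complexity does not grow substantially.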

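Bibliographic data

  • Publication number: US2002174088A1

  • Publication date: 2002-11-21

    Original format: PDF

  • Applicant/assignee: LIU TONGWEI; BEYER DIRK M.

    Application number: US20010851066

  • Inventors: DIRK M. BEYER; TONGWEI LIU

    Filing date: 2001-05-07

  • Classification: G06F7/00

  • Country: US

  • Date indexed: 2022-08-22 00:09:58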
