System and method for handling a high-cardinality attribute in decision trees

Abstract

High-cardinality attributes are used as input attributes and as output attributes in decision tree creation. When determining which attribute test to use at a node, a distribution of states for the high-cardinality attribute in the testing data at the node is created. A certain number of the most common states for the high-cardinality attribute are selected. The most common states are used as the states for the high-cardinality attribute in determining which attribute test to use. The remaining states are combined into one state and used as a single state for the high-cardinality attribute in determining which attribute test to use. The high-cardinality attribute may be either an input attribute or an output attribute to the decision tree.
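The grouping step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function name `collapse_states`, the `OTHER` label, the parameter `k`, and the sample data are all hypothetical. It counts the states of one high-cardinality attribute among the records at a node, keeps the `k` most common states, and merges every remaining state into a single combined state.

```python
from collections import Counter

# Hypothetical label for the single state that absorbs all uncommon states.
OTHER = "__other__"


def collapse_states(values, k):
    """Keep the k most common states in `values`; map all others to OTHER.

    `values` holds the attribute's state for each record at the current node.
    """
    counts = Counter(values)  # distribution of states at this node
    keep = {state for state, _ in counts.most_common(k)}
    return [v if v in keep else OTHER for v in values]


# Hypothetical records reaching one node, for a ZIP-code-like attribute.
node_values = ["98052", "98052", "10001", "98052", "10001", "60601", "94105"]
collapsed = collapse_states(node_values, k=2)
print(collapsed)
```

Because the distribution is recomputed from the records at each node, the set of retained states can differ from node to node. The collapsed values can then be fed to whatever split-scoring measure the tree learner uses, whether the attribute is an input or the output.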
