首页> 外国专利> Semi-supervised classification system

Semi-supervised classification system

机译:半监督分类系统

摘要

Unclassified observations are classified. Similarity values are computed for each unclassified observation and for each target variable value. A confidence value is computed for each unclassified observation using the similarity values. A high-confidence threshold value and a low-confidence threshold value are computed from the confidence values. For each observation, when the confidence value is greater than the high-confidence threshold value, the observation is added to a training dataset and, when the confidence value is greater than the low-confidence threshold value and less than the high-confidence threshold value, the observation is added to the training dataset based on a comparison between a random value drawn from a uniform distribution and an inclusion percentage value. A classification model is trained with the training dataset and classified observations. The trained classification model is executed with the unclassified observations to determine a label assignment.
机译:未分类的观察分类。 为每个未分类的观察和每个目标变量值计算相似度值。 使用相似性值计算每个未分类观察的置信值。 从置信度值计算高置信阈值和低置信阈值。 对于每个观察,当置信度值大于高置信阈值时,将观察添加到训练数据集,并且当置信度值大于低置信阈值并且小于高置信阈值时 ,基于从均匀分布和包含百分比值汲取的随机值之间的比较,观察将观察添加到训练数据集。 培训数据集和分类模型培训分类模型。 使用未分类的观察执行训练的分类模型以确定标签分配。

著录项

  • 公开/公告号US11200514B1

    专利类型

  • 公开/公告日2021-12-14

    原文格式PDF

  • 申请/专利权人 SAS INSTITUTE INC.;

    申请/专利号US202117342825

  • 发明设计人 XU CHEN;XINMIN WU;

    申请日2021-06-09

  • 分类号G06N20;

  • 国家 US

  • 入库时间 2024-06-14 22:31:28

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号