首页> 外国专利> MANAGING DEFECTS IN A MODEL TRAINING PIPELINE USING SYNTHETIC DATA SETS ASSOCIATED WITH DEFECT TYPES

MANAGING DEFECTS IN A MODEL TRAINING PIPELINE USING SYNTHETIC DATA SETS ASSOCIATED WITH DEFECT TYPES

机译:使用与缺陷类型相关联的合成数据集管理模型培训管道中的缺陷

摘要

The disclosure herein describes managing defects in a model training pipeline. A synthetic data set is generated that is associated with a defect type and lifecycle stage of the model training pipeline, and baseline performance metrics associated with the defect type are generated. Based on a code change to the pipeline, a test model is trained using the pipeline and the synthetic data set, and test performance metrics are collected based on the test model and associated with the defect type. Based on comparing the baseline performance metrics and the test performance metrics, a defect of a particular defect type is identified in the pipeline. An indicator of the defect is provided that includes the defect type and the lifecycle stage with which the synthetic data set is associated, whereby a defect correction process is enabled to remedy the defect based on the associated defect type and the lifecycle stage.
机译:本发明的公开内容描述了模型训练管道中的管理缺陷。 生成与模型训练流水线的缺陷类型和生命周期阶段相关联的合成数据集,并且生成与缺陷类型相关联的基线性能度量。 基于对流水线的代码更改,使用流水线和合成数据集进行测试模型,基于测试模型收集测试性能度量,并与缺陷类型相关联。 基于比较基线性能度量和测试性能度量的比较,在管道中识别特定缺陷类型的缺陷。 提供了缺陷的指示,包括缺陷类型和合成数据集相关联的生命周期阶段,由此能够基于相关的缺陷类型和生命周期阶段来弥补缺陷校正处理来解决缺陷。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号