首页> 外国专利> Dataset Quality for Synthetic Data Generation in Computer-Based Reasoning Systems

Dataset Quality for Synthetic Data Generation in Computer-Based Reasoning Systems

机译:基于计算机的推理系统中的合成数据生成的数据集质量

摘要

Techniques for synthetic data generation in computer-based reasoning systems are discussed and include receiving a request for generation of synthetic data based on a set of training data cases. One or more focal training data cases are determined. For undetermined features (either all of them or those that are not subject to conditions), a value for the feature is determined based on the focal cases. In some embodiments, the generated synthetic data may be checked for similarity against the training data, and if similarity conditions are met, it may be modified (e.g., resampled), removed, and/or replaced.
机译:讨论了基于计算机的推理系统中的合成数据生成的技术,并且包括基于一组训练数据情况接收对合成数据生成的请求。 确定一个或多个焦点训练数据案例。 对于未确定的特征(它们的所有或者那些不受条件的那些),基于焦壳确定特征的值。 在一些实施例中,可以针对训练数据检查所生成的合成数据,并且如果满足相似性条件,则可以被修改(例如,重新采样),移除和/或替换。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号