SYSTEMS AND/OR METHODS FOR AUTOMATICALLY CLASSIFYING AND ENRICHING DATA RECORDS IMPORTED FROM BIG DATA AND/OR OTHER SOURCES TO HELP ENSURE DATA INTEGRITY AND CONSISTENCY

Abstract

Techniques relating to managing “bad” or “imperfect” data being imported into a database system are described herein. As an example, a lifecycle technology solution helps receive data from a variety of data sources in known and/or unknown formats, standardize it, fit it to a known taxonomy through model-assisted classification, store it in a database in a manner that is consistent with the taxonomy, and allow it to be queried for a variety of different uses. Some or all of the disclosed technology concerning auto-classification, enrichment, clustering models and model stacks, and/or the like, may be used in these and/or other regards.
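
The lifecycle named in the abstract (receive, standardize, classify against a taxonomy, store, query) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the two-category `TAXONOMY`, the field names `description`, `desc`, and `item`, the two toy matchers standing in for a "model stack", and the SQLite table layout. Real model-assisted classification would use trained models rather than keyword matching.

```python
import sqlite3

# Hypothetical taxonomy (category -> keywords); illustrative only.
TAXONOMY = {
    "office_supplies": {"paper", "pen", "stapler"},
    "it_hardware": {"laptop", "monitor", "keyboard"},
}


def standardize(raw):
    """Map differently named source fields onto one schema and normalize the text."""
    desc = raw.get("description") or raw.get("desc") or raw.get("item") or ""
    return {"description": " ".join(desc.lower().split())}


def keyword_model(record):
    """First model in the stack: exact keyword overlap with the taxonomy."""
    tokens = set(record["description"].split())
    scores = {cat: len(tokens & kws) for cat, kws in TAXONOMY.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def prefix_model(record):
    """Second model: looser match on a shared 4-character prefix."""
    for token in record["description"].split():
        if len(token) < 4:
            continue
        for cat, kws in TAXONOMY.items():
            if any(kw.startswith(token[:4]) for kw in kws):
                return cat
    return None


# Assumed reading of "model stack": try each model in order until one answers.
MODEL_STACK = [keyword_model, prefix_model]


def classify(record):
    for model in MODEL_STACK:
        category = model(record)
        if category is not None:
            return category
    return "unclassified"


def ingest(raw_records, conn):
    """Standardize, classify, and store each record with its taxonomy category."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS records (description TEXT, category TEXT)"
    )
    for raw in raw_records:
        rec = standardize(raw)
        conn.execute(
            "INSERT INTO records VALUES (?, ?)",
            (rec["description"], classify(rec)),
        )
    conn.commit()


conn = sqlite3.connect(":memory:")
ingest(
    [
        {"desc": "  Laptop  15 inch"},
        {"item": "Staplers, box"},
        {"description": "mystery"},
    ],
    conn,
)
rows = conn.execute(
    "SELECT description, category FROM records ORDER BY rowid"
).fetchall()
```

Once stored this way, the records can be queried by category, for example with `SELECT category, COUNT(*) FROM records GROUP BY category`. Records that no model in the stack can place are kept under `unclassified` rather than dropped, so they remain available for later review or enrichment.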
