SYSTEM AND METHOD FOR CLASSIFICATION OF LOW RELEVANCE RECORDS IN A DATABASE USING INSTANCE-BASED CLASSIFIERS AND MACHINE LEARNING

Abstract

Devices and methods for classifying low relevance records in a database are disclosed. A method includes: in response to a request to delete a selected database record, generating a vector representation of the selected record, deleting the selected record from the database, and storing the vector representation of the deleted record; in response to storing the vector representation, determining, among a plurality of clusters into which vector representations of deleted records are partitioned, the cluster to which the vector representation has the shortest distance; determining a distance between a record in the database and the nearest of those clusters; and, in response to the record being within a predetermined distance of the nearest cluster, determining that the record is a deletion candidate.
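The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how records are vectorized, how clusters are formed, or which distance metric is used. The `vectorize` features (`age_days`, `reads`), the centroid-based `Cluster` class, the seed centroids, the Euclidean metric, and the `max_distance` threshold are all assumptions chosen for illustration.

```python
import math
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Vector = Tuple[float, ...]


@dataclass
class Cluster:
    """A cluster of deleted-record vectors, summarized by its centroid (assumption)."""
    centroid: Vector
    members: List[Vector] = field(default_factory=list)

    def add(self, vec: Vector) -> None:
        # Store the vector and recompute the centroid as the mean of all members.
        self.members.append(vec)
        n = len(self.members)
        self.centroid = tuple(
            sum(m[i] for m in self.members) / n for i in range(len(vec))
        )


def vectorize(record: dict) -> Vector:
    # Hypothetical featurization: record age in days and number of reads.
    return (float(record["age_days"]), float(record["reads"]))


def nearest_cluster(vec: Vector, clusters: List[Cluster]) -> Cluster:
    # Euclidean distance to each centroid; the metric is an assumption.
    return min(clusters, key=lambda c: math.dist(vec, c.centroid))


def delete_record(db: Dict[str, dict], key: str, clusters: List[Cluster]) -> Cluster:
    # Vectorize first, then delete, then store the vector in its nearest cluster.
    vec = vectorize(db[key])
    del db[key]
    cluster = nearest_cluster(vec, clusters)
    cluster.add(vec)
    return cluster


def deletion_candidates(
    db: Dict[str, dict], clusters: List[Cluster], max_distance: float
) -> List[str]:
    # A remaining record is a candidate if it lies within max_distance
    # of its nearest cluster of deleted-record vectors.
    result = []
    for key, record in db.items():
        vec = vectorize(record)
        cluster = nearest_cluster(vec, clusters)
        if math.dist(vec, cluster.centroid) <= max_distance:
            result.append(key)
    return result


# Toy data: two old, rarely read records and one fresh, heavily read record.
db = {
    "a": {"age_days": 400, "reads": 1},
    "b": {"age_days": 390, "reads": 2},
    "c": {"age_days": 5, "reads": 120},
}
# Seed centroids stand in for clusters built from earlier deletions.
clusters = [Cluster(centroid=(365.0, 0.0)), Cluster(centroid=(0.0, 100.0))]

delete_record(db, "a", clusters)
candidates = deletion_candidates(db, clusters, max_distance=15.0)
print(candidates)
```

In this toy run, deleting record `a` moves the first centroid to that record's vector, and record `b`, which resembles it, falls inside the threshold while record `c` does not. A real system would likely normalize the features before measuring distance, since raw age and read counts are on different scales.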
