CONSTRUCTION OF REFERENCE DATABASE ACCURATELY REPRESENTING COMPLETE SET OF DATA ITEMS FOR FASTER AND TRACTABLE CLASSIFICATION USAGE

Abstract

For each unique pair of data items in a complete set, a computing device determines the distance between the two data items of the pair. The computing device then repeats the following steps until no data items remain in the complete set. For each data item remaining in the complete set, the computing device determines a similarity subset containing every other data item whose distance from that data item is less than a target difference threshold. The computing device moves a selected data item from a largest similarity subset to a reference database, which is a subset of the complete set. The computing device then removes from the complete set each data item whose distance from the selected data item is less than the threshold. A new data item can be classified using the reference database.
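The loop described in the abstract can be sketched in Python as follows. This is only an illustrative reading, not the patented implementation: the function name `build_reference_database`, the `distance` callable, and the tie-breaking rule are assumptions. The sketch also assumes that the "selected data item" is the item whose own similarity subset is largest, which the abstract does not state explicitly.

```python
from itertools import combinations
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")


def build_reference_database(
    items: Sequence[T],
    distance: Callable[[T, T], float],
    threshold: float,
) -> List[T]:
    """Greedy selection of representative items (hypothetical helper)."""
    n = len(items)

    # Step 1: compute the distance for each unique pair once.
    dist = {}
    for i, j in combinations(range(n), 2):
        dist[i, j] = dist[j, i] = distance(items[i], items[j])

    remaining = set(range(n))  # indices still in the "complete set"
    reference: List[T] = []    # the reference database being built

    # Step 2: repeat until no items remain in the complete set.
    while remaining:
        # Similarity subset of each remaining item: the other remaining
        # items closer than the threshold (strictly less than).
        similar = {
            i: {j for j in remaining if j != i and dist[i, j] < threshold}
            for i in remaining
        }
        # Assumption: select the item with the largest similarity subset,
        # breaking ties by lowest index so the result is deterministic.
        selected = min(remaining, key=lambda i: (-len(similar[i]), i))

        # Move the selected item into the reference database, then drop
        # it and every item within the threshold of it.
        reference.append(items[selected])
        remaining -= similar[selected] | {selected}

    return reference


if __name__ == "__main__":
    data = [0, 1, 2, 50, 51, 90]
    refs = build_reference_database(data, lambda a, b: abs(a - b), threshold=2)
    print(refs)
```

With these toy integers and an absolute-difference distance, the item `1` is selected first because both `0` and `2` lie within the threshold of it; each later pass works on whatever items remain. How a new item is then classified against the reference database is not specified in the abstract, so it is left out of the sketch.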
