首页> 外国专利> Identification of mislabeled samples via phantom nodes in label propagation

Identification of mislabeled samples via phantom nodes in label propagation

机译:通过标签传播中的幻像节点识别标签错误的样本

摘要

Systems and method identify potentially mislabeled file samples. A graph is created from a plurality of sample files. The graph includes nodes associated with the sample files and behavior nodes associated with behavior signatures. Phantom nodes are created in the graph for those sample files having a known label. During a label propagation operation, a node receives data indicating a label distribution of a neighbor node in the graph. In response to determining that the current label for the node is known, a neighborhood opinion is determined for the associated phantom node, based at least in part on the label distribution of the neighboring nodes. After the label propagation operation has completed, differences between the neighborhood opinion and the current label distribution for nodes are determined. If the difference exceeds a threshold, then the current label may be incorrect.
机译:系统和方法识别可能贴错标签的文件样本。从多个样本文件创建图形。该图包括与样本文件关联的节点和与行为签名关联的行为节点。在图中为那些具有已知标签的样本文件创建幻影节点。在标签传播操作期间,节点接收指示图中邻居节点的标签分布的数据。响应于确定该节点的当前标签是已知的,至少部分地基于相邻节点的标签分布来确定相关联的幻象节点的邻居意见。标签传播操作完成后,将确定邻居意见与节点的当前标签分布之间的差异。如果差异超过阈值,则当前标签可能不正确。

著录项

  • 公开/公告号US10198576B2

    专利类型

  • 公开/公告日2019-02-05

    原文格式PDF

  • 申请/专利权人 AVAST SOFTWARE S.R.O.;

    申请/专利号US201615374865

  • 发明设计人 MARTIN VEJMELKA;

    申请日2016-12-09

  • 分类号G06F21/53;G06F21/56;

  • 国家 US

  • 入库时间 2022-08-21 12:08:49

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号