NATURAL LANGUAGE PROCESSING SYSTEMS AND METHODS FOR AUTOMATIC REDUCTION OF FALSE POSITIVES IN DOMAIN DISCOVERY

Abstract

A rules engine is adapted to analyze each match that a domain discovery system reports as matching a seed domain. Using a natural language processing (NLP) library, the rules engine determines segments from the match, assigns a lexical category to each segment based on the context in which the seed domain string is used, and compares the lexical category of the segment closest to the seed domain string with the lexical category of the seed domain string. Based on the comparison, the rules engine determines whether the match is relevant to the seed domain. If it is not, the match is identified as a false positive and automatically removed from the set of matches that the domain discovery system produced for the seed domain.
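The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The toy `LEXICON` stands in for the NLP library and ignores context, the greedy segmentation is a simplification, and all names (`split_words`, `closest_segment`, `is_relevant`, `filter_matches`) are hypothetical. The abstract does not say which comparison outcome counts as relevant, so the sketch assumes a match is kept when the two lexical categories agree.

```python
from typing import Dict, List, Optional

# Hypothetical toy lexicon standing in for an NLP library's tagger.
LEXICON: Dict[str, str] = {
    "bank": "NOUN",
    "login": "NOUN",
    "secure": "ADJ",
    "quickly": "ADV",
}


def split_words(text: str) -> List[str]:
    """Greedy longest-match split into lexicon words; other characters are grouped."""
    out: List[str] = []
    unknown = ""
    i = 0
    while i < len(text):
        for j in range(len(text), i, -1):
            if text[i:j] in LEXICON:
                if unknown:
                    out.append(unknown)
                    unknown = ""
                out.append(text[i:j])
                i = j
                break
        else:
            # No lexicon word starts at position i.
            unknown += text[i]
            i += 1
    if unknown:
        out.append(unknown)
    return out


def closest_segment(match: str, seed: str) -> Optional[str]:
    """Return the segment adjacent to the seed string, or None if there is none.

    Assumption: the segment after the seed is preferred over the one before it.
    """
    label = match.split(".")[0].lower()
    idx = label.find(seed)
    before = split_words(label[:idx])
    after = split_words(label[idx + len(seed):])
    if after:
        return after[0]
    if before:
        return before[-1]
    return None


def is_relevant(match: str, seed: str, seed_category: str) -> bool:
    """Assumed rule: relevant when the closest segment shares the seed's category."""
    if seed not in match.split(".")[0].lower():
        return False
    neighbor = closest_segment(match, seed)
    if neighbor is None:
        # The label is exactly the seed string.
        return True
    return LEXICON.get(neighbor, "X") == seed_category


def filter_matches(matches: List[str], seed: str, seed_category: str) -> List[str]:
    """Drop matches flagged as false positives and keep the rest in order."""
    return [m for m in matches if is_relevant(m, seed, seed_category)]
```

With the seed `chase` treated as a noun, `filter_matches(["chasebank.com", "chasequickly.net", "chase.org"], "chase", "NOUN")` keeps `chasebank.com` and `chase.org` and drops `chasequickly.net`, because the segment next to the seed there is tagged as an adverb. A real system would replace the lexicon lookup with a context-aware tagger.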
Bibliographic details

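  • Publication number: US2021067557A1

  • Publication date: 2021-03-04

  • Original format: PDF

  • Applicant/assignee: PROOFPOINT INC.

  • Application number: US202016871258

  • Filing date: 2020-05-11

  • Classification: H04L29/06; G06F16/2458; G06F40/237; H04L29/12

  • Country: US

  • Record added: 2024-06-14 21:20:56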
