SYSTEMS AND METHODS FOR GENERATING NATURAL LANGUAGE PROCESSING TRAINING SAMPLES WITH INFLECTIONAL PERTURBATIONS

Abstract

Embodiments described herein provide systems and methods for generating an adversarial sample with inflectional perturbations for training a natural language processing (NLP) system. A natural language sentence is received at an inflection perturbation module. Tokens are generated from the natural language sentence. For each token that has a part of speech that is a verb, adjective, or an adverb, an inflected form is determined. An adversarial sample of the natural language sentence is generated by detokenizing inflected forms of the tokens. The NLP system is trained using the adversarial sample.
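
The steps in the abstract (tokenize, find tokens tagged as verb, adjective, or adverb, substitute an inflected form, detokenize) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: `TOY_LEXICON`, `tokenize`, `detokenize`, and `inflect_perturb` are hypothetical, the hand-written lexicon stands in for a real part-of-speech tagger and morphological inflector, and the inflected form is picked at random because the abstract does not say how it is determined.

```python
import random
import re

# Hypothetical toy lexicon: lowercase surface form -> (part of speech, alternative inflections).
# Assumption: a real system would use a POS tagger and an inflection library instead.
TOY_LEXICON = {
    "works": ("VERB", ["work", "worked", "working"]),
    "hard": ("ADV", ["harder", "hardest"]),
    "big": ("ADJ", ["bigger", "biggest"]),
}

# Parts of speech that the abstract names as candidates for perturbation.
PERTURBABLE_POS = {"VERB", "ADJ", "ADV"}


def tokenize(sentence):
    """Split a sentence into word tokens and single punctuation tokens."""
    return re.findall(r"\w+|[^\w\s]", sentence)


def detokenize(tokens):
    """Join tokens with spaces, then reattach punctuation to the preceding token.

    This simple rule is only adequate for trailing punctuation such as '.' or ','.
    """
    return re.sub(r"\s+([^\w\s])", r"\1", " ".join(tokens))


def inflect_perturb(sentence, rng):
    """Return a copy of the sentence with perturbable tokens replaced by an inflected form.

    Assumption: the replacement is chosen at random; tokens missing from the
    toy lexicon are left unchanged.
    """
    perturbed = []
    for token in tokenize(sentence):
        entry = TOY_LEXICON.get(token.lower())
        if entry is not None and entry[0] in PERTURBABLE_POS:
            perturbed.append(rng.choice(entry[1]))
        else:
            perturbed.append(token)
    return detokenize(perturbed)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs give the same sample
    print(inflect_perturb("She works hard on the big problem.", rng))
```

The resulting sentence would then be added to the training data of the NLP system alongside, or in place of, the original sentence; that training step depends on the model and is not shown.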

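Bibliographic data

  • Publication number: US2021173872A1
  • Patent type: (not specified)
  • Publication date: 2021-06-10
  • Original format: PDF
  • Applicant/Assignee: SALESFORCE.COM INC.
  • Application number: US202016869903
  • Inventors: SAMSON MIN RONG TAN; SHAFIQ RAYHAN JOTY
  • Filing date: 2020-05-08
  • Classification codes: G06F16/9032; G10L15/16; G10L15/18; G06F40/284
  • Country: US
  • Record added: 2022-08-24 19:07:35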
