SIMPLIFYING AND/OR PARAPHRASING COMPLEX TEXTUAL CONTENT BY JOINTLY LEARNING SEMANTIC ALIGNMENT AND SIMPLICITY

Abstract

Techniques are described herein for training machine learning models to simplify (e.g., paraphrase) complex textual content by ensuring that the models jointly learn both semantic alignment and notions of simplicity. In various embodiments, an input textual segment having multiple tokens and associated with a first measure of simplicity may be applied as input across a trained machine learning model to generate an output textual segment. The output textual segment may be semantically aligned with the input textual segment (e.g., a paraphrase thereof) and associated with a second measure of simplicity that is greater than the first measure of simplicity. The trained machine learning model may include an encoder portion and a decoder portion, as well as control layer(s) trained to maximize the second measure of simplicity by replacing token(s) of the input textual segment with replacement token(s).
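The input/output contract described above (replace tokens so that the output scores higher on a simplicity measure than the input) can be illustrated with a minimal sketch. This is not the trained encoder-decoder model with control layers that the abstract describes; it swaps in a rule-based substitution as a stand-in. The lexicon `SIMPLER`, the functions `simplicity` and `simplify`, and the length-based score are all hypothetical and chosen only for illustration. Semantic alignment is simply assumed here because the lexicon contains near-synonyms.

```python
from typing import Dict, List

# Hypothetical lexicon of complex tokens and simpler near-synonyms (illustrative only).
SIMPLER: Dict[str, str] = {
    "utilize": "use",
    "commence": "start",
    "terminate": "end",
    "approximately": "about",
}


def simplicity(tokens: List[str]) -> float:
    """Toy simplicity measure (an assumption): negative mean token length."""
    if not tokens:
        return 0.0
    return -sum(len(t) for t in tokens) / len(tokens)


def simplify(tokens: List[str]) -> List[str]:
    """Replace a token only when the lexicon offers a shorter alternative."""
    out = []
    for tok in tokens:
        candidate = SIMPLER.get(tok.lower(), tok)
        out.append(candidate if len(candidate) < len(tok) else tok)
    return out


source = "we utilize approximately ten samples".split()
target = simplify(source)

print(target)
print(simplicity(source), simplicity(target))
```

In this toy setting the first measure of simplicity is `simplicity(source)` and the second is `simplicity(target)`; a learned model would replace both the lexicon lookup and the hand-written score.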
