Simplifying and/or paraphrasing complex textual content by jointly learning semantic alignment and simplicity

Abstract

Techniques are described herein for training machine learning models to simplify (e.g., paraphrase) complex textual content by ensuring that the machine learning models jointly learn both semantic alignment and notions of simplicity. In various embodiments, an input textual segment having multiple tokens and being associated with a first measure of simplicity may be applied as input across a trained machine learning model to generate an output textual segment. The output textual segment may be semantically aligned with the input textual segment (e.g., a paraphrase thereof) and associated with a second measure of simplicity that is greater than the first measure of simplicity. The trained machine learning model may include an encoder portion and a decoder portion, as well as control layer(s) trained to maximize the second measure of simplicity by replacing token(s) of the input textual segment with replacement token(s).
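The input/output relationship described in the abstract can be illustrated with a toy Python sketch. It is not the patented model: a hand-written lookup table stands in for the trained encoder, decoder, and control layer(s), and the names `SIMPLER`, `simplicity`, and `simplify`, as well as the length-based score, are assumptions made only for illustration. Semantic alignment is assumed to hold because every table entry is a near-synonym.

```python
# Toy sketch only: a lookup table stands in for the trained
# encoder/decoder and control layer(s) described in the abstract.

# Hypothetical replacement table (complex token -> simpler near-synonym).
SIMPLER = {
    "utilize": "use",
    "commence": "start",
    "approximately": "about",
    "physician": "doctor",
}


def simplicity(tokens):
    """Hypothetical simplicity measure: higher when tokens are shorter on average."""
    if not tokens:
        return 0.0
    mean_len = sum(len(t) for t in tokens) / len(tokens)
    return 1.0 / mean_len


def simplify(tokens):
    """Swap in a replacement token only when it is shorter, so the score never drops."""
    out = []
    for tok in tokens:
        candidate = SIMPLER.get(tok.lower(), tok)
        out.append(candidate if len(candidate) < len(tok) else tok)
    return out


source = "The physician will commence treatment in approximately two weeks".split()
target = simplify(source)

first_measure = simplicity(source)   # measure for the input textual segment
second_measure = simplicity(target)  # measure for the output textual segment
print(" ".join(target))
print(second_measure > first_measure)
```

In the abstract's terms, `source` plays the role of the input textual segment and `target` the output textual segment. A trained model would learn both the replacements and the notion of simplicity from data rather than from a fixed table.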