Venue: Conference on Neural Information Processing Systems

SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems

Abstract

In the last year, new models and methods for pretraining and transfer learning have driven striking performance improvements across a range of language understanding tasks. The GLUE benchmark, introduced a little over one year ago, offers a single-number metric that summarizes progress on a diverse set of such tasks, but performance on the benchmark has recently surpassed the level of non-expert humans, suggesting limited headroom for further research. In this paper we present SuperGLUE, a new benchmark styled after GLUE with a new set of more difficult language understanding tasks, a software toolkit, and a public leaderboard. SuperGLUE is available at super.gluebenchmark.com.
