首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Identification of Tasks, Datasets, Evaluation Metrics, and Numeric Scores for Scientific Leaderboards Construction
【24h】

Identification of Tasks, Datasets, Evaluation Metrics, and Numeric Scores for Scientific Leaderboards Construction

机译:识别科学排行榜结构的任务,数据集,评估指标和数字分数

获取原文

摘要

While the fast-paced inception of novel tasks and new datasets helps foster active research in a community towards interesting directions, keeping track of the abundance of research activity in different areas on different datasets is likely to become increasingly difficult. The community could greatly benefit from an automatic system able to summarize scientific results, e.g., in the form of a leaderboard. In this paper we build two datasets and develop a framework (TDMS-IE) aimed at automatically extracting task, dataset, metric and score from NLP papers, towards the automatic construction of leaderboards. Experiments show that our model outperforms several baselines by a large margin. Our model is a first step towards automatic leaderboard construction, e.g., in the NLP domain.
机译:虽然快节奏的新型任务和新数据集的成立有助于促进在社区实现有趣方向的积极研究,但是跟踪不同数据集不同区域的丰富研究活动可能会越来越困难。社区可以从能够总结科学效果的自动系统大大受益,例如,以排行榜的形式总结。在本文中,我们构建了两个数据集,并开发了一个框架(TDMS-IE),旨在自动提取从NLP论文的任务,数据集,度量标准和分数,朝向自动构建排行榜。实验表明,我们的模型优于几个基线的大幅度。我们的模型是迈向NLP域中的自动排行板结构的第一步。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号