首页> 外文会议>Workshop on fact extraction and VERification >Interactive Evidence Detection: train state-of-the-art model out-of-domain or simple model interactively?
【24h】

Interactive Evidence Detection: train state-of-the-art model out-of-domain or simple model interactively?

机译:交互式证据检测:跨领域训练最新模型或简单地交互模型?

获取原文

摘要

Finding evidence is of vital importance in research as well as fact checking and an evidence detection method would be useful in speeding up this process. However, when addressing a new topic there is no training data and there are two approaches to get started. One could use large amounts of out-of-domain data to train a state-of-the-art method, or to use the small data that a person creates while working on the topic. In this paper, we address this problem in two steps. First, by simulating users who read source documents and label sentences they can use as evidence, thereby creating small amounts of training data for an interactively trained evidence detection model; and second, by comparing such an interactively trained model against a pre-trained model that has been trained on large out-of-domain data. We found that an interactively trained model not only often out-performs a state-of-the-art model but also requires significantly lower amounts of computational resources. Therefore, especially when computational resources are scarce, e.g. no GPU available, training a smaller model on the fly is preferable to training a well generalising but resource hungry out-of-domain model.
机译:查找证据在研究以及事实检查中都至关重要,而证据检测方法将有助于加快这一过程。但是,在解决新主题时,没有培训数据,并且有两种入门方法。一个人可以使用大量的域外数据来训练一种最新的方法,或者使用一个人在处理该主题时创建的少量数据。在本文中,我们分两个步骤解决了这个问题。首先,通过模拟阅读源文档和标记句子的用户,他们可以将其用作证据,从而为交互式训练的证据检测模型创建少量的训练数据;其次,通过将这种交互式训练的模型与已经针对大型域外数据进行训练的预训练模型进行比较。我们发现,经过交互训练的模型不仅经常胜过最新模型,而且所需的计算资源也大大减少。因此,尤其是在计算资源稀缺时,例如在没有GPU可用的情况下,动态地训练较小的模型比训练通用性强但资源匮乏的域外模型更可取。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号