首页> 外文会议>International Conference on Web Research >COPER: a Query-Adaptable Semantics-based Search Engine for Persian COVID-19 Articles
【24h】

COPER: a Query-Adaptable Semantics-based Search Engine for Persian COVID-19 Articles

机译:Coper:Persian Covid-19文章的基于查询适应的语义搜索引擎

获取原文

摘要

With the surge of pretrained language models, a new pathway has been opened to incorporate Persian text contextual information. Meanwhile, as many other countries, including Iran, are fighting against COVID-19, a plethora of COVID-19 related articles has been published in Iranian Healthcare magazines to better inform the public of the situation. However, finding answers in this sheer volume of information is an extremely difficult task. In this paper, we collected a large dataset of these articles, leveraged different BERT variations as well as other keyword models such as BM25 and TF-IDF, and created a search engine to sift through these documents and rank them, given a user's query. Our final search engine consists of a ranker and a re-ranker, which adapts itself to the query. We fine-tune our models using Semantic Textual Similarity and evaluate them with standard task metrics. Our final method outperforms the rest by a considerable margin.
机译:随着预先预用的语言模型的激增,已打开了一种新的途径来包含波斯文本上下文信息。 与此同时,随着包括伊朗在内的许多其他国家,正在反对Covid-19,一流的Covid-19相关文章已在伊朗医疗保健杂志上发表,以便更好地通知公众情况。 但是,在这个纯粹的信息卷中找到答案是一项非常艰巨的任务。 在本文中,我们收集了这些文章的大型数据集,利用了不同的BERT变体以及其他关键字模型,如BM25和TF-IDF,并创建了一个搜索引擎以筛选通过这些文档并为其进行排序,给定用户查询。 我们的最终搜索引擎包括一个Ranker和Re-Ranker,它适应查询。 我们使用语义文本相似性微调我们的模型,并使用标准任务指标进行评估。 我们的最终方法通过相当的边缘表现出剩下的余地。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号