COPER: a Query-Adaptable Semantics-based Search Engine for Persian COVID-19 Articles

机译：Coper：Persian Covid-19文章的基于查询适应的语义搜索引擎

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the surge of pretrained language models, a new pathway has been opened to incorporate Persian text contextual information. Meanwhile, as many other countries, including Iran, are fighting against COVID-19, a plethora of COVID-19 related articles has been published in Iranian Healthcare magazines to better inform the public of the situation. However, finding answers in this sheer volume of information is an extremely difficult task. In this paper, we collected a large dataset of these articles, leveraged different BERT variations as well as other keyword models such as BM25 and TF-IDF, and created a search engine to sift through these documents and rank them, given a user's query. Our final search engine consists of a ranker and a re-ranker, which adapts itself to the query. We fine-tune our models using Semantic Textual Similarity and evaluate them with standard task metrics. Our final method outperforms the rest by a considerable margin.

机译：随着预先预用的语言模型的激增，已打开了一种新的途径来包含波斯文本上下文信息。与此同时，随着包括伊朗在内的许多其他国家，正在反对Covid-19，一流的Covid-19相关文章已在伊朗医疗保健杂志上发表，以便更好地通知公众情况。但是，在这个纯粹的信息卷中找到答案是一项非常艰巨的任务。在本文中，我们收集了这些文章的大型数据集，利用了不同的BERT变体以及其他关键字模型，如BM25和TF-IDF，并创建了一个搜索引擎以筛选通过这些文档并为其进行排序，给定用户查询。我们的最终搜索引擎包括一个Ranker和Re-Ranker，它适应查询。我们使用语义文本相似性微调我们的模型，并使用标准任务指标进行评估。我们的最终方法通过相当的边缘表现出剩下的余地。

著录项

来源
《International Conference on Web Research》|2021年|64-70|共7页
会议地点
作者
Reza Khanmohammadi; Mitra Sadat Mirshafiee; Mehdi Allahyari;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
COVID-19; Training; Adaptation models; Computational modeling; Semantics; Search engines; Task analysis;

机译：covid-19;培训;适应模型;计算建模;语义;搜索引擎;任务分析;
入库时间 2022-08-26 13:56:35

相似文献

外文文献
中文文献
专利

1. Practical Guideline for Screening the Patients with SARS-CoV-2 Infection and Persian Gulf Criteria for Diagnosis of COVID-19 [J] . Iraj Salehi-Abari, Shabnam Khazaeli, Fardin Salehi-Abari, 传染病进展（英文） . 2020,第003期
2. Practical Guideline for Screening the Patients with SARS-CoV-2 Infection and Persian Gulf Criteria for Diagnosis of COVID-19 [J] . Iraj Salehi-Abari, Shabnam Khazaeli, Fardin Salehi-Abari, 传染病进展（英文） . 2020,第0S1期
3. Image search and retrieval problems in web search engines: A case study of Persian language writing style challenges [J] . Norouzi Yaghoub, Homavandi Hoda Online Information Review . 2018,第6期

机译：网络搜索引擎中的图像搜索和检索问题：波斯语写作风格挑战的案例研究
4. Ranking of Wikipedia Articles in Search Engines Revisited: Fair Ranking for Reasonable Quality? [J] . Dirk Lewandowski, Ulrike Spree Journal of the American Society for Information Science and Technology . 2011,第1期

机译：维基百科文章在搜索引擎中的排名再访：合理质量的合理排名？
5. A comparative study of the origin, structure, and indexing language of the Persian and English keywords of articles indexed in the IranMedex database and their compliance with the Persian medical thesaurus and Medical Subject Headings [J] . Parastoo Parsaei-Mohammadi, Ali Hossein Ghasemi, Raziyeh Hassanzadeh-Beheshtabad Journal of Education and Health Promotion . 2017,第1期

机译：对在伊朗Medex数据库中被索引的文章的波斯和英语关键字的来源，结构和索引语言以及它们与波斯医学词库和医学主题词的依从性进行比较研究
6. How questions are posed to a search engine? An empiricial analysis of question queries in a large scale Persian search engine log [C] . Mohammad Sadegh Zahedi, Behrouz Mansouri, Shiva Moradkhani, International Conference on Web Research . 2017

机译：如何向搜索引擎提出问题？大型波斯搜索引擎日志中问题查询的实证分析
7. An Application of Sentiment Analysis with Transformer Models on Online News Articles Covering the Covid-19 Pandemic [D] . Asthana, Prakul. 2021

机译：在覆盖Covid-19大流行的在线新闻文章中的情感模型的情感分析
8. Contradictions in the promotion of publishing academic and scientific journal articles and the inability to cope with the new coronavirus (COVID-19) [O] . Rostam Jalali, Amin Hosseinian-Far, Masoud Mohammadi 2021

机译：促进出版学术和科学期刊文章的矛盾无法应对新的冠状病毒（Covid-19）
9. Metadiscursive Distinction between Persian and English: An Analysis of Computer Engineering Research Articles [O] . Sara Mansoori, Gholam Reza Zarei 2011

机译：波斯语与英语之间的元话语区分：计算机工程研究论文分析
10. Engineering Approach to Atomic Transaction Verification: Use of a Simple Object211 Model to Achieve Semantics-Based Reasoning at Compile-Time [R] . Spelt, D., Even, S. 1998

机译：原子事务验证的工程方法：使用简单的Object211模型在编译时实现基于语义的推理

COPER: a Query-Adaptable Semantics-based Search Engine for Persian COVID-19 Articles

摘要

著录项

相似文献

相关主题

期刊订阅