Published in: Journal of the Indian Institute of Science

A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases

Abstract

This short paper describes a web resource, the NIST CORD-19 Web Resource, for community exploration of the COVID-19 Open Research Dataset (CORD-19). The exploration tools in the web resource use the NIST-developed Root- and Rule-based method, which exploits underlying linguistic structures to create terms that represent phrases in a corpus. The method supports auto-suggesting related terms, which helps users discover terms to refine a search of the heterogeneous COVID-19 document base. The method also produces taxonomic structures in the target domain and provides semantic information about the relationships between terms. This term structure can serve as a basis for creating topic modeling and trend analysis tools. In this paper, we describe the use of a novel search engine to demonstrate some of the capabilities above.
