Wikipedia as a Resource for Text Analysis and Retrieval

Abstract

As a counterpart to expert-created knowledge resources such as WordNet or Cyc, non-expert users may collaboratively create large resources of unstructured or semi-structured knowledge, a leading representative of which is Wikipedia. Collectively, articles within Wikipedia form an easily-editable collection, reflecting an ever-growing number of topics of interest to Web users. This tutorial examines the characteristics of Wikipedia relative to other human-curated resources of knowledge, and the role of Wikipedia and resources derived from it in text analysis and in enhancing information retrieval. Applicable text analysis tasks include coreference resolution (Ratinov and Roth, 2012) and word sense and entity disambiguation (Ganea and Hofmann, 2017). More prominently, they include information extraction (Zhu et al., 2019). In information retrieval, a better understanding of the structure and meaning of queries (Hu et al., 2009; Pantel and Fuxman, 2011; Tan et al., 2017) helps in matching queries against documents (Ensan and Bagheri, 2017), clustering search results (Scaiella et al., 2012), answer (Chen et al., 2017) and entity retrieval (Ma et al., 2018), and retrieving knowledge panels for queries asking about popular entities.

Bibliographic details

Source: Annual Meeting of the Association for Computational Linguistics, 2019, pp. 24-24 (1 page)
Author: Marius Pasca
Original format: PDF

Similar literature

1. A semi-explicit short text retrieval method combining Wikipedia features [J]. Pu Li, Tianci Li, Suzhi Zhang. Engineering Applications of Artificial Intelligence. 2020, issue "Sepa".
2. Tag Recommendation for Short Arabic Text by Using Latent Semantic Analysis of Wikipedia [J]. Iyad AlAgha, Yousef Abu-Samra. Jordanian Journal of Computers and Information Technology. 2020, No. 2.
3. Text data management and analysis: a practical introduction to information retrieval and text mining [J]. Fernando Berzal. Computing Reviews. 2017, No. 2.
4. Wikipedia as a Resource for Text Analysis and Retrieval [C]. Marius Pasca. Annual Meeting of the Association for Computational Linguistics. 2019.
5. Leveraging open source web resources to improve retrieval of low text content items [D]. Singhal, Ayush. 2014.
6. Walking across Wikipedia: a scale-free network model of semantic memory retrieval [O]. Graham W. Thompson, Christopher T. Kello. 2014.
7. Wikipedia as a Resource for Text Analysis and Retrieval [O]. Marius Pasca. 2019.
8. Result Diversity and Entity Ranking Experiments: Anchors, Links, Text and Wikipedia [R]. Kaptein, R., Koolen, M., Kamps, J. 2009.
