Excavating the mother lode of human-generated text: A systematic review of research that uses the Wikipedia corpus

Mohamad Mehdi; Chitu Okoli; Mostafa Mesgari; Finn Arup Nielsen; Arto Lanamaeki


Abstract

Although primarily an encyclopedia, Wikipedia's expansive content provides a knowledge base that has been continuously exploited by researchers in a wide variety of domains. This article systematically reviews the scholarly studies that have used Wikipedia as a data source, and investigates the means by which Wikipedia has been employed in three main computer science research areas: information retrieval, natural language processing, and ontology building. We report and discuss the research trends of the identified and examined studies. We further identify and classify a list of tools that can be used to extract data from Wikipedia, and compile a list of currently available data sets extracted from Wikipedia.


Bibliographic details

Source: Information Processing & Management, 2017, Issue 2, pp. 505-529 (25 pages)
Authors: Mohamad Mehdi; Chitu Okoli; Mostafa Mesgari; Finn Arup Nielsen; Arto Lanamaeki
Author affiliations:

Computer Science, Concordia University, Montreal, Canada;

John Molson School of Business, Concordia University, Montreal, Canada;

Love School of Business, Elon University, Elon, NC, USA;

DTU Compute, Technical University of Denmark, DK-2800 Kongens Lyngby, Denmark;

Interact research unit, University of Oulu, Oulu, Finland;

Original format: PDF
Language: English
Keywords:
Information retrieval; Information extraction; Natural language processing; Ontologies; Wikipedia; Literature review;


Similar literature

1. Wikipedia in the Eyes of Its Beholders: A Systematic Review of Scholarly Research on Wikipedia Readers and Readership [J]. Chitu Okoli, Mostafa Mesgari, Mohamad Mehdi. Journal of the American Society for Information Science and Technology. 2014, Issue 12

2. Eight in Every 10 Abstracts of Low Back Pain Systematic Reviews Presented Spin and Inconsistencies With the Full Text: An Analysis of 66 Systematic Reviews [J]. Nascimento Dafne Port, Gonzalez Gabrielle Zoldan, Araujo Amanda Costa. The Journal of Orthopaedic and Sports Physical Therapy. 2020, Issue 1

3. Using text mining for study identification in systematic reviews: a systematic review of current approaches [J]. Alison O’Mara-Eves, James Thomas, John McNaught. Systematic Reviews. 2015, Issue 1

4. Text-Mining Techniques and Tools for Systematic Literature Reviews: A Systematic Literature Review [C]. Luyi Feng, Yin Kia Chiam, Sin Kuang Lo. Asia-Pacific Software Engineering Conference. 2017

5. Text classification on imbalanced data: Application to systematic reviews automation [D]. Ma, Yimin. 2007

6. Hitting the mother lode of tumor angiogenesis [O]. Daylon James, Sina Rabbany, Shahin Rafii

7. Excavating the mother lode of human-generated text: A systematic review of research that uses the Wikipedia corpus [O]. Mehdi, Mohamad, Okoli, Chitu, Mesgari, Mostafa. 2018
