首页> 美国卫生研究院文献>PLoS Computational Biology >A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts

【2h】

A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts

机译：对1500万篇全文文章中的文本挖掘与相应摘要进行全面定量的比较

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Across academia and industry, text mining has become a popular strategy for keeping up with the rapid growth of the scientific literature. Text mining of the scientific literature has mostly been carried out on collections of abstracts, due to their availability. Here we present an analysis of 15 million English scientific full-text articles published during the period 1823–2016. We describe the development in article length and publication sub-topics during these nearly 250 years. We showcase the potential of text mining by extracting published protein–protein, disease–gene, and protein subcellular associations using a named entity recognition system, and quantitatively report on their accuracy using gold standard benchmark data sets. We subsequently compare the findings to corresponding results obtained on 16.5 million abstracts included in MEDLINE and show that text mining of full-text articles consistently outperforms using abstracts only.

机译：在学术界和工业界，文本挖掘已成为一种与科学文献的快速增长保持一致的流行策略。由于可获得性，科学文献的文本挖掘主要在摘要的集合上进行。在这里，我们对1823年至2016年期间发表的1500万篇英语科学全文文章进行了分析。我们描述了近250年中文章长度和出版物子主题的发展。我们通过使用命名的实体识别系统提取已发布的蛋白质，蛋白质，疾病基因和蛋白质亚细胞相关联，展示了文本挖掘的潜力，并使用黄金标准基准数据集定量报告了其准确性。随后，我们将调查结果与MEDLINE中包含的1,650万个摘要的相应结果进行了比较，结果表明，全文文章的文本挖掘始终优于仅使用摘要的文本挖掘。

著录项

期刊名称 PLoS Computational Biology
作者
David Westergaard; Hans-Henrik Stærfeldt; Christian Tønsberg; Lars Juhl Jensen; Søren Brunak;
展开▼
作者单位

展开▼
年(卷),期 2018(14),2
年度 2018
页码 e1005962
总页数 16
原文格式 PDF
正文语种
中图分类生化遗传学;生化药理学;
关键词
入库时间 2022-08-17 12:18:03

相似文献

外文文献
中文文献
专利

1. A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts [J] . David Westergaard, Hans-Henrik St?rfeldt, Christian T?nsberg, PLoS Computational Biology . 2018,第2期

机译：对1500万篇全文文章中的文本挖掘与相应摘要进行全面，定量的比较
2. Comparison of conference abstracts and presentations with full-text articles in the health technology assessments of rapidly evolving technologies [J] . Health technology assessment: HTA . 2006,第5期

机译：在快速发展的技术的卫生技术评估中比较会议摘要和演示文稿以及全文文章
3. A text-mining system for extracting metabolic reactions from full-text articles [J] . Jan Czarnecki, Irene Nobeli, Adrian M Smith, BMC Bioinformatics . 2012,第1期

机译：用于从全文文章中提取代谢反应的文本挖掘系统
4. Comparison of Full-text Articles and Abstracts for Visual Trend Analytics through Natural Language Processing [C] . Kawa Nazemi, Maike J. Klepsch, Dirk Burkhardt, International Conference Information Visualisation . 2020

机译：通过自然语言处理对视觉趋势分析的全文文章与摘要的比较
5. Sex education programs, motivation, and the seeking of educational versus erotic material: A comparison of abstinence only until marriage and comprehensive programs. [D] . Kleinert, Paul Dale. 2016

机译：性教育计划，动机以及对教育材料和色情材料的追求：仅在婚姻和全面计划之前禁欲的比较。
6. A comparison of the accuracy of clinical decisions based on full-text articles and on journal abstracts alone: a study among residents in a tertiary care hospital [O] . Alvin Marcelo, Alex Gavino, Iris Thiele Isip-Tan, -1

机译：基于全文和仅基于期刊摘要的临床决策准确性的比较：三级医院居民中的一项研究
7. A comparison of the accuracy of clinical decisions based on full-text articles and on journal abstracts alone: a study among residents in a tertiary care hospital [O] . Alvin Marcelo, Alex Gavino, Iris Thiele Isip-Tan, 2012

机译：基于全文文章和杂志上的临床决策准确性的比较：三级护理医院居民的研究
8. Instrumentation and quantitative methods of evaluation. Comprehensive three-year progress report, January 15, 1989-July 15, 1991. [R] . R. N. Beck M. D. Cooper 1991

机译：仪器和定量评估方法。综合三年进度报告，1989年1月15日至1991年7月15日。

A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts

摘要

著录项

相似文献

相关主题

期刊订阅