...
首页> 外文期刊>Journal of the American Society for Information Science and Technology >Predicting the Impact of Scientific Concepts Using Full-Text Features
【24h】

Predicting the Impact of Scientific Concepts Using Full-Text Features

机译:使用全文功能预测科学概念的影响

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

New scientific concepts, interpreted broadly, are continuously introduced in the literature, but relatively few concepts have a long-term impact on society. The identification of such concepts is a challenging prediction task that would help multiple parties-including researchers and the general public-focus their attention within the vast scientific literature. In this paper we present a system that predicts the future impact of a scientific concept, represented as a technical term, based on the information available from recently published research articles. We analyze the usefulness of rich features derived from the full text of, the articles through a variety of approaches, including rhetorical sentence analysis, information extraction, and time-series analysis. The results from two large-scale experiments with 3.8 million full-text articles and 48 million metadata records support the conclusion that full-text features are significantly more useful for prediction than metadata-only features and that the most accurate predictions result from combining the metadata and full-text features. Surprisingly, these results hold even when the metadata features are available for a much larger number of documents than are available for the full-text features.
机译:在文献中不断引入了广泛解释的新科学概念,但是相对较少的概念会对社会产生长期影响。对这些概念的识别是一项具有挑战性的预测任务,它将帮助包括研究人员和公众在内的多个方面将他们的注意力集中在庞大的科学文献中。在本文中,我们基于最近发表的研究文章中提供的信息,提供了一个预测科学概念(以技术术语表示)的未来影响的系统。我们通过各种方法(包括修辞句分析,信息提取和时间序列分析)来分析从文章全文中获得的丰富功能的有用性。两项具有380万篇全文文章和4800万条元数据记录的大规模实验的结果支持以下结论:全文特征比纯元数据的特征对预测有用得多,并且最准确的预测是通过组合元数据得出的和全文功能。令人惊讶的是,即使元数据功能可用于的文档数量比全文功能大得多,这些结果仍然成立。

著录项

  • 来源
  • 作者单位

    Department of Computer Science, 1214 Amsterdam Avenue, Columbia University, New York, NY 10027;

    Computer Science, University of Maryland, College Park, MD 20742, USA;

    Department of Computer Science, 3258, A. V. Williams Building, University of Maryland, College Park, MD 20742, USA;

    Department of Computer Science, 1214 Amsterdam Avenue, Columbia University, New York, NY 10027, USA;

    Department of Computer Science, 1214 Amsterdam Avenue, Columbia University, New York, NY 10027, USA;

    Department of Computer Science, 1214 Amsterdam Avenue, Columbia University, New York, NY 10027, USA;

    Department of Computer Science, 1214 Amsterdam Avenue, Columbia University, New York, NY 10027, USA;

    Department of Computer Science, 1214 Amsterdam Avenue, Columbia University, New York, NY 10027, USA;

    Department of Computer Science, 1214 Amsterdam Avenue, Columbia University, New York, NY 10027, USA;

    The University of Texas at Austin School of Information, 1616 Guadalupe Suite #5.202, Austin, TX 78701-1213, USA;

    Department of Computer Science, 1214 Amsterdam Avenue, Columbia University, New York, NY 10027, USA;

    2260 Hayward Street, University of Michigan, Ann Arbor, MI 48109, USA;

    Ubiquiti, Inc., 303 Detroit Street, Suite #202, Ann Arbor, MI 48109, USA;

    Department of Computer Science, Rutgers, The State University of New Jersey, 110 Frelinghuysen Road, Piscataway, NJ 08854-8019, USA;

    IBM T.J. Watson Research Center, 1101 Kitchawan Road, Yorktown Heights, NY 10598, USA;

    College of Information and Computer Sciences, 140 Governors Drive, Amherst, MA 01002, USA;

    University of Cambridge Computer Laboratory, 15 JJ Thomson Avenue, Cambridge CB3 0FD, UK;

    Department of Electrical Engineering and Computer Science, School of Information, Department of Linguistics,University of Michigan, Ann Arbor, MI 48109, USA;

    School of Information, University of Texas at Austin, 1616 Guadalupe Suite #5.202, Austin, TX 78701, USA;

    University of Cambridge Computer Laboratory, 15 JJ Thomson Avenue, Cambridge CB3 0FD, UK;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号