
Science and Technology Text Mining: Origins of Database Tomography and Multi-Word Phrase Clustering.



Abstract

This report first describes the motivations for co-word analysis in support of research policy formulation and research implementation evaluation. It compares co-word analysis with other co-occurrence techniques, such as co-citation and co-nomination analyses. It then traces the origins of co-word analysis in computational linguistics, describes in detail the development of co-word analysis for research evaluation, and concludes by presenting a new approach to co-word analysis for research evaluation (Database Tomography). The report shows that this new approach, which requires no index or key words but works on the text directly, is a useful tool for scanning large bodies of text. It can identify pervasive thrust areas and their interrelationships, and it serves as a starting point for further in-depth analysis of the text. Its value increases as the text grows in size and as the breadth of topical areas it covers exceeds the expertise of a moderate number of expert panels. A single-link clustering example is shown that represents the first use of multi-word technical phrases in modern clustering. (75 refs.)
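
The abstract does not spell out the report's algorithm, so the following is only an illustrative sketch of the general idea: extract multi-word phrases straight from raw text, count how often pairs of phrases co-occur, and group phrases by single-link clustering. The function names, the use of whole documents as the co-occurrence unit, the fixed phrase length, and the count threshold are all assumptions made for this example, not details taken from the report.

```python
from collections import Counter
from itertools import combinations


def phrases(text, n=2):
    """Return the adjacent n-word phrases of a text, lowercased, in order."""
    words = [w.strip(".,;:()").lower() for w in text.split()]
    words = [w for w in words if w]
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def cooccurrence(docs, n=2):
    """Count, for each unordered phrase pair, the documents containing both.

    Assumption: the co-occurrence unit is a whole document.
    """
    counts = Counter()
    for doc in docs:
        unique = sorted(set(phrases(doc, n)))
        counts.update(combinations(unique, 2))
    return counts


def single_link_clusters(counts, threshold):
    """Group phrases connected by any chain of pairs with count >= threshold.

    This is single-link clustering cut at one similarity level; phrases
    with no qualifying link are left out of the result.
    """
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (a, b), count in counts.items():
        if count >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for x in list(parent):
        groups.setdefault(find(x), set()).add(x)
    return sorted((sorted(g) for g in groups.values()), key=lambda g: g[0])


# Hypothetical three-document corpus, used only to exercise the sketch.
docs = [
    "neural network training",
    "neural network training data",
    "ocean current model",
]
counts = cooccurrence(docs)
clusters = single_link_clusters(counts, threshold=1)
```

Raising `threshold` keeps only the more strongly associated phrase pairs, so fewer and tighter groups remain. A real study would also need stop-word filtering and a normalized association measure, which this sketch omits.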
