首页> 外文会议>2017 IEEE Technology amp; Engineering Management Conference >A topic model analysis of science and technology linkages: A case study in pharmaceutical industry
【24h】

A topic model analysis of science and technology linkages: A case study in pharmaceutical industry

机译:科技联系的主题模型分析:以制药业为例

获取原文
获取原文并翻译 | 示例

摘要

Science and technology (S&T) linkages have been studied extensively using patent and scientific publication databases. Existing methods used to track S&T linkages, such as analysis of non-patent literature (NPL) or author-inventor matching offer a narrow window for industry level analysis of the data. This paper examines the application of a machine learning algorithm, namely Latent Dirichlet Allocation, to detect the semantic relationship between patent and scientific publication corpus. The case of “Taxol”, a cancer drug, is used to illustrate the performance of the unsupervised algorithm in clustering documents with similar topics. In total 26 475 documents retrieved from the Europe PMC database was used a sample for the analysis. Qualitative analysis of the clusters shows that the topic clustering algorithm is valuable approach in detection of patent and publication linkage.
机译:使用专利和科学出版物数据库对科学与技术(S&T)链接进行了广泛的研究。用于跟踪S&T链接的现有方法,例如非专利文献(NPL)的分析或作者与发明人的匹配,为数据的行业水平分析提供了一个狭窄的窗口。本文探讨了机器学习算法(即潜在狄利克雷分配)在检测专利与科学出版物语料库之间的语义关系中的应用。以“ Taxol”(一种抗癌药)为例,说明了无监督算法在具有相似主题的文档聚类中的性能。从欧洲PMC数据库检索的总共26 475个文档被用作分析样本。对聚类的定性分析表明,主题聚类算法是检测专利和出版物链接的有价值的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号