首页> 外文会议>International conference on theory and practice of digital libraries >What Should I Cite? Cross-Collection Reference Recommendation of Patents and Papers
【24h】

What Should I Cite? Cross-Collection Reference Recommendation of Patents and Papers

机译:我应该引用什么?专利和论文交叉引用参考建议

获取原文

摘要

Research results manifest in large corpora of patents and scientific papers. However, both corpora lack a consistent taxonomy and references across different document types are sparse. Therefore, and because of contrastive, domain-specific language, recommending similar papers for a given patent (or vice versa) is challenging. We propose a recommender system that leverages topic distributions and keywords to recommend related work despite these challenges. As a case study, we evaluate our approach on patents and papers of two fields: medical and computer science. We find that topic-based recommenders complement word-based recommenders for documents with collection-specific language and increase mean average precision by up to 27%. As a result of our work, publications from both corpora form a joint digital library, which connects academia and industry.
机译:研究结果体现在大量专利和科学论文中。但是,两个语料库都缺乏一致的分类法,并且跨不同文档类型的引用很少。因此,并且由于对比性的,领域特定的语言,为给定专利推荐类似的论文(反之亦然)是具有挑战性的。我们提出了一种推荐系统,尽管存在这些挑战,但该系统可以利用主题分布和关键字来推荐相关工作。作为案例研究,我们评估了我们在两个领域的专利和论文上的研究方法:医学和计算机科学。我们发现,基于主题的推荐者可以使用特定于集合的语言来补充文档的基于单词的推荐者,并且可以将平均平均精度提高多达27%。由于我们的工作,两个语料库的出版物形成了一个联合数字图书馆,该图书馆将学术界和工业界联系在一起。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号