Journal: International Journal on Digital Libraries

Bag of works retrieval: TF*IDF weighting of works co-cited with a seed



Abstract

Although not presently possible in any system, the style of retrieval described here combines familiar components—co-citation linkages of documents and TF*IDF weighting of terms—in a way that could be implemented in future databases. Rather than entering keywords, the user enters a string identifying a work—a seed—to retrieve the strings identifying other works that are co-cited with it. Each of the latter is part of a “bag of works,” and it presumably has both a co-citation count with the seed and an overall citation count in the database. These two counts can be plugged into a standard formula for TF*IDF weighting such that all the co-cited items can be ranked for relevance to the seed, given that the entire retrieval is relevant to it by evidence from multiple co-citing authors. The result is analogous to, but different from, traditional “bag of words” retrieval, which it supplements. Some properties of the ranking are illustrated by works co-cited with three seeds: an article on search behavior, an information retrieval textbook, and an article on centrality in networks. While these are case studies, their properties apply to bag of works retrievals in general and have implications for users (e.g., humanities scholars, domain analysts) that go beyond any one example.
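The abstract does not spell out which TF*IDF formula is used, so the following is only a minimal sketch under stated assumptions: the co-citation count with the seed plays the role of term frequency and is log-scaled, and the overall citation count plays the role of document frequency in a logarithmic IDF. The function name, the parameter `total_docs` (the number of documents in the database), and the sample counts are all hypothetical and are not taken from the article.

```python
import math


def rank_bag_of_works(cocited, total_docs):
    """Rank works co-cited with a seed by a TF*IDF-style weight.

    cocited: dict mapping a work's identifying string to a tuple
             (co-citation count with the seed, overall citation count).
    total_docs: assumed number of documents in the database.

    Assumed variant (not confirmed by the abstract):
        weight = (1 + log10(cocitations)) * log10(total_docs / citations)
    """
    ranked = []
    for work, (cocitations, citations) in cocited.items():
        if cocitations <= 0 or citations <= 0:
            continue  # no evidence linking this work to the seed
        tf = 1 + math.log10(cocitations)          # dampened co-citation count
        idf = math.log10(total_docs / citations)  # penalizes widely cited works
        ranked.append((work, tf * idf))
    # Highest weight first: most specifically related to the seed.
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


# Hypothetical counts, chosen only to illustrate the ranking behavior.
example = {
    "Work A": (100, 1_000),    # often co-cited, moderately cited overall
    "Work B": (100, 100_000),  # often co-cited, but cited everywhere
    "Work C": (10, 100),       # rarely co-cited, but rarely cited at all
}
ranking = rank_bag_of_works(example, total_docs=1_000_000)
```

With these made-up counts, Work B falls to the bottom even though its co-citation count matches Work A's, because its high overall citation count makes the co-citation less specific to the seed. That is the same effect IDF has on common words in bag of words retrieval.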
