A Simple Kernel Co-Occurrence-Based Enhancement for Pseudo-Relevance Feedback

Min Pan; Jimmy Xiangji Huang; Tingting He; Zhiming Mao; Zhiwei Ying; Xinhui Tu

Abstract

In this article, we propose a kernel co-occurrence-based framework in which term co-occurrence information is integrated into the classic Rocchio model and a relevance model (RM3) to enhance retrieval performance. When selecting and weighting candidate terms from feedback documents, we use a linear combination of model components to balance the influence of the classic models against that of models that capture term co-occurrence information. This yields two kernel co-occurrence-based methods, KRoc and KRM3. In particular, to make better use of term co-occurrence information, we incorporate it into the whole term weight formula by simultaneously refining both the term discriminating power factor and the within-document term weight factor in feedback documents. The experimental results show that the proposed KRoc and KRM3 methods are effective and outperform the corresponding strong baseline methods in terms of MAP and P@10 on most of the test collections. Moreover, the proposed methods are at least comparable to the state-of-the-art TF-PRF, PRoc, IF&FB, and MRF models. Additionally, we carefully analyze the influence of σ on the proposed KRoc and KRM3 methods and suggest an empirical rule for setting this parameter to achieve good performance.
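The abstract does not give the term weight formulas, so the following Python snippet is only a minimal sketch of the general idea of linearly combining a classic frequency-based term weight with a kernel-weighted co-occurrence score. It assumes a Gaussian kernel over token distance, with σ as its width. The function names, the interpolation parameter `lam`, and the max-normalization are all hypothetical choices made for illustration, not the KRoc or KRM3 definitions from the paper.

```python
import math
from collections import Counter


def gaussian_kernel(distance, sigma):
    """Proximity weight that decays with token distance (assumed kernel shape)."""
    return math.exp(-(distance ** 2) / (2 * sigma ** 2))


def cooccurrence_scores(doc_tokens, query_terms, sigma):
    """Sum kernel-weighted co-occurrences of each non-query term with query terms."""
    query_positions = [i for i, tok in enumerate(doc_tokens) if tok in query_terms]
    scores = Counter()
    for i, tok in enumerate(doc_tokens):
        if tok in query_terms:
            continue
        for q in query_positions:
            scores[tok] += gaussian_kernel(abs(i - q), sigma)
    return scores


def normalize(scores):
    """Scale scores into [0, 1] by dividing by the largest value."""
    top = max(scores.values(), default=0.0)
    return {t: (v / top if top > 0 else 0.0) for t, v in scores.items()}


def combined_term_weights(feedback_docs, query_terms, sigma=50.0, lam=0.5):
    """Interpolate a term-frequency weight with a kernel co-occurrence weight.

    lam = 0 keeps only the frequency component; lam = 1 keeps only the
    co-occurrence component. Both parameters are illustrative assumptions.
    """
    query_terms = set(query_terms)
    tf = Counter()
    co = Counter()
    for doc in feedback_docs:
        tf.update(tok for tok in doc if tok not in query_terms)
        co.update(cooccurrence_scores(doc, query_terms, sigma))
    tf_n, co_n = normalize(tf), normalize(co)
    return {t: (1 - lam) * tf_n[t] + lam * co_n.get(t, 0.0) for t in tf_n}
```

In this sketch, a small σ rewards only candidate terms that appear very close to a query term, while a large σ flattens the kernel so that the score approaches plain co-occurrence counting within the document.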

Bibliographic Details

Source
Journal of the American Society for Information Science and Technology, 2020, Issue 3, pp. 264-281 (18 pages)
Authors
Min Pan; Jimmy Xiangji Huang; Tingting He; Zhiming Mao; Zhiwei Ying; Xinhui Tu
Author affiliations

Information Retrieval and Knowledge Management Research Lab, National Engineering Research Center for E-Learning, Central China Normal University, Wuhan, China, and School of Computer and Information Engineering, Hubei Normal University, Huangshi, China, and Information Retrieval and Knowledge Management Research Lab, School of Information Technology, York University, Toronto, ON, Canada;

Information Retrieval and Knowledge Management Research Lab, School of Information Technology, York University, Toronto, ON, Canada;

Information Retrieval and Knowledge Management Research Lab, School of Computer, Central China Normal University, Wuhan, China;

School of Computer and Information Engineering, Hubei Normal University, Huangshi, China, and Information Retrieval and Knowledge Management Research Lab, School of Computer, Central China Normal University, Wuhan, China;

Information Retrieval and Knowledge Management Research Lab, School of Information Technology, York University, Toronto, ON, Canada, and Information Retrieval and Knowledge Management Research Lab, School of Information Management, Central China Normal University, Wuhan, China;

Original format: PDF
Language: English
Added to database: 2022-08-18 05:24:19
