Relation Based Term Weighting Regularization



Abstract

Traditional retrieval models compute term weights using only information about individual terms, such as TF and IDF. However, query terms are related to one another, and intuitively these relations can provide useful information about the importance of a term in the context of the other query terms. For example, the query "perl tutorial" indicates that the user is looking for information relevant to both "perl" and "tutorial". Thus, a document containing both terms should receive a higher relevance score than a document containing only one of them. However, if the IDF value of "tutorial" is much smaller than that of "perl", existing retrieval models may assign such a document a lower score than documents containing multiple occurrences of "perl". The importance of a term should therefore depend not only on collection statistics but also on its relations with the other query terms. In this work, we study how to use semantic relations among query terms to regularize term weighting. Experimental results on TREC collections show that the proposed strategy effectively improves retrieval performance.
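
The "perl tutorial" problem described in the abstract can be reproduced with a toy scorer. The sketch below is only an illustration: the IDF values, the two documents, and the `score` function (log-scaled TF times IDF) are assumptions made for this example, not the retrieval models or data evaluated in the paper, and the paper's regularization method itself is not shown.

```python
import math

# Hypothetical IDF values, chosen only to make "tutorial" much less
# discriminative than "perl"; they are not taken from any collection.
IDF = {"perl": 5.0, "tutorial": 1.0}


def score(query_terms, doc_tokens, idf=IDF):
    """Sum (1 + ln tf) * idf over the query terms that occur in the document."""
    total = 0.0
    for term in query_terms:
        tf = doc_tokens.count(term)
        if tf > 0:
            total += (1.0 + math.log(tf)) * idf[term]
    return total


query = ["perl", "tutorial"]
# Matches both query terms once each.
doc_both = ["perl", "tutorial", "for", "beginners"]
# Matches only "perl", but four times.
doc_perl_only = ["perl", "perl", "perl", "perl", "release", "notes"]

score_both = score(query, doc_both)
score_perl_only = score(query, doc_perl_only)

# The document covering both query terms is ranked below the one that
# merely repeats the high-IDF term.
print(score_both, score_perl_only, score_both < score_perl_only)
```

With these assumed values, the document that covers both query terms scores lower than the one that repeats "perl", which is the ranking behavior the abstract argues against.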


Bibliographic record

Source
Advances in Information Retrieval | 2012 | pp. 109-120 | 12 pages
Conference location: Barcelona (ES)
Authors
Hao Wu; Hui Fang
Author affiliation

Department of Electrical and Computer Engineering, University of Delaware, USA

Original format: PDF
Language: English
Chinese Library Classification: Information processing

