Extending Weighting Models with a Term Quality Measure

机译：通过术语质量措施扩展加权模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Weighting models use lexical statistics, such as term frequencies, to derive term weights, which are used to estimate the relevance of a document to a query. Apart from the removal of stopwords, there is no other consideration of the quality of words that are being 'weighted'. It is often assumed that term frequency is a good indicator for a decision to be made as to how relevant a document is to a query. Our intuition is that raw term frequency could be enhanced to better discriminate between terms. To do so, we propose using non-lexical features to predict the 'quality' of words, before they are weighted for retrieval. Specifically, we show how parts of speech (e.g. nouns, verbs) can help estimate how informative a word generally is, regardless of its relevance to a query/document. Experimental results with two standard TREC collections show that integrating the proposed term quality to two established weighting models enhances retrieval performance, over a baseline that uses the original weighting models, at all times.

机译：加权模型使用词汇统计（例如术语频率）来导出术语权重，用于估计文档对查询的相关性。除了删除秒表之外，还没有其他考虑“加权”的词语质量。通常假设术语频率是一个良好指标，用于决定如何相关文档对查询进行查询。我们的直觉是可以提高原始术语频率以更好地区分术语。为此，我们建议使用非词汇特征来预测单词的“质量”，然后在加权检索之前。具体而言，我们展示了如何演讲（例如名词，动词）的部分如何帮助估计一般的信息，无论与查询/文档相关如何。具有两个标准TREC集合的实验结果表明，将所提出的术语质量集成到两种既定的加权模型，增强了使用原始加权模型的基线进行检索性能。

著录项

来源
《International Conference on String Processing and Information Retrieval》|2007年||共12页
会议地点
作者
Christina Lioma; Iadh Onnis;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类数据备份与恢复;
关键词

相似文献

外文文献
中文文献
专利

1. Does Weighting Capture What’s Important? Revisiting Subjective Importance Weighting with a Quality of Life Measure [J] . Lara B. Russell, Anita M. Hubley, Anita Palepu, Social Indicators Research . 2006,第1期

机译：权重捕获重要内容吗？运用生活质量量度重新审视主观重要性加权
2. Why are we "weighting"? An assessment of a self-weighting approach to measuring oral health-related quality of life. [J] . McGrath C, Bedi R Community dentistry and oral epidemiology . 2004,第1期

机译：我们为什么要“加权”？对衡量口腔健康相关生活质量的自我加权方法的评估。
3. Combining supervised term-weighting metrics for SVM text classification with extended term representation [J] . Haddoud Mounia, Mokhtari Aicha, Lecroq Thierry, Knowledge and information systems . 2016,第3期

机译：将用于SVM文本分类的监督术语权重度量与扩展术语表示相结合
4. Extending Weighting Models with a Term Quality Measure [C] . Christina Lioma, Iadh Onnis International Conference on String Processing and Information Retrieval(SPIRE 2007); 20071029-31; Santiago(CL) . 2007

机译：用术语质量度量扩展权重模型
5. Multialternative decision field theory model fitting using different measures of attribute weighting [D] . Zhang, Ruohui. 2015

机译：使用不同属性权重度量的多方案决策场理论模型拟合
6. PubMed-supported clinical term weighting approach for improving inter-patient similarity measure in diagnosis prediction [O] . Lawrence WC Chan, Ying Liu, Tao Chan, 2015

机译：PubMed支持的临床术语加权法可改善诊断预测中的患者间相似性度量
7. Extending Weighting Models with a Term Quality Measure [O] . Christina Lioma, Iadh Ounis 2008

机译：使用术语质量测量扩展加权模型
8. Optimal Weighting Function in Water Quality Modeling. [R] . lee, e. stanley misra, p. k. 1974

机译：水质模拟中的最优加权函数。

Extending Weighting Models with a Term Quality Measure

摘要

著录项

相似文献

相关主题

期刊订阅