首页> 外文会议>Advances in Information Retrieval >Evaluating Text Representations for Retrieval of the Best Group of Documents

【24h】

Evaluating Text Representations for Retrieval of the Best Group of Documents

机译：评估文本表示形式以检索最佳文档组

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Cluster retrieval assumes that the probability of relevance of a document should depend on the relevance of other similar documents to the same query. The goal is to find the best group of documents. Many studies have examined the effectiveness of this approach, by employing different retrieval methods or clustering algorithms, but few have investigated text representations. This paper revisits the problem of retrieving the best group of documents, from the language-modeling perspective. We analyze the advantages and disadvantages of a range of representation techniques, derive features that characterize the good document groups, and experiment with a new probabilistic representation as a first step toward incorporating these features. Empirical evaluation demonstrates that the relationship between documents can be leveraged in retrieval when a good representation technique is available, and that retrieving the best group of documents can be more effective than retrieving individual documents.

机译：聚类检索假设文档的相关概率应取决于其他相似文档与同一查询的相关性。目的是找到最佳的文档组。许多研究已经通过采用不同的检索方法或聚类算法来检验了这种方法的有效性，但是很少研究文本表示。本文从语言建模的角度重新审视了检索最佳文档组的问题。我们分析了各种表示技术的优缺点，得出了表征良好文档组的特征，并尝试了一种新的概率表示，作为并入这些功能的第一步。经验评估表明，当可以使用良好的表示技术时，可以利用文档之间的关系来进行检索，并且检索最佳的文档组比检索单个文档更有效。

著录项

来源
《Advances in Information Retrieval》|2008年|P.454-462|共9页
会议地点 Glasgow(GB);Glasgow(GB)
作者
Xiaoyong Liu; W. Bruce Croft;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息处理（信息加工）;
关键词
text representation; document retrieval; cluster retrieval; cluster representation; representation techniques;

机译：文本表示;文档检索;集群检索;集群表示;表示技术;

相似文献

外文文献
中文文献
专利

1. Tret: A Text Retrieval Efficiency Testing Tool For Different Document Types/Formats And Calculating Evaluation Measures For Xml Retrieval [J] . Guozhen Cheng Advances in computational sciences and technology . 2018,第3期

机译：Tret：一种用于不同文档类型/格式的文本检索效率测试工具，并为Xml检索计算评估方法
2. An Event Graph Based Document Representation for Information Retrieval and Summarazing the Text Based on Events [J] . P. Janarthanan, V. Ramachandran Asian Journal of Information Technology . 2016,第18期

机译：基于事件图的文档表示，用于信息检索和基于事件的文本摘要
3. REPRESENTING TEXT DOCUMENTS IN TRAINING DOCUMENT SPACES: A NOVEL MODEL FOR DOCUMENT REPRESENTATION [J] . ASMAA MOUNTASSIR, HOUDA BENBRAHIM, ILHAM BERRADA Journal of Theoretical and Applied Information Technology . 2013,第1期

机译：训练文档空间中的文本文档表示：一种新的文档表示模型
4. Evaluating Text Representations for Retrieval of the Best Group of Documents [C] . Xiaoyong Liu, W. Bruce Croft European Conference on IR Research . 2008

机译：评估用于检索最佳文件的文本表示
5. Evaluation of text-based and image-based representations for moving image documents. [D] . Goodrum, Abby Ann. 1997

机译：评估运动图像文档的基于文本和基于图像的表示形式。
6. Free-text medical document retrieval via phrase-based vector space model. [O] . Wenlei Mao, Wesley W. Chu 2002

机译：通过基于短语的向量空间模型检索自由文本医学文献。
7. Evaluating text representations for retrieval of the best group of documents [O] . Xiaoyong Liu, W. Bruce Croft 2008

机译：评估文本表示以检索最佳文档组

Evaluating Text Representations for Retrieval of the Best Group of Documents

摘要

著录项

相似文献

相关主题

期刊订阅