...
首页> 外文期刊>Computing and informatics >A Double Scoring Method for XML Element Retrieval
【24h】

A Double Scoring Method for XML Element Retrieval

机译:XML元素检索的双评分方法

获取原文
           

摘要

Efficient retrieval of XML elements and documents is essential in the effective application of the XML format. The ranking function BM25F is composed of several document fields with potentially different degrees of importance; these fields are known as selected fields that give substantial improvements over the baseline BM25. The BM25F function has performed well in past evaluations; however, there are issues that require additional attention. In the first instance, which elements should be treated as fields? Secondly, what is an appropriate weight for each field? Previously, document fields were selected manually, and the weight for each chosen field was tuned before being assigned. Two automatic methods are introduced in this paper that enable the extraction of fields in document-centric XML documents and the assignment weights to the selected fields. Our experiments show an improvement of up to 28 % over BM25, and up to 15 % over BM25F at iP[0.01] based on INEX evaluations.
机译:在XML格式的有效应用中,有效检索XML元素和文档至关重要。排名功能BM25F由几个文档字段组成,这些文档字段的重要性可能不同。这些字段称为“选定字段”,相对于基线BM25有了实质性的改进。 BM25F功能在过去的评估中表现良好;但是,有些问题需要进一步注意。首先,应将哪些元素视为字段?其次,每个领域的合适权重是多少?以前,文档字段是手动选择的,并且每个分配的字段的权重在分配之前都经过了调整。本文介绍了两种自动方法,它们可以提取以文档为中心的XML文档中的字段,并为所选字段分配权重。我们的实验显示,根据INEX评估,在iP [0.01]时,比BM25最多提高28%,比BM25F最多提高15%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号