...
首页> 外文期刊>Information retrieval >Adapting boosting for information retrieval measures
【24h】

Adapting boosting for information retrieval measures

机译:适应信息检索措施的提升

获取原文
获取原文并翻译 | 示例

摘要

We present a new ranking algorithm that combines the strengths of two previous methods: boosted tree classification, and LambdaRank, which has been shown to be empirically optimal for a widely used information retrieval measure. Our algorithm is based on boosted regression trees, although the ideas apply to any weak learners, and it is significantly faster in both train and test phases than the state of the art, for comparable accuracy. We also show how to find the optimal linear combination for any two rankers, and we use this method to solve the line search problem exactly during boosting. In addition, we show that starting with a previously trained model, and boosting using its residuals, furnishes an effective technique for model adaptation, and we give significantly improved results for a particularly pressing problem in web search-training rankers for markets for which only small amounts of labeled data are available, given a ranker trained on much more data from a larger market.
机译:我们提出了一种新的排序算法,该算法结合了之前两种方法的优势:增强树分类和LambdaRank,根据经验,该算法对于广泛使用的信息检索方法是最佳的。尽管该思想适用于任何弱学习者,但我们的算法基于增强的回归树,并且在训练和测试阶段,与现有技术相比,它的速度明显要快得多,以实现可比的准确性。我们还展示了如何找到任意两个等级的最佳线性组合,并且我们使用这种方法来精确地解决提升过程中的线搜索问题。另外,我们表明,从先前训练的模型开始,并利用其残差进行提振,可以提供一种有效的模型自适应技术,并且针对仅针对小型市场的网络搜索培训排名中的一个特别紧迫的问题,我们给出了明显改善的结果。给定排名者接受来自更大市场的更多数据的培训,可获得大量的标签数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号