首页> 外文期刊>Information Processing & Management >Search task success evaluation by exploiting multi-view active semi-supervised learning
【24h】

Search task success evaluation by exploiting multi-view active semi-supervised learning

机译:利用多视图主动半监督学习进行搜索任务成功评估

获取原文
获取原文并翻译 | 示例
       

摘要

Search task success rate is an important indicator to measure the performance of search engines. In contrast to most of the previous approaches that rely on labeled search tasks provided by users or third-party editors, this paper attempts to improve the performance of search task success evaluation by exploiting unlabeled search tasks that are existing in search logs as well as a small amount of labeled ones. Concretely, the Multi-view Active Semi-Supervised Search task Success Evaluation (MA4SE) approach is proposed, which exploits labeled data and unlabeled data by integrating the advantages of both semi-supervised learning and active learning with the multi-view mechanism. In the semi-supervised learning part of MA4SE, we employ a multi-view semi-supervised learning approach that utilizes different parameter configurations to achieve the disagreement between base classifiers. The base classifiers are trained separately from the predefined action and time views. In the active learning part of MA4SE, each classifier received from semi-supervised learning is applied to unlabeled search tasks, and the search tasks that need to be manually annotated are selected based on both the degree of disagreement between base classifiers and a regional density measurement. We evaluate the proposed approach on open datasets with two different definitions of search tasks success. The experimental results show that MA4SE outperforms the state-of-the-art semi-supervised search task success evaluation approach.
机译:搜索任务成功率是衡量搜索引擎性能的重要指标。与以往大多数依靠用户或第三方编辑者提供的带标签的搜索任务的方法相比,本文试图通过利用搜索日志中存在的未标记的搜索任务以及其他方法来提高搜索任务成功评估的性能。少量标记的。具体而言,提出了多视图主动半监督搜索任务成功评估(MA4SE)方法,该方法通过将半监督学习和主动学习的优势与多视图机制相结合来利用标记数据和未标记数据。在MA4SE的半监督学习部分,我们采用了多视图半监督学习方法,该方法利用不同的参数配置来实现基本分类器之间的分歧。基本分类器与预定义的动作和时间视图分开进行训练。在MA4SE的主动学习部分中,将从半监督学习中收到的每个分类器应用于未标记的搜索任务,并且需要根据基本分类器之间的不一致程度和区域密度测量来选择需要手动注释的搜索任务。我们对具有两个不同搜索任务成功定义的开放数据集评估提出的方法。实验结果表明,MA4SE优于最新的半监督搜索任务成功评估方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号