首页> 外文会议>ACM SIGKDD international conference on Knowledge discovery in data mining >SVM selective sampling for ranking with application to data retrieval
【24h】

SVM selective sampling for ranking with application to data retrieval

机译:支持向量机的选择性采样,用于数据检索排名

获取原文

摘要

Learning ranking (or preference) functions has been a major issue in the machine learning community and has produced many applications in information retrieval. SVMs (Support Vector Machines) - a classification and regression methodology - have also shown excellent performance in learning ranking functions. They effectively learn ranking functions of high generalization based on the "large-margin" principle and also systematically support nonlinear ranking by the "kernel trick". In this paper, we propose an SVM selective sampling technique for learning ranking functions. SVM selective sampling (or active learning with SVM) has been studied in the context of classification. Such techniques reduce the labeling effort in learning classification functions by selecting only the most informative samples to be labeled. However, they are not extendable to learning ranking functions, as the labeled data in ranking is relative ordering, or partial orders of data. Our proposed sampling technique effectively learns an accurate SVM ranking function with fewer partial orders. We apply our sampling technique to the data retrieval application, which enables fuzzy search on relational databases by interacting with users for learning their preferences. Experimental results show a significant reduction of the labeling effort in inducing accurate ranking functions.
机译:学习排名(或偏好)功能一直是机器学习社区中的一个主要问题,并已在信息检索中产生了许多应用。 SVM(支持向量机)-一种分类和回归方法-在学习排名函数方面也表现出出色的性能。他们基于“大利润”原理有效地学习了高泛化的排名功能,并通过“内核技巧”系统地支持了非线性排名。在本文中,我们提出了一种用于学习排序函数的SVM 选择性采样技术。在分类的背景下,研究了SVM选择性采样(或使用SVM进行主动学习)。这样的技术通过仅选择要标记的信息最多的样本来减少学习分类函数中的标记工作。但是,它们不能扩展到学习排名函数,因为排名中的标记数据是数据的相对顺序或部分顺序。我们提出的采样技术可以有效地学习具有较少偏序的准确SVM 排名函数。我们将采样技术应用于数据检索应用程序,该应用程序通过与用户进行交互以学习他们的偏好,从而实现对关系数据库的模糊搜索。实验结果表明,在诱导准确的排名函数中,标记工作量显着减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号