首页> 外文会议>International conference on machine learning and data mining >A Bayesian Approach to Sparse Learning-to-Rank for Search Engine Optimization
【24h】

A Bayesian Approach to Sparse Learning-to-Rank for Search Engine Optimization

机译:用于搜索引擎优化的稀疏学习到排名的贝叶斯方法

获取原文

摘要

Search engine optimization (SEO) is the process of affecting the visibility of a web page in the engine's search results. SEO specialists must understand how search engines work and which features of the web-page affect its position in the search results. This paper employs machine learning ranking algorithms to constructing the rank model of a web-search engine. Ranking a set of retrieved documents according to their relevance to a given query has become a popular problem at the intersection of web search, machine learning and information retrieval. Feature selection in learning to rank has recently emerged as a crucial issue. Recent work on ranking, focused on a number of different paradigms, namely, point-wise, pair-wise, and list-wise approaches, for which several preprocessing feature section methods have been proposed. Unfortunately, only a few works have been focused on integrating the feature selection into the learning process and all of these embedded methods are based on l_1 regularization technique. Such type of regularization does not possess many properties, essential for SEO, such as unbiasedness, grouping effect and oracle property. In this paper we suggest a new Bayesian framework for feature selection in learning-to-rank problem. The proposed approach gives the strong probabilistic statement of shrinkage criterion for features selection. The proposed regularization is unbiased, has grouping and oracle properties, its maximal risk diverges to finite value. Experimental results show that the proposed framework is competitive on both artificial data and publicly available LETOR data sets.
机译:搜索引擎优化(SEO)是影响网页在引擎搜索结果中的可见性的过程。 SEO专家必须了解搜索引擎如何工作以及网页的哪些功能会影响其在搜索结果中的位置。本文采用机器学习排名算法来构建网络搜索引擎的排名模型。在网络搜索,机器学习和信息检索的交叉点上,根据一组检索文档与给定查询的相关性对它们进行排序已经成为一个普遍的问题。最近,学习学习排名中的特征选择已成为一个关键问题。最近的排名工作集中在许多不同的范例上,即点对,对和列表方式,为此提出了几种预处理特征部分的方法。不幸的是,只有少数工作集中在将特征选择集成到学习过程中,并且所有这些嵌入式方法都基于l_1正则化技术。这种类型的正则化不具有许多对于SEO必不可少的属性,例如无偏性,分组效果和oracle属性。在本文中,我们提出了一种新的贝叶斯框架,用于学习排名问题中的特征选择。所提出的方法给出了用于特征选择的收缩准则的强概率陈述。所提出的正则化是无偏的,具有分组和预言属性,其最大风险发散到有限值。实验结果表明,该框架在人工数据和可公开获得的LETOR数据集上均具有竞争力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号