Quality versus efficiency in document scoring with learning-to-rank models

Gabriele Capannini; Claudio Lucchese; Franco Maria Nardini; Salvatore Orlando; Raffaele Perego; Nicola Tonellotto

Abstract

Learning-to-Rank (LtR) techniques leverage machine learning algorithms and large amounts of training data to induce high-quality ranking functions. Given a set of documents and a user query, these functions predict a score for each document, and the scores are then used to rank the documents. Although the scoring efficiency of LtR models is critical in several applications (for example, it directly affects the response time and throughput of Web query processing), it has received relatively little attention so far.

The goal of this work is to experimentally investigate the scoring efficiency of LtR models along with their ranking quality. Specifically, we show that machine-learned ranking models exhibit a quality versus efficiency trade-off. For example, each family of LtR algorithms has tuning parameters that can influence both effectiveness and efficiency, where higher ranking quality is generally obtained with more complex and expensive models. Moreover, LtR algorithms that learn complex models, such as those based on forests of regression trees, are generally more expensive and more effective than algorithms that induce simpler models, such as linear combinations of features.

We extensively analyze the quality versus efficiency trade-off of a wide spectrum of state-of-the-art LtR algorithms, and we propose a sound methodology to devise the most effective ranker given a time budget.
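
The idea of choosing the most effective ranker under a time budget can be illustrated with a minimal sketch. It is not the methodology of the paper, only the basic selection step under stated assumptions: each candidate model has already been measured for ranking quality and per-document scoring cost. The class `Candidate`, the function `best_within_budget`, the model names, and all numbers are hypothetical placeholders, not results from the paper.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Candidate:
    name: str        # hypothetical model label
    quality: float   # ranking quality on held-out data (higher is better)
    cost_us: float   # average scoring time per document, in microseconds


def best_within_budget(candidates: Sequence[Candidate],
                       budget_us: float) -> Optional[Candidate]:
    """Return the highest-quality candidate whose cost fits the budget."""
    feasible = [c for c in candidates if c.cost_us <= budget_us]
    if not feasible:
        return None  # no model is fast enough for this budget
    # Prefer higher quality; break ties in favour of the cheaper model.
    return max(feasible, key=lambda c: (c.quality, -c.cost_us))


# Made-up measurements, for illustration only.
CANDIDATES = [
    Candidate("linear", quality=0.70, cost_us=0.1),
    Candidate("small_forest", quality=0.75, cost_us=2.0),
    Candidate("large_forest", quality=0.78, cost_us=20.0),
]

for budget in (1.0, 5.0, 50.0):
    chosen = best_within_budget(CANDIDATES, budget)
    print(budget, chosen.name if chosen else None)
```

With these placeholder values the selected model changes as the budget grows, which is the kind of budget-dependent choice the abstract describes.
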
To guarantee reproducibility, we used publicly available datasets and we contribute an open source C++ framework providing optimized, multi-threaded implementations of the most effective tree-based learners: Gradient Boosted Regression Trees (GBRT), Lambda-Mart (λ-MART), and the first public-domain implementation of Oblivious Lambda-Mart (Ωλ-MART), an algorithm that induces forests of oblivious regression trees.

We investigate how the different training parameters affect the quality versus efficiency trade-off, and provide a thorough comparison of several algorithms in the quality-cost space. The experiments show that there is no overall best algorithm; the optimal choice depends on the time budget.
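
For readers unfamiliar with oblivious regression trees, the following is a minimal sketch of how a forest of such trees might score one document. In an oblivious tree, every node on a given level applies the same feature and threshold test, so the leaf can be found by collecting one bit per level. The names `ObliviousTree` and `score_forest`, the strict `>` comparison, and the bit ordering are assumptions made for illustration. This is not code from the authors' C++ framework.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class ObliviousTree:
    # One (feature index, threshold) test per level, shared by all nodes
    # on that level.
    tests: Tuple[Tuple[int, float], ...]
    # 2 ** depth leaf outputs, indexed by the bit pattern of test outcomes.
    leaves: Tuple[float, ...]

    def score(self, x: Sequence[float]) -> float:
        idx = 0
        for feature, threshold in self.tests:
            # Append one bit per level: 1 if the test passes, 0 otherwise.
            idx = (idx << 1) | (1 if x[feature] > threshold else 0)
        return self.leaves[idx]


def score_forest(forest: Sequence[ObliviousTree], x: Sequence[float]) -> float:
    """Additive ensemble: the document score is the sum of the tree outputs."""
    return sum(tree.score(x) for tree in forest)


# Hypothetical depth-2 tree over a two-feature document vector.
TREE = ObliviousTree(tests=((0, 0.5), (1, 2.0)), leaves=(0.0, 1.0, 2.0, 3.0))
FOREST = [TREE, TREE]

print(TREE.score([0.7, 1.0]))            # first test passes, second fails
print(score_forest(FOREST, [0.2, 3.0]))  # first test fails, second passes
```

Because the number of comparisons per tree is fixed by its depth, the scoring cost of such a forest grows with the number of trees and their depth, which is one way model complexity translates into scoring time.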

Bibliographic details

Source
Information Processing & Management | 2016, Issue 6 | pp. 1161-1177 | 17 pages
Authors
Gabriele Capannini; Claudio Lucchese; Franco Maria Nardini; Salvatore Orlando; Raffaele Perego; Nicola Tonellotto;
Author affiliations

Innovation Design och Teknik (IDT), Maelardalens hoegskola, Vaesteras, Sweden;

Istituto di Scienza e Tecnologie dell'Informazione (ISTI) of the National Research Council of Italy (CNR), Pisa, Italy and Istella Srl, Cagliari, Italy;

Istituto di Scienza e Tecnologie dell'Informazione (ISTI) of the National Research Council of Italy (CNR), Pisa, Italy and Istella Srl, Cagliari, Italy;

University Ca'Foscari of Venice, Italy;

Istituto di Scienza e Tecnologie dell'Informazione (ISTI) of the National Research Council of Italy (CNR), Pisa, Italy and Istella Srl, Cagliari, Italy;

Istituto di Scienza e Tecnologie dell'Informazione (ISTI) of the National Research Council of Italy (CNR), Pisa, Italy and Istella Srl, Cagliari, Italy;

Indexing information
Original format: PDF
Text language: English
Chinese Library Classification
Keywords
Efficiency; Learning-to-rank; Document scoring;

Date indexed: 2022-08-17 23:20:12

