Improving Context-Aware Query Classification via Adaptive Self-training

机译：通过自适应自我培训改进上下文感知查询分类

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Topical classification of user queries is critical for general-purpose web search systems. It is also a challenging task, due to the sparsity of query terms and the lack of labeled queries. On the other hand, search contexts embedded in query sessions and unlabeled queries free on the web have not been fully utilized in most query classification systems. In this work, we leverage these information to improve query classification accuracy. We first incorporate search contexts into our framework using a Conditional Random Field (CRF) model. Discriminative training of CRFs is favored over the traditional maximum likelihood training because of its robustness to noise. We then adapt self-training with our model to exploit the information in unlabeled queries. By investigating different confidence measurements and model selection strategies, we effectively avoid the error-reinforcing nature of self-training. In extensive experiments on real search logs, we have averaged around 20% improvement in classification accuracy over other state-of-the-art baselines.

机译：用户查询的局部分类对于通用网络搜索系统至关重要。由于查询术语的稀缺性和缺乏标记查询，这也是一个具有挑战性的任务。另一方面，在大多数查询分类系统中尚未充分利用嵌入在网上查询会话和未标记查询中的搜索上下文。在这项工作中，我们利用这些信息来提高查询分类准确性。我们首先使用条件随机字段（CRF）模型将搜索上下文纳入我们的框架。由于其对噪声的稳健性，对CRF的歧视性培训受到传统最大可能性培训。然后，我们使用模型进行自我培训，以利用未标记查询中的信息。通过调查不同的置信度量和模型选择策略，我们有效地避免了自我训练的错误增强性质。在实验的实验实验中，我们在其他最先进的基线上平均分类准确性提高约20％。

著录项

来源
《ACM international conference on information and knowledge management》|2011年||共9页
会议地点
作者
Minmin Chen; Jian-Tao Sun; Xiaochuan Ni; Yixin Chen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词
Query classification; User search context; Unlabeled queries;

机译：查询分类;用户搜索上下文;未标记的查询;

相似文献

外文文献
中文文献
专利

1. Context-aware semantic classification of search queries for browsing community question-answering archives [J] . Figueroa Alejandro, Neumann Guenter Knowledge-Based Systems . 2016,第Mara15期

机译：用于浏览社区问答档案的搜索查询的上下文感知语义分类
2. Improved well-log classification using semisupervised label propagation and self-training, with comparisons to popular supervised algorithms [J] . Geophysics: Journal of the Society of Exploration Geophysicists . 2020,第1期

机译：使用半体验标签传播和自我培训来改进井 - 日志分类，与流行的监督算法进行比较
3. Improved well-log classification using semisupervised label propagation and self-training, with comparisons to popular supervised algorithms [J] . Dunham Michael W., Malcolm Alison, Welford J. Kim CASTANEA . 2020,第1期

机译：使用半体验标签传播和自我培训改进了良好的对数分类，与流行的监督算法进行比较
4. Improving Context-Aware Query Classification via Adaptive Self-training [C] . Minmin Chen, Jian-Tao Sun, Xiaochuan Ni, ACM international conference on information and knowledge management . 2011

机译：通过自适应自训练改善上下文感知的查询分类
5. Fuzziness for Classification and Visual Query Interface: Platform Independent Query Model with Self-Adaptive Fuzzy Capabilities [D] . Kian Mehr, Keivan 2011

机译：分类和视觉查询界面的模糊性：具有自适应模糊功能的平台无关查询模型
6. A Rolling Bearing Fault Classification Scheme Based on k-Optimized Adaptive Local Iterative Filtering and Improved Multiscale Permutation Entropy [O] . Yi Zhang, Yong Lv, Mao Ge 2021

机译：一种基于K优化自适应局部迭代过滤和改进的多尺度置换熵的滚动轴承故障分类方案
7. Context-Aware Query Classification [O] . Technology Of China, Huanhuan Cao 2013

机译：上下文感知查询分类
8. Adaptive Polarization Processing for Improved Detection/Classification of Stationary Targets [R] . Wicks, M. C., Zhang, Y., Schneible, R., 2010

机译：自适应极化处理改进的固定目标检测/分类

Improving Context-Aware Query Classification via Adaptive Self-training

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅