Feedback-driven Result Ranking and Query Refinement for Exploring Semi-structured Data Collections

机译：反馈驱动的结果排名和查询细化，以探索半结构化数据集合

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Feedback process has been used extensively in document-centric applications, such as text retrieval and multimedia retrieval. Recently, there have been efforts to apply feedback to semi-structured XML document collections as well. In this paper, we note that feedback can also be an effective tool for exploring (through result ranking and query refinement) large semi-structured data collections. In particular, in large scale data sharing and curation environments, where the user may not know the structure of the data, queries may initially be overly vague. Given a path query and a set of results identified by the system to this query over the data, we consider two types of feedback: Soft feedback captures the user's preference for some features over the others. Hard feedback, on the other hand, expresses users' assertions regarding whether certain features should be further enforced or, in contrast, are to be avoided. Both soft and hard feedback can be "positive" or "negative". For soft feedback, we develop a probabilistic feature significance measure and describe how to use this for ranking results in the presence of dependencies between the path features. To deal with the hard feedback efficiently (i.e., fast enough for interactive exploration), we present finite automata based query refinement solutions. In particular, we present a novel LazyDFA+ algorithm for managing hard feedback. We also describe optimizations that leverage the inherently iterative nature of the feedback process. We bring together these techniques in AXP, a system for adaptive and exploratory path retrieval. The experimental results show the effectiveness of the proposed techniques.

机译：反馈过程已在以文档为中心的应用程序中广泛使用，例如文本检索和多媒体检索。最近，人们也在努力将反馈应用于半结构化XML文档集合。在本文中，我们注意到反馈也可以是探索（通过结果排名和查询细化）大型半结构化数据集合的有效工具。特别是，在用户可能不知道数据结构的大规模数据共享和管理环境中，查询最初可能过于含糊。给定路径查询和系统针对数据查询所确定的一组结果，我们考虑两种类型的反馈：软反馈捕获用户对某些功能的偏好。另一方面，硬反馈表示用户是否应该进一步实施某些功能，或者应避免使用某些功能。软反馈和硬反馈都可以是“正”或“负”。对于软反馈，我们开发了一种概率特征重要性度量，并描述了如何在路径特征之间存在依赖性的情况下使用该度量对结果进行排名。为了有效地处理硬反馈（即足够快以进行交互式探索），我们提出了基于有限自动机的查询优化解决方案。特别是，我们提出了一种新颖的LazyDFA +算法，用于管理硬反馈。我们还将描述利用反馈过程的固有迭代性质的优化。我们将这些技术整合到AXP中，该系统是一种自适应和探索性路径检索系统。实验结果表明了所提出技术的有效性。

著录项

来源
《13th international conference on extending database technology 2010》|2010年|P.3-14|共12页
会议地点 Lausanne(CH);Lausanne(CH)
作者
Huiping Cao; Van Qi; K. Selcuk Candan; Maria Luisa Sapino;
展开▼
作者单位

Arizona State Univ. Tempe, AZ 85283, USA;

Arizona State Univ. Tempe, AZ 85283, USA;

Arizona State Univ. Tempe, AZ 85283, USA;

Univ. di Torino Torino, Italy;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP311.13;
关键词
relevance feedback; inter-dependent structural feature; feature cover; data-centric XML;

机译：相关性反馈；相互依存的结构特征；功能盖；以数据为中心的XML;
入库时间 2022-08-26 13:47:09

相似文献

外文文献
中文文献
专利

1. Learn-As-You-Go: Feedback-Driven Result Ranking and Query Refinement for Interactive Data Exploration [J] . Vikram Singh, Ajay Singh Procedia Computer Science . 2018,第1期

机译：边学习边学习：基于反馈的结果排名和查询细化，用于交互式数据探索
2. Adaptive query relaxation and top-k result ranking over autonomous web databases [J] . Meng Xiangfu, Zhang Xiaoyan, Tang Yanhuan, Knowledge and information systems . 2017,第2期

机译：自动查询放松和Top-K结果排名在自动Web数据库上
3. USER SATISFACTION OVER QUERY RESULT RANKING IN WEB DATABASE SYSTEMS [J] . PRAMOD KUMAR GHADEI, S. SRIDHAR International Journal of Computer Science Engineering and Information Technology Research . 2014,第1期

机译：Web数据库系统中用户对查询结果排名的满意度
4. Learn-As-You-Go: Feedback-Driven Result Ranking and Query Refinement for Interactive Data Exploration [C] . Vikram Singh, Ajay Singh International Conference on Smart Computing and Communications . 2018

机译：you-you-go：反馈驱动的结果排序和互动数据探索的查询精制
5. Algorithms and Data Structures for Indexing, Querying, and Analyzing Large Collections of Sequencing Data in the Presence or Absence of a Reference [D] . ?Almodaresi, Fatemeh 2020

机译：用于索引，查询和分析大量测序数据的索引，查询和分析参考的算法和数据结构
6. Data structures based on k-mers for querying large collections of sequencing data sets [O] . Camille Marchet, Christina Boucher, Simon J. Puglisi, 2021

机译：基于K-MERS查询大量测序数据集的数据结构
7. Approximate Query Answering and Result Refinement on XML Data [O] . Katja Seidler, Eric Peukert, Gregor Hackenbroich, 2015

机译：XmL数据的近似查询答案和结果细化

Feedback-driven Result Ranking and Query Refinement for Exploring Semi-structured Data Collections

摘要

著录项

相似文献

相关主题

期刊订阅