Comparing top-k XML lists

Ramakrishna Varadarajan; Fernando Farfan; Vagelis Hristidis

Abstract

Systems that produce ranked lists of results are abundant. For instance, Web search engines return ranked lists of Web pages. There has been work on distance measures for list permutations, such as Kendall tau and Spearman's footrule, as well as extensions that handle top-k lists, which are more common in practice. In addition to systems that rank whole objects (e.g., Web pages), an increasing number of systems provide keyword search on XML or other semistructured data and produce ranked lists of XML sub-trees. Unfortunately, previous distance measures are not suitable for ranked lists of sub-trees, since they do not account for possible overlap between the returned sub-trees. That is, two sub-trees differing by a single node would be considered separate objects. In this paper, we present the first distance measures for ranked lists of sub-trees and show under what conditions these measures are metrics. Furthermore, we present algorithms to compute these distance measures efficiently. Finally, we evaluate and compare the proposed measures on real data using three popular XML keyword proximity search systems.
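
As an illustration of the existing measures the abstract mentions, the following Python sketch computes Kendall tau and Spearman's footrule for two permutations, plus a footrule variant for top-k lists. It is not the method proposed in the paper. All function names are hypothetical, and the top-k variant assumes one common convention in which an item missing from a list is treated as if it were ranked at position k+1. Sub-trees are modeled as frozensets of node IDs, which is also an assumption made only for this example.

```python
from itertools import combinations


def kendall_tau(a, b):
    """Count item pairs that appear in opposite order in two permutations."""
    pos_b = {item: i for i, item in enumerate(b)}
    # combinations(a, 2) yields (x, y) with x ranked before y in a.
    return sum(1 for x, y in combinations(a, 2) if pos_b[x] > pos_b[y])


def footrule(a, b):
    """Sum of absolute rank differences between two permutations."""
    pos_a = {item: i for i, item in enumerate(a)}
    pos_b = {item: i for i, item in enumerate(b)}
    return sum(abs(pos_a[x] - pos_b[x]) for x in pos_a)


def footrule_topk(a, b):
    """Footrule for two top-k lists of equal length k.

    Assumption: an item absent from a list is placed at 1-based
    position k+1 (0-based index k) in that list.
    """
    k = len(a)
    pos_a = {item: i for i, item in enumerate(a)}
    pos_b = {item: i for i, item in enumerate(b)}
    items = set(a) | set(b)
    return sum(abs(pos_a.get(x, k) - pos_b.get(x, k)) for x in items)


# Two permutations of the same three Web pages.
print(kendall_tau(["p1", "p2", "p3"], ["p3", "p1", "p2"]))  # 2
print(footrule(["p1", "p2", "p3"], ["p3", "p1", "p2"]))     # 4

# Sub-trees as sets of node IDs (illustrative model only).
base = [frozenset({1, 2, 3}), frozenset({7, 8})]
near = [frozenset({1, 2, 3, 4}), frozenset({7, 8})]  # top result differs by one node
far = [frozenset({50, 51}), frozenset({7, 8})]       # top result is unrelated

print(footrule_topk(base, near))  # 4
print(footrule_topk(base, far))   # 4
```

Under this object-level comparison, the list whose top result differs by a single node is exactly as far from the base list as the list whose top result is unrelated, which is the overlap problem the abstract describes.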

Bibliographic record

Source
Information Systems | 2013, Issue 6 | pp. 820-834 | 15 pages
Authors
Ramakrishna Varadarajan; Fernando Farfan; Vagelis Hristidis
Author affiliations

Hewlett-Packard, Billerica, MA 01821, United States;

Department of Computer Science and Engineering, University of Michigan, Ann Arbor, MI 48109, United States;

Department of Computer Science and Engineering, University of California, Riverside, CA 92521, United States;

Indexed in: Science Citation Index (SCI), United States; Engineering Index (EI), United States
Original format: PDF
Language: English
Keywords
Total mapping; Partial mapping; Similarity distance; Position distance;

Date added to database: 2022-08-18 02:47:55
