Distributed SLCA-Based XML Keyword Search by Map-Reduce

机译：通过Map-Reduce分布式基于SLCA的XML关键字搜索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Large scales of XML information comes continually from new Web applications, and SLCA (Smallest Lowest Common Ancestor)-based XML keyword search is one of the most important information retrieval approaches. Previous approaches focus on building index for XML documents. However in information dissemination scenario, it is impossible to build index in advance for continuous XML document streams. This paper addresses SLCA-based keyword search for continuous XML documents by Map-Reduce mechanism. We use parallel algorithms to process plenty of XML documents in Hadoop environment. A distributed SLCA computation method is designed, where each net node computes SLCA independently and just a little information needs be transmitted. A real Hadoop environment is built and we demonstrate the efficiency of our algorithms analytically and experimentally.

机译：新的Web应用程序不断产生大量的XML信息，基于SLCA（最小的最低共同祖先）的XML关键字搜索是最重要的信息检索方法之一。先前的方法着重于为XML文档建立索引。但是，在信息传播场景中，不可能为连续的XML文档流预先建立索引。本文通过Map-Reduce机制解决了基于SLCA的连续XML文档的关键字搜索。我们使用并行算法在Hadoop环境中处理大量XML文档。设计了一种分布式SLCA计算方法，其中每个网络节点独立地计算SLCA，仅需要传输少量信息。构建了一个真实的Hadoop环境，我们通过分析和实验证明了算法的效率。

著录项

来源
《DASFAA 2010;International conference on database systems for advances applications;International workshop on graph data management: Techniques and application;GDM 2010;International workshop on benchmarking of database management systems and data-oriented web technologies;BenchmarX 2010;International workshop on managing data quality in collaborative information systems;MCIS 2010;Workshop on social networks and social media mining on the web;SNSMW 2010;Data-intensive eScience workshop;DIEW 2010;International workshop on ubiquitous data management;UDM 2010 》|2010年|p.386-397|共12页
会议地点 Tsukuba(JP);Tsukuba(JP);Tsukuba(JP);Tsukuba(JP);Tsukuba(JP);Tsukuba(JP);Tsukuba(JP);Tsukuba(JP);Tsukuba(JP);Tsukuba(JP);Tsukuba(JP);Tsukuba(JP);Tsukuba(JP);Tsukuba(JP)
作者
Chenjing Zhang; Qiang Ma; Xiaoling Wang; Aoying Zhou;
展开▼
作者单位

College of Information Technology Shanghai Ocean University China School of Computer Science and Technology Fudan University China;

School of Computer Science and Technology Fudan University China;

Shanghai Key Laboratory of Trustworthy Computing Software Engineering Institute East China Normal University;

School of Computer Science and Technology Fudan University China Shanghai Key Laboratory of Trustworthy Computing Software Engineering Institute East China Normal University;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
SLCA; keyword search; XML; distributed system;

机译：SLCA；关键词搜索; XML;分布式系统;

相似文献

外文文献
中文文献
专利

1. LAF: a new XML encoding and indexing strategy for keyword-based XML search [J] . Zhi-Hong Deng, Yong-Qing Xiang, Ning Gao Concurrency and Computation . 2013 ,第11期

机译：LAF：一种新的基于关键字的XML搜索的XML编码和索引策略
2. XML Keyword Search Using Breadth-First Search [J] . Vinothsaravanan R, Palanisamy C Journal of computational and theoretical nanoscience . 2018 ,第5期

机译：XML关键字搜索使用宽度第一搜索
3. No-but-semantic-match: computing semantically matched xml keyword search results [J] . Naseriparsa Mehdi, Islam Md. Saiful, Liu Chengfei, World Wide Web . 2018 ,第5期

机译：No-but-semantic-match：计算语义匹配的xml关键字搜索结果
4. Distributed SLCA-Based XML Keyword Search by Map-Reduce [C] . Chenjing Zhang, Qiang Ma, Xiaoling Wang, International Conference on Database Systems for Advanced Applications . 2010

机译：分布式基于SLCA的XML关键字搜索按地图减少
5. Enhancing personalized search and improving accuracy and performance for keyword-based XML queries. [D] . Taha, Kamal. 2010

机译：增强个性化搜索并提高基于关键字的XML查询的准确性和性能。
6. Collating of a Distributed XML-based Medical Records into a Relational Database [O] . Do Hoon Oh, Alberto Riva, Kenneth D. Mandl, 2000

机译：将基于XML的分布式医疗记录整理到关系数据库中
7. No-but-semantic-match: computing semantically matched xml keyword search results [O] . Naseriparsa, Mehdi, Islam, Saiful, Liu, Chengfei, 2017

机译：No-but-semantic-match：计算语义匹配的xml关键字搜索结果

Distributed SLCA-Based XML Keyword Search by Map-Reduce

摘要

著录项

相似文献

相关主题

期刊订阅