【24h】

XML Query Routing in Structured P2P Systems

机译:结构化P2P系统中的XML查询路由

获取原文
获取原文并翻译 | 示例

摘要

This paper addresses the problem of data placement, indexing, and querying large XML data repositories distributed over an existing P2P service infrastructure. Our architecture scales gracefully to the network and data sizes, is fully distributed, fault tolerant and self-organizing, and handles complex queries efficiently, even those queries that use full-text search. Our framework for indexing distributed XML data is based on both meta-data information and textual content. We introduce a novel data synopsis structure to summarize text that correlates textual with positional information and increases query routing precision. Our processing framework maps an XML query with full-text search into a distributed program that migrates from peer to peer, collecting relevant document locations along the way. In addition, we introduce methods to handle network updates, such as node arrivals, departures, and failures. Finally, we report on a prototype implementation, which is used to validate the accuracy of our data synopses and to analyze the various costs involved in indexing XML data and answering queries.
机译:本文解决了分布在现有P2P服务基础架构上的大型XML数据存储库的数据放置,索引和查询问题。我们的体系结构可以根据网络和数据大小灵活地扩展,是完全分布式的,容错的和自组织的,并且可以有效地处理复杂的查询,甚至包括使用全文本搜索的查询。我们为分布式XML数据建立索引的框架是基于元数据信息和文本内容的。我们引入了一种新颖的数据概要结构来总结文本,该文本将文本与位置信息相关联,并提高了查询路由精度。我们的处理框架将带有全文搜索的XML查询映射到一个分布式程序中,该程序从对等方迁移到另一方,并在此过程中收集相关的文档位置。另外,我们介绍了处理网络更新的方法,例如节点到达,离开和故障。最后,我们报告了一个原型实现,该原型实现用于验证数据概要的准确性并分析索引XML数据和回答查询所涉及的各种成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号