【24h】

Fast Search for Multimedia Metadata in an XML Data Repository

机译:在XML数据存储库中快速搜索多媒体元数据

获取原文
获取原文并翻译 | 示例

摘要

Performance of multimedia metadata management depends on the storage, indexing, and proper schema design of the system. In this paper, we propose a design strategy for improvement of search performance to build a prototype multimedia metadata system (MMS). The proposed design strategy includes the following aspects. First, we model and design the MMS using native XML database. The native XML database stores XML documents directly without going through normalization processes required in an RDBMS. We do not store XML document on top of an RDBMS, either. Second, we introduce an assistant document, the rotated document for each searchable descriptor. Optimal inter-document schema is designed to search and locate documents in the data repository. Third, on the rotated documents, we perform lexical processing including word separating, stemming and stop word removal. Finally, different index structures are implemented and tested to build fast indices for the system. Experiments show that the proposed design achieves a speedup factor of 9.3 compared with the direct DOM method. It can speed up even more than a system based on RDBMS. It also has a better scalability.
机译:多媒体元数据管理的性能取决于系统的存储,索引和适当的架构设计。在本文中,我们提出了一种用于提高搜索性能的设计策略,以构建原型多媒体元数据系统(MMS)。提出的设计策略包括以下方面。首先,我们使用本地XML数据库对MMS进行建模和设计。本机XML数据库直接存储XML文档,而无需经过RDBMS中所需的规范化过程。我们也不将XML文档存储在RDBMS之上。其次,我们引入一个辅助文档,即每个可搜索描述符的旋转文档。最佳的文档间架构旨在搜索和定位数据存储库中的文档。第三,在旋转的文档上,我们执行词汇处理,包括单词分离,词干提取和停止单词去除。最后,实现并测试了不同的索引结构以为系统构建快速索引。实验表明,与直接DOM方法相比,该设计的加速因子达到9.3。它可以比基于RDBMS的系统更快。它还具有更好的可伸缩性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号