首页> 外国专利> ELECTRONIC MAIL DATA MODELING FOR EFFICIENT INDEXING

ELECTRONIC MAIL DATA MODELING FOR EFFICIENT INDEXING

机译:用于高效索引的电子邮件数据建模

摘要

Techniques are herein described for creating a scalable IMAP4 compliant email system using a NoSQL database and a distributed full text search engine. Data for each email message is stored in multiple tables to avoid storing redundant data unnecessarily. However, a full text search index is created based on a single table as if the index refers to a single table. In embodiments herein described, the single index is created on the fields of a message metadata table with virtual fields added to the table that are derived from the message content. During this process, data is pulled from a message table in “blob” format and broken down into corresponding fields and data items, so the data items may be converted and placed in the proper virtual fields for index creation. Each blob section that is converted is cached, so the same blob section does not need to be converted multiple times. After index creation, the index may be used to search for emails based on metadata and data within the body of the email.
机译:本文描述了用于使用NoSQL数据库和分布式全文本搜索引擎创建可伸缩的IMAP4兼容电子邮件系统的技术。每封电子邮件的数据都存储在多个表中,以避免不必要地存储冗余数据。但是,将基于单个表创建全文本搜索索引,就好像该索引引用单个表一样。在本文所描述的实施例中,在消息元数据表的字段上创建单个索引,其中从消息内容派生的虚拟字段被添加到表中。在此过程中,数据以“ blob”格式从消息表中提取,并细分为相应的字段和数据项,因此可以转换数据项并将其放置在适当的虚拟字段中以进行索引创建。每个要转换的blob节都被缓存,因此同一blob节不需要多次转换。创建索引后,索引可用于基于电子邮件正文中的元数据和数据搜索电子邮件。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号