应用分布式索引提高海量数据查询性能

窦晓峰; 陈胜; 王熠航; 麦联叨; 由建宏

首页> 中文期刊> 《计算机系统应用》 >应用分布式索引提高海量数据查询性能

应用分布式索引提高海量数据查询性能

AI论文写作 >>

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In the field of telecommunications precision marketing and ad-hoc query, there are a lot of random queries scenarios on one or more wide-tables (which have more than 50 fields). In the traditional system (the queries are performed on the database directly), the query response time can be optimized less than a few seconds to tens of seconds when the database records size is under 10 million. When the data size reaches tens of millions, hundreds of millions or even more than one billion records, whatever optimization including changing indexing mechanism are unable to meet the second-level concurrency query requirements. In the new query system, we introduce the Solr distributed index layer to solve these problems. The layer will index the database records firstly and queries will access the Solr index layer and not perform on the database directly, therefore, the performance will be improved highly. After a comparison of the two processing patterns in same environment, for the data of 50 million, 20 per concurrent access query scenario, the traditional accessing queries all are timeout; while the other’s queries can be returned within 2 seconds and all are success.%在电信领域的精准化营销、即席查询业务中，存在着大量针对一张宽表或几张宽表(超过50字段)的随机查询场景。传统处理模式(直接查询数据库)在数据量不大(<1000万)时，查询响应时间可优化到几秒至数十秒级，而当数据量到达几千万、上亿甚至十亿记录以上时，此处理模式无论如何优化或更改索引机制，都无法满足秒级并发查询要求。新的处理模式通过引入分布式 Solr 索引层解决上述问题。索引层预先对数据库记录建立索引，查询不再作用于数据库而直接查询索引层，如此，可大幅提高查询性能。经过对两种处理模式的对比验证，在相同环境下，数据量到达5000万，每秒20并发访问的宽表查询场景，传统处理模式的查询全部超时失败，而使用分布式索引层的查询可以在2秒以内返回，查询全部成功。

著录项

来源
《计算机系统应用》 |2014年第6期|259-261|共3页
作者
窦晓峰; 陈胜; 王熠航; 麦联叨; 由建宏;
展开▼
作者单位

亚信联创联通事业部;

北京 100086;

亚信联创联通事业部;

北京 100086;

亚信联创联通事业部;

北京 100086;

亚信联创联通事业部;

北京 100086;

亚信联创联通事业部;

北京 100086;

展开▼
原文格式 PDF
正文语种 chi
中图分类
关键词
精准化营销; 即席查询; 海量数据; 大数据; 查询; Solr集群; 分布式索引; 分片; B-Tree;

相似文献

中文文献
外文文献
专利

1. 海量数据冗余干扰下云数据查询的安全索引构建方法 [J] . Zhu-hong LIU ,Wen-jun ZHOU ,Yi-ou WANG . 机床与液压 . 2018,第018期
2. EDA海量数据查询和报表性能优化 [J] . 陈雪梅 . 广东通信技术 . 2014,第006期
3. 基于优化器的提高海量数据查询效率方法研究 [J] . 谭磊 ,顾国强 ,王占宏 . 计算机应用与软件 . 2012,第001期
4. 面向海量文档集的分布式索引构建方法 [J] . 王万牙1 ,石冰1 ,陈驰2 . 网络新媒体技术 . 2016,第005期
5. 面向海量文档集的分布式索引构建方法 [J] . 王万乐 ,石冰 ,陈驰 . 网络新媒体技术 . 2016,第005期
6. 非阻塞分布式数据缓存技术提高CRM海量查询性能 [C] . Cui Xining ,崔希宁 . 2013全国无线及移动通信学术大会 . 2013
7. 面向海量异构历史数据查询的索引管理系统 [A] . 徐冰 . 2013

应用分布式索引提高海量数据查询性能

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅