Efficient column based data encoding for large-scale data storage
Abstract
The subject disclosure relates to column-based data encoding, in which raw data to be compressed is organized by columns. As first and second layers of data size reduction, dictionary encoding and/or value encoding are then applied to the column-organized data to create integer sequences that correspond to the columns. Next, a hybrid greedy run-length encoding and bit-packing compression algorithm further compacts the data according to an analysis of bit savings. The synergy of these hybrid data reduction techniques with the column-based organization, coupled with gains in scanning and querying efficiency owing to the compact representation of the data, results in substantially improved data compression at a fraction of the cost of conventional systems.
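The pipeline described in the abstract can be illustrated with a minimal Python sketch. This is not the patented algorithm: it is a simplified, single-column approximation in which a column is dictionary-encoded into integer IDs and each run of equal IDs is then stored either as a run-length pair or as bit-packed values, whichever costs fewer bits. The function names, the segment layout, and the `run_count_bits` parameter are assumptions made for illustration, and value encoding is not shown.

```python
from itertools import groupby


def dictionary_encode(column):
    """Map each distinct value to a small integer ID (first-seen order)."""
    ids = {}
    encoded = [ids.setdefault(value, len(ids)) for value in column]
    return encoded, list(ids)  # list(ids) is the dictionary: ID -> value


def bits_needed(values):
    """Bits required to bit-pack the largest ID (at least 1)."""
    return max(1, max(values).bit_length()) if values else 1


def hybrid_encode(ids, run_count_bits=32):
    """Per run, pick RLE or bit packing by comparing estimated bit costs.

    run_count_bits is a hypothetical fixed width for storing a run length.
    """
    width = bits_needed(ids)
    segments = []  # ("rle", value, count) or ("packed", [values])
    for value, group in groupby(ids):
        count = len(list(group))
        rle_cost = width + run_count_bits  # one value plus one run length
        packed_cost = width * count        # every value stored separately
        if rle_cost < packed_cost:
            segments.append(("rle", value, count))
        elif segments and segments[-1][0] == "packed":
            # Merge adjacent short runs into one bit-packed segment.
            segments[-1][1].extend([value] * count)
        else:
            segments.append(("packed", [value] * count))
    return width, segments


def hybrid_decode(segments):
    """Expand the segments back into the original integer sequence."""
    out = []
    for segment in segments:
        if segment[0] == "rle":
            out.extend([segment[1]] * segment[2])
        else:
            out.extend(segment[1])
    return out


column = ["US"] * 6 + ["DE", "FR", "DE"] + ["US"] * 2
ids, dictionary = dictionary_encode(column)
width, segments = hybrid_encode(ids, run_count_bits=8)
restored = [dictionary[i] for i in hybrid_decode(segments)]
```

In this sketch the long leading run of "US" is stored as a single run-length pair, while the short runs that follow are cheaper to bit-pack at 2 bits per value. The bit-savings analysis the abstract refers to may be considerably more involved than this per-run cost comparison.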