A MapReduce-based Approach to Scale Big Semantic Data Compression with HDT

Abstract

Data generation and publication on the Web have increased in recent years. This phenomenon, usually known as "Big Data", poses new challenges related to the Volume, Velocity, and Variety ("the three V's") of data. The Semantic Web offers the means to deal with variety, where RDF (Resource Description Framework) is used to model data in the form of subject-predicate-object triples. In this way, it is possible to represent and interconnect RDF triples to build a true Web of Data. Nonetheless, a problem arises when big RDF collections must be stored, exchanged, and/or queried, because the existing serialization formats are highly verbose; hence, the remaining Big Semantic Data challenges (volume and velocity) are aggravated when storing, exchanging, or querying big RDF collections. HDT addresses this issue by proposing a binary serialization format based on compact data structures that allows RDF not only to be compressed, but also to be queried without prior decompression. Thus, HDT reduces data volume and increases retrieval velocity. However, this achievement comes at the cost of an expensive RDF-to-HDT serialization in terms of computational resources and time. Therefore, HDT alleviates the velocity and volume challenges for the end user, but moves the Big Data challenges to the data publisher. In this work we present HDT-MR, a MapReduce-based algorithm that allows RDF datasets to be serialized to HDT in a distributed way, reducing processing resources and time, while also enabling larger datasets to be compressed.
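The serialize-then-query workflow that HDT enables can be made concrete with its reference Java implementation. The following is a minimal sketch, assuming the rdfhdt/hdt-java library (artifact hdt-java-core) and a hypothetical local N-Triples file data.nt; it compresses the file to HDT and then runs a wildcard triple-pattern query directly over the compressed result, with no prior decompression:

    import org.rdfhdt.hdt.enums.RDFNotation;
    import org.rdfhdt.hdt.hdt.HDT;
    import org.rdfhdt.hdt.hdt.HDTManager;
    import org.rdfhdt.hdt.options.HDTSpecification;
    import org.rdfhdt.hdt.triples.IteratorTripleString;

    public class HdtExample {
        public static void main(String[] args) throws Exception {
            // RDF-to-HDT serialization: the expensive single-node step
            // that HDT-MR distributes over a MapReduce cluster.
            HDT hdt = HDTManager.generateHDT(
                    "data.nt",                  // input RDF file (hypothetical)
                    "http://example.org/base",  // base URI
                    RDFNotation.NTRIPLES,
                    new HDTSpecification(),     // default compression options
                    null);                      // no progress listener
            hdt.saveToHDT("data.hdt", null);
            hdt.close();

            // Query the compressed file in place; empty strings act as
            // wildcards for subject, predicate, and object.
            HDT mapped = HDTManager.mapHDT("data.hdt", null);
            IteratorTripleString it = mapped.search("", "", "");
            while (it.hasNext()) {
                System.out.println(it.next());
            }
            mapped.close();
        }
    }

The abstract does not spell out HDT-MR's internals, but the natural map-side step of any distributed RDF-to-HDT pipeline is extracting dictionary terms from raw triples so that a later sort/reduce phase can deduplicate them and assign the role-partitioned IDs that HDT's dictionary component expects. The Hadoop mapper below is purely illustrative of that idea, not the paper's code; the class name and the simplistic N-Triples handling are assumptions:

    import java.io.IOException;
    import org.apache.hadoop.io.LongWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapreduce.Mapper;

    // Hypothetical sketch: emits each RDF term tagged with its role so the
    // shuffle groups identical terms together for dictionary construction.
    public class TermExtractMapper extends Mapper<LongWritable, Text, Text, Text> {
        @Override
        protected void map(LongWritable offset, Text ntLine, Context ctx)
                throws IOException, InterruptedException {
            // Naive split; a real N-Triples parser must handle literals
            // that contain spaces and escaped characters.
            String[] parts = ntLine.toString().split(" ", 3);
            if (parts.length < 3) {
                return; // skip blank or malformed lines
            }
            ctx.write(new Text(parts[0]), new Text("S"));  // subject
            ctx.write(new Text(parts[1]), new Text("P"));  // predicate
            String object = parts[2].replaceAll("\\s*\\.\\s*$", "");
            ctx.write(new Text(object), new Text("O"));    // object
        }
    }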