Journal: International Journal of Computers & Applications

Storing, preprocessing and analyzing tweets: finding the suitable noSQL system



Abstract

In the past few years, Tweets have been widely used to perform Big Data analysis. However, the very large amount of data captured by Twitter needs to be stored for further processing, which may be a challenging task for many database systems. NoSQL is a generation of databases that aims to handle large volumes of data. However, there is a large set of NoSQL systems, each with its own characteristics. Consequently, choosing a suitable NoSQL system to handle Tweets is challenging. Motivated by this, this work sets out to find the most suitable NoSQL system for managing Tweets. This paper presents the requirements of managing Tweets and provides a detailed comparison of five NoSQL systems, namely Redis, Cassandra, MongoDB, Couchbase and Neo4j, with respect to these requirements. The five NoSQL systems are compared in a real scenario in which we collect and analyze 1,000,000 Tweets. The chosen scenario makes it possible to evaluate not only the performance of read and write operations, but also other requirements related to Tweet management, such as scalability, analysis tool support and analysis language support. The obtained results show that Couchbase is the most suitable NoSQL system for managing Tweets.
