Journal: Data in Brief

A first public dataset from Brazilian twitter and news on COVID-19 in Portuguese


Abstract

In this data article, we provide a collection of 3,925,366 tweets and 18,413 online news articles from the online discussion about COVID-19 in Brazil. The Twitter data were collected with the Twitterscraper Python library, using a set of Portuguese keywords related to COVID-19. To help researchers and data enthusiasts identify tweets that contain hashtags, media, or retweets, we created three specific datasets, one for each of these categories. The news on COVID-19 was collected from the UOL portal, the most popular Brazilian website. All the data were gathered from January to May 2020. These datasets may be of interest to communities such as data science, social science, natural language processing, tourism, infodemiology, and public health.
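
As a rough illustration of how a tweet collection could be divided into hashtag, media, and retweet subsets, the sketch below groups records by category. It is not the authors' procedure: the field names `text`, `has_media`, and `is_retweet`, the function `split_by_category`, and the sample tweets are all hypothetical and do not reflect the published dataset's schema.

```python
import re

# Hypothetical record layout: these field names are assumptions made for
# illustration, not the schema of the published dataset.
HASHTAG_RE = re.compile(r"#\w+")


def split_by_category(tweets):
    """Group tweets into hashtag, media, and retweet subsets.

    A tweet may land in more than one subset, or in none.
    """
    subsets = {"hashtags": [], "media": [], "retweets": []}
    for tweet in tweets:
        if HASHTAG_RE.search(tweet.get("text", "")):
            subsets["hashtags"].append(tweet)
        if tweet.get("has_media", False):
            subsets["media"].append(tweet)
        if tweet.get("is_retweet", False):
            subsets["retweets"].append(tweet)
    return subsets


# Made-up sample records, used only to exercise the function.
sample = [
    {"text": "Novo boletim #covid19", "has_media": False, "is_retweet": False},
    {"text": "Veja o gráfico", "has_media": True, "is_retweet": False},
    {"text": "Fique em casa #quarentena", "has_media": True, "is_retweet": True},
]

subsets = split_by_category(sample)
print({name: len(rows) for name, rows in subsets.items()})
```

Because the subsets are allowed to overlap, a tweet that is both a retweet and carries a hashtag appears in both groups; whether the published datasets overlap in the same way is not stated in the abstract.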
