A first public dataset from Brazilian twitter and news on COVID-19 in Portuguese.
Ontology highlight
ABSTRACT: In this data article, we provide a collection of 3,925,366 tweets and 18,413 online news around the online discussion about COVID-19 in Brazil. The data from Twitter were collected through Twitterscraper Python library and we considered a set of keywords in Portuguese regarding to COVID-19. In order to facilitate the identification of tweets that have hashtags, media and retweets for researchers or data enthusiasts, we created three specific datasets for each of these categories. The news on COVID-19 was collected from the UOL portal, the most popular Brazilian website. All the data were gathered from January to May, 2020. These datasets can attract the attention from communities such as data science, social science, natural language processing, tourism, infodemiology, and public health.
SUBMITTER: de Melo T
PROVIDER: S-EPMC7434436 | biostudies-literature |
REPOSITORIES: biostudies-literature
ACCESS DATA