Unknown

Dataset Information

0

Data analytics for novel coronavirus disease.


ABSTRACT: This paper describes different aspects of novel coronavirus disease (COVID-19), presents visualization of the spread of the infection, and discusses the potential applications of data analytics on this viral infection. Firstly, a literature survey is done on COVID-19 highlighting a number of factors including its origin, its similarity with previous coronaviruses, its transmission capacity, its symptoms, etc. Secondly, data analytics is applied on a dataset of Johns Hopkins University to find out the spread of the viral infection. It is shown here that although the disease started in China in December 2019, the highest number of confirmed cases up to June 04, 2020 is in the USA. Thirdly, the worldwide increase in the number of confirmed cases over time is modelled here using a polynomial regression algorithm with degree 2. Fourthly, classification algorithms are applied on a dataset of 5644 samples provided by Hospital Israelita Albert Einstein of Brazil in order to diagnose COVID-19. It is shown here that multilayer perceptron (MLP), XGBoost and logistic regression can classify COVID-19 patients at an accuracy above 91%. Finally, a discussion is presented on the potential applications of data analytics in several important factors of COVID-19.

SUBMITTER: Mondal MRH 

PROVIDER: S-EPMC7295495 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

Data analytics for novel coronavirus disease.

Mondal M Rubaiyat Hossain MRH   Bharati Subrato S   Podder Prajoy P   Podder Priya P  

Informatics in medicine unlocked 20200615


This paper describes different aspects of novel coronavirus disease (COVID-19), presents visualization of the spread of the infection, and discusses the potential applications of data analytics on this viral infection. Firstly, a literature survey is done on COVID-19 highlighting a number of factors including its origin, its similarity with previous coronaviruses, its transmission capacity, its symptoms, etc. Secondly, data analytics is applied on a dataset of Johns Hopkins University to find ou  ...[more]

Similar Datasets

| S-EPMC7307709 | biostudies-literature
| S-EPMC7123615 | biostudies-literature
| S-EPMC10312385 | biostudies-literature
| S-EPMC4905980 | biostudies-other
| S-EPMC6174428 | biostudies-literature
| S-EPMC7088441 | biostudies-literature
| S-EPMC5343946 | biostudies-literature
| S-EPMC8336660 | biostudies-literature
| S-EPMC8459828 | biostudies-literature
| S-EPMC6461626 | biostudies-literature