Coronavirus (COVID-19) is an infectious disease causing a pandemic around the world. A pandemic disease spreads very quickly i.e., at an exponential rate. Most people affected experience respiratory illness and recover without special treatment. However, for older people it is a different story. People over the age of 65 experience underlying medical problems like diabetes, chronic respiratory pain, cardiovascular disease, and even cancer in certain cases. Hence, it is very important we keep ourselves aware and take precautionary measures.
The goal of this project is to forecast the spread of COVID-19 and its future consequences using the statistical model ARIMA.
We will be mainly looking at four parameters of interest:
You will require the following dependencies:
Open config.py to tweak the settings. You will see many parameters that can be changed. The most important ones are:
access_token
: you can refer to this page on how you can obtain a personal access token.FUTURE
: number of future days from today to forecast, must be a value > 0 (otherwise what’s the point of time series forecasting?).PAST
: number of past days from today to forecast, can be used to check the performance.DATA
: path to directory to output dataset, will create it if it does not already exist.FIGURES
: path to directory to output figures, will create it if it does not already exist.The python script can be run with a single command
python -W ignore main.py # ignore warning messages
This will download the dataset in the DATA
directory and generate all the figures in the FIGURES
directory. You can view the results here.