This project demonstrates the process of time series data analysis, decomposition, and forecasting using Python libraries. We use a dataset of poverty rates from the OECD to forecast future values of poverty rates across various countries.
These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.
To run this project, you need to have the following Python libraries installed:
- pandas
- numpy
- matplotlib
- statsmodels
- pmdarima
You can install them using pip:
pip install pandas numpy matplotlib statsmodels pmdarima
The dataset used in this project is from the OECD and can be downloaded here. Save the downloaded CSV file to your local machine.
This project consists of a Jupyter Notebook that walks you through the following steps:
- Import necessary libraries
- Load the dataset
- Preprocess the data
- Visualize the data
- Decompose the time series
- Forecasting methods
- Train and fit the ARIMA model
- Forecast future values
- Visualize the forecast
- Unit test
Open the Jupyter Notebook in your preferred environment (e.g., JupyterLab, VSCode, or Jupyter Notebook) and run the cells one by one to see the results. Make sure to update the file path to the downloaded dataset in the notebook.
Please feel free to submit pull requests to improve the project or open issues if you encounter any problems.
This project is licensed under the MIT License - see the LICENSE file for details.