Correlation between GNI and CO2
Visualization protocol
TIMESTAMP
1960-2021
DATA SOURCE
https://www.kaggle.com/datasets/toriqulstu/global-socio-economic-and-environmental-indicators
This comprehensive dataset provides a rich collection of socio-economic and environmental indicators for countries across the world. Spanning the years from 1990 to 2021, the dataset includes valuable information on Human Development Index (HDI), Life Expectancy, Gross National Income per Capita (GNI), and CO2 Production.
Provenance: hdr.undp.org
License: CC0: Public Domain
https://github.com/owid/co2-data
Complete CO2 and Greenhouse Gas Emissions dataset is a collection of key metrics maintained by Our World in Data. It is updated regularly and includes data on CO2 emissions (annual, per capita, cumulative and consumption-based).
DATASET
GNI/CO2:
https://drive.google.com/file/d/1eyWZtrpB8cGD8MxtKqiLVXv1tDaRlhrV/view?usp=sharing
METADATA
region_CO2: geographical region or grouping
ISO3: Three-letter country code
country: country name
population: the total number of individuals in a specific area
gnipc_: Gross National Income (GNI) per capita for the year 2021
co2_per_capita: co2 production per capita in 2021
PROTOCOL
1. Data Concatenation from Multiple Sources:
Four distinct datasets were sourced from Kaggle and subsequently concatenated into a single comprehensive dataset.
This consolidated dataset was then further combined with an additional dataset obtained from GitHub.
2. Data Preprocessing and Feature Selection:
A focused selection of features was made for the analysis, including 'country', 'year', 'population', 'co2_per capita', 'gnipc_', 'region_co2', and 'ISO3'.
Data corresponding to the year 2021 was exclusively extracted for further processing.
A list comprising exclusively European countries was created and applied to the dataset, resulting in a refined dataset specifically tailored for analyzing the distribution of CO2 emissions in European countries for the year 2021.
3. Data Visualization Using Flourish:
The Flourish software platform was employed to visualize the data. A scatter plot template was chosen for this purpose.
The finalized dataset was uploaded to Flourish.
Appropriate columns were selected to construct a data visualization that effectively demonstrates the correlation between Gross National Income and CO2 emissions per capita among European countries. Notably, the size of each bubble in the scatter plot is representative of the population size of the respective country.