Did you know that despite decades of progress, the gender gap in diplomatic roles remains significant? This project dives into the data to uncover why, focusing on global diplomatic missions with a special spotlight on Italy from 1961 to 2021.
Using SQL on a PostgreSQL database, this analysis seeks to understand the persistent gender disparities in diplomacy and explore potential solutions. Join me in uncovering the data behind the discussions of gender discrimination in the 21st century.
This project aims to analyze gender representation in diplomacy from 1961 to 2021, focusing on global diplomatic missions and positioning Italy within a worldwide context.
The data analysis was conducted using SQL on a PostgreSQL database.
The motivation behind this project stems from my personal curiosity to understand why, in the 21st century, we still discuss gender discrimination. To address this question, I believe it’s essential to focus on every area where this issue may arise, including diplomacy! What better way to find answers than through data? 😃
This project was developed during my journey with start2impact, specifically during the SQL course, and what you see here is the final project.
The analysis is based on data from the GenDip dataset.
For detailed information on the dataset and its columns, please refer to the codebook.
The repository contains the following components:
- Diplomazia_Italiana_al_femminile.pdf: Project presentation
- GenDip_Dataset_public_mainpostings_anonymous_1968-2021_2023-05-30.xlsx: Original dataset
- gendip_dataset.csv: Dataset converted to CSV for processing via the DBMS - Postgres
- gendip_query.sql: SQL queries used for the analysis
For the analysis, PostgreSQL 14.12 and pgAdmin 4 version 7.6 were used.
To convert the XLSX file to CSV and create the template for SQL table creation, the project Create_SQL_script_from_csv_xlsx_dataset was developed.
To reproduce the queries, follow these steps:
- Access pgAdmin.
- Create a Database.
- Create the tables: Open the file gendip_query.sql and run the queries from line 1 to 125, which are the queries for table creation.
- Import the dataset.
- Reproduce the analysis queries: Run the queries from line 127 to 834.
This project is licensed under the MIT License. Feel free to use, modify, and distribute the code as per the license terms.
I would like to express my gratitude to start2impact for providing the learning platform and resources that enabled me to develop this project.
Special thanks to the creators of the GenDip dataset for making this valuable data available for research purposes.