This project focuses on data analysis with Python's Pandas library. The dataset used in this analysis is from a fictional e-commerce company. The analysis involves examining, cleaning, processing, and extracting useful information from the dataset.
-
Data Exploration:
- Imported the dataset and viewed basic statistics.
- Identified the top item categories and clients with the most entries.
-
Data Transformation:
- Created new columns for subtotal, shipping price, total price, cost, and profit.
- Calculated these values based on given conditions and formulas.
-
Verification:
- Confirmed the total prices for specific orders against given receipts.
-
Summary and Analysis:
- Calculated the total spending for the top 5 clients by quantity.
- Created a summary DataFrame showing total units purchased, total shipping price, total revenue, and total profit.
- Formatted monetary values to millions and sorted the data by total profit.
- Clone this repository:
git clone YOUR_GITHUB_URL