0% found this document useful (0 votes)
7 views12 pages

Data Analysis with Pandas Overview

The document provides an overview of data analysis using the Pandas library, including installation, data reading, and DataFrame manipulation. It covers key functionalities such as data slicing, aggregation, merging, and handling missing values. Additionally, it highlights the importance of correctly interpreting missing data rather than simply setting it to zero.

Uploaded by

lokr.789
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views12 pages

Data Analysis with Pandas Overview

The document provides an overview of data analysis using the Pandas library, including installation, data reading, and DataFrame manipulation. It covers key functionalities such as data slicing, aggregation, merging, and handling missing values. Additionally, it highlights the importance of correctly interpreting missing data rather than simply setting it to zero.

Uploaded by

lokr.789
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

1

Data analysis with


pandas
IMAN MOSTAFA
SUEZ CANAL UNIVERSITY
2

Iris Dataset Example


3
Iris Dataset

Features
4
Iris Dataset

Classes
5
Reading Raw dataset example
6
Pandas

 Introduction to Pandas
 Installing Library

Pip install pandas

 Importing the library

Import Pandas as pd

 Reading data using pandas.


pd.read_csv(‘path/file.csv’)
7
Pandas(continued)

 DataFrame
df=pd.DataFrame

 Slicing, indexing and filtering.

 Removing rows, columns or NaN.


8
Pandas(continued)

 Adding new columns.

 Data aggregation (Groupby)

 Merging and Joining.

 Pivoting
9
10
Data cleansing


11
1-Missing Values (Cont.)

 Setting missing values to zero is tempting, but generally wrong, because


there is always some ambiguity as to whether these values should be
interpreted as data or not
 Ex
 Is someone's salary zero because he is unemployed, or did he just not
answer the question?
12
Missing values Examples

You might also like