0% found this document useful (0 votes)

14 views7 pages

Assignment 3

The document outlines a Python assignment that involves data analysis using the Iris dataset with pandas and numpy. It includes calculations of mean, mode, median, standard deviation, minimum, and maximum for various features of the dataset, as well as visualizations through histograms and box plots. Additionally, it demonstrates how to remove outliers using the interquartile range (IQR) method.

Uploaded by

Shubham Khansare

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views7 pages

Assignment 3

Uploaded by

Shubham Khansare

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Assignment -03

import pandas as pd

import numpy as np

In [2]:

from sklearn import datasets

iris = datasets.load_iris()

df = [Link](data= np.c_[iris['data'], iris['target']],

columns= iris['feature_names'] + ['target'])

Out[2]:

sepal length (cm) sepal width (cm) petal length (cm) petal width (cm) target

0 5.1 3.5 1.4 0.2 0.0

1 4.9 3.0 1.4 0.2 0.0

2 4.7 3.2 1.3 0.2 0.0

3 4.6 3.1 1.5 0.2 0.0

4 5.0 3.6 1.4 0.2 0.0

... ... ... ... ... ...

145 6.7 3.0 5.2 2.3 2.0

146 6.3 2.5 5.0 1.9 2.0

147 6.5 3.0 5.2 2.0 2.0

148 6.2 3.4 5.4 2.3 2.0

149 5.9 3.0 5.1 1.8 2.0

150 rows × 5 columns

In [3]:

#mean
print(df['petal length (cm)'].mean())

print(df['sepal length (cm)'].mean())

print(df['sepal width (cm)'].mean())

3.7580000000000005

5.843333333333334

3.0573333333333337

In [4]:

#mode

print(df['petal length (cm)'].mode())

print(df['sepal length (cm)'].mode())

print(df['sepal width (cm)'].mode())

0 1.4

1 1.5

Name: petal length (cm), dtype: float64

0 5.0

Name: sepal length (cm), dtype: float64

0 3.0

Name: sepal width (cm), dtype: float64

0 3.0

Name: sepal width (cm), dtype: float64

In [5]:

#median

print(df['petal length (cm)'].median())

print(df['sepal length (cm)'].median())

print(df['sepal width (cm)'].median())

4.35

5.8
3.0

3.0

In [6]:

#standard deviation

print(df['petal length (cm)'].std())

print(df['sepal length (cm)'].std())

print(df['sepal width (cm)'].std())

1.7652982332594662

0.828066127977863

0.4358662849366982

In [7]:

#minimun

print(df['petal length (cm)'].min())

print(df['sepal length (cm)'].min())

print(df['sepal width (cm)'].min())

1.0

4.3

2.0

In [8]:

#maximum

print(df['petal length (cm)'].max())

print(df['sepal length (cm)'].max())

print(df['sepal width (cm)'].max())

6.9

7.9

4.4
4.4

In [9]:

import [Link] as plt

[Link](figsize=(12, 8))

[Link](2, 2, 1)

[Link](df['petal length (cm)'], bins=10)

[Link]('Petal Length Distribution')

[Link]('Petal Length (cm)')

[Link]('Frequency')

[Link](2, 2, 2)

[Link](df['sepal length (cm)'], bins=10)

[Link]('Sepal Length Distribution')

[Link]('Sepal Length (cm)')

[Link]('Frequency')

[Link](2, 2, 3)

[Link](df['sepal width (cm)'], bins=10)

[Link]('Sepal Width Distribution')

[Link]('Sepal Width (cm)')

[Link]('Frequency')

[Link](2, 2, 4)

[Link](df['petal width (cm)'], bins=10)

[Link]('Petal Width Distribution')

[Link]('Petal Width (cm)')

[Link]('Frequency')

plt.tight_layout()
[Link]()

In [10]:

[Link](figsize=(12, 6))

[Link](1, 2, 1)

[Link](column=['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)'])

[Link]('Box Plots Before Outlier Removal')

Out[10]:

Text(0.5, 1.0, 'Box Plots Before Outlier Removal')

In [11]:

def remove_outliers_iqr(df, column):

Q1 = df[column].quantile(0.25)

Q3 = df[column].quantile(0.75)

IQR = Q3 - Q1

lower_bound = Q1 - 1.5 * IQR

upper_bound = Q3 + 1.5 * IQR

df_filtered = df[(df[column] >= lower_bound) & (df[column] <= upper_bound)]

return df_filtered

for column in ['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)']:

df = remove_outliers_iqr(df, column)

In [34]:

[Link](1, 2, 2)

[Link](column=['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)'])

[Link]('Box Plots After Outlier Removal')

plt.tight_layout()

[Link]()

In [ ]:

Import As Import As From Import Import As Import As From Import From Import From Import
No ratings yet
Import As Import As From Import Import As Import As From Import From Import From Import
6 pages
Assignment 5'
No ratings yet
Assignment 5'
4 pages
DL Experiment - 1
No ratings yet
DL Experiment - 1
10 pages
EXP 07 (ML) - Darshu
No ratings yet
EXP 07 (ML) - Darshu
4 pages
Exp 07 (ML)
No ratings yet
Exp 07 (ML)
4 pages
EXP 07 (ML) - Ashu
No ratings yet
EXP 07 (ML) - Ashu
4 pages
EXP 07 (ML) - Sarthak
No ratings yet
EXP 07 (ML) - Sarthak
4 pages
Code
No ratings yet
Code
3 pages
Experiment-2-1-Ml Kritika
No ratings yet
Experiment-2-1-Ml Kritika
11 pages
DSE 6 - Colab
No ratings yet
DSE 6 - Colab
5 pages
Dsa 1
No ratings yet
Dsa 1
8 pages
10 TH
No ratings yet
10 TH
7 pages
Matplotlib Notes
No ratings yet
Matplotlib Notes
23 pages
Fds Slips
No ratings yet
Fds Slips
6 pages
MLRecord
No ratings yet
MLRecord
24 pages
Iris Dataset Feature Analysis
No ratings yet
Iris Dataset Feature Analysis
3 pages
085
No ratings yet
085
4 pages
Data Assigment 1
100% (2)
Data Assigment 1
32 pages
Logistic Regression on Iris Dataset
No ratings yet
Logistic Regression on Iris Dataset
7 pages
Nandini Matplotlib Ws
No ratings yet
Nandini Matplotlib Ws
10 pages
Data Analysis with Python Scripts
No ratings yet
Data Analysis with Python Scripts
9 pages
Machine Learning Group Project
No ratings yet
Machine Learning Group Project
22 pages
Ploomber Notebook Conversion - 2
No ratings yet
Ploomber Notebook Conversion - 2
14 pages
EDA of Iris Dataset in Python
No ratings yet
EDA of Iris Dataset in Python
12 pages
Ass - 10.ipynb - Colab
No ratings yet
Ass - 10.ipynb - Colab
8 pages
Mayank Chaudhary DEV Practicals
No ratings yet
Mayank Chaudhary DEV Practicals
14 pages
Exp 5,6,7
No ratings yet
Exp 5,6,7
2 pages
Dsbda 10
No ratings yet
Dsbda 10
3 pages
Practical 10 Code
No ratings yet
Practical 10 Code
5 pages
Exercise For K Means Tutorial
No ratings yet
Exercise For K Means Tutorial
5 pages
Practical No - 1
No ratings yet
Practical No - 1
5 pages
25 - Assignment10.ipynb - Colaboratory
No ratings yet
25 - Assignment10.ipynb - Colaboratory
13 pages
Assignment - 10 - Pandas
No ratings yet
Assignment - 10 - Pandas
53 pages
Data Science Practical Book - Ipynb
No ratings yet
Data Science Practical Book - Ipynb
21 pages
Keeratsi HW8
No ratings yet
Keeratsi HW8
17 pages
DSDBAAssignment2 SUMEET
No ratings yet
DSDBAAssignment2 SUMEET
8 pages
DP Prog
No ratings yet
DP Prog
10 pages
Dsbda 10
No ratings yet
Dsbda 10
5 pages
Assignment 2
No ratings yet
Assignment 2
2 pages
K Means Algorithm
No ratings yet
K Means Algorithm
1 page
Exno 4
No ratings yet
Exno 4
13 pages
ML#07
No ratings yet
ML#07
21 pages
Batch1 Ds
No ratings yet
Batch1 Ds
15 pages
Data Visualization with Matplotlib
No ratings yet
Data Visualization with Matplotlib
8 pages
Davp Pyq 2023 Solution
No ratings yet
Davp Pyq 2023 Solution
15 pages
Name:-Nisha Ambike: Roll No: - 02
No ratings yet
Name:-Nisha Ambike: Roll No: - 02
2 pages
Heart Disease Diagnosis Using Machine Learning
No ratings yet
Heart Disease Diagnosis Using Machine Learning
26 pages
Matplotlib Plotting Techniques in Python
No ratings yet
Matplotlib Plotting Techniques in Python
11 pages
A2 60 Rohit Jakkam EDA of Iris - Ipynb - Colaboratory
No ratings yet
A2 60 Rohit Jakkam EDA of Iris - Ipynb - Colaboratory
5 pages
FDS LAB Record Print
No ratings yet
FDS LAB Record Print
45 pages
7 Output
No ratings yet
7 Output
4 pages
ADS Practical Exam Questions
No ratings yet
ADS Practical Exam Questions
14 pages
137 Vsec 6
No ratings yet
137 Vsec 6
2 pages
Data Visualization
No ratings yet
Data Visualization
18 pages
13-9-23 Data Pre-Processing - Jupyter Notebook
No ratings yet
13-9-23 Data Pre-Processing - Jupyter Notebook
8 pages
Lab Extern L
No ratings yet
Lab Extern L
8 pages
DV Mid Internal 1
No ratings yet
DV Mid Internal 1
8 pages
Data Analyzer
No ratings yet
Data Analyzer
10 pages
HSB Chapter12-Attraction and Love
No ratings yet
HSB Chapter12-Attraction and Love
29 pages
B2 SAMPLE TEST With Key For Magistrate Students
No ratings yet
B2 SAMPLE TEST With Key For Magistrate Students
11 pages
API 520 Part 1 Blowdown Liquids LESSER PSV
No ratings yet
API 520 Part 1 Blowdown Liquids LESSER PSV
1 page
Lodestone EIA Amendment Report - Mine - 5nov20 IAP Review Final1 Exec Summ
No ratings yet
Lodestone EIA Amendment Report - Mine - 5nov20 IAP Review Final1 Exec Summ
20 pages
CALENG 2 LQ3 Notes
No ratings yet
CALENG 2 LQ3 Notes
14 pages
Essential Email Etiquette Tips
No ratings yet
Essential Email Etiquette Tips
1 page
Neuromarketing Research in The Last Five Years A Bibliometric Analysis Interessante para Artigo
No ratings yet
Neuromarketing Research in The Last Five Years A Bibliometric Analysis Interessante para Artigo
37 pages
Student Dialog Exercises
No ratings yet
Student Dialog Exercises
3 pages
Wellbeing Class 9 English 05-11-2023 (H) (Inde. 20) Nur
No ratings yet
Wellbeing Class 9 English 05-11-2023 (H) (Inde. 20) Nur
116 pages
Joseph Small CV 2021
No ratings yet
Joseph Small CV 2021
1 page
Exploring The Implementation of The Last Planner® System Through Iglc Community: Twenty One Years of Experience
No ratings yet
Exploring The Implementation of The Last Planner® System Through Iglc Community: Twenty One Years of Experience
11 pages
5th Grade Lesson Plan Product 6 February
No ratings yet
5th Grade Lesson Plan Product 6 February
8 pages
Grade 2 Science EOS1 Print Paper 2
No ratings yet
Grade 2 Science EOS1 Print Paper 2
6 pages
Vlsi Unit 3
No ratings yet
Vlsi Unit 3
83 pages
The Study of Life: Teacher Notes and Answers
No ratings yet
The Study of Life: Teacher Notes and Answers
4 pages
Buy Ebook Applied Linear Algebra Second Edition Peter J. Olver Cheap Price
100% (4)
Buy Ebook Applied Linear Algebra Second Edition Peter J. Olver Cheap Price
37 pages
ECV 308 SOIL MECHANICS II-Slides 1-15
No ratings yet
ECV 308 SOIL MECHANICS II-Slides 1-15
16 pages
How To Add New Disks To ONTAP's Existing ADP Aggregates
No ratings yet
How To Add New Disks To ONTAP's Existing ADP Aggregates
5 pages
Pfeffer's Comment On Ghoshal
No ratings yet
Pfeffer's Comment On Ghoshal
6 pages
PS 6 Final
No ratings yet
PS 6 Final
17 pages
Minimizing Backlash in Spur Gears
No ratings yet
Minimizing Backlash in Spur Gears
8 pages
Strengths Finder Assessment Insights
No ratings yet
Strengths Finder Assessment Insights
4 pages
Vrts112 - Activity #1
No ratings yet
Vrts112 - Activity #1
1 page
Sustainable Industrial Design and Waste Management... - (10.1 Introduction)
No ratings yet
Sustainable Industrial Design and Waste Management... - (10.1 Introduction)
1 page
CYPHER Brand Guidelines
No ratings yet
CYPHER Brand Guidelines
24 pages
Topic-Related Vocabulary by Diyorbek Tursunboyev
No ratings yet
Topic-Related Vocabulary by Diyorbek Tursunboyev
69 pages
Design, Development and Optimization of Nano Emulsified Drug Delivery System of Poorly Permeable Drugs
No ratings yet
Design, Development and Optimization of Nano Emulsified Drug Delivery System of Poorly Permeable Drugs
2 pages
JPTS Institute of Science Management and
No ratings yet
JPTS Institute of Science Management and
17 pages
Technical Writing
No ratings yet
Technical Writing
9 pages
CUET UG Psychology Sample Paper2 (Sscstudy - Com)
No ratings yet
CUET UG Psychology Sample Paper2 (Sscstudy - Com)
8 pages

Assignment 3

Uploaded by

Assignment 3

Uploaded by

Assignment -03

from sklearn import datasets

df = [Link](data= np.c_[iris['data'], iris['target']],

columns= iris['feature_names'] + ['target'])

0 5.1 3.5 1.4 0.2 0.0

1 4.9 3.0 1.4 0.2 0.0

2 4.7 3.2 1.3 0.2 0.0

3 4.6 3.1 1.5 0.2 0.0

4 5.0 3.6 1.4 0.2 0.0

... ... ... ... ... ...

145 6.7 3.0 5.2 2.3 2.0

146 6.3 2.5 5.0 1.9 2.0

147 6.5 3.0 5.2 2.0 2.0

148 6.2 3.4 5.4 2.3 2.0

149 5.9 3.0 5.1 1.8 2.0

150 rows × 5 columns

print(df['sepal length (cm)'].mean())

print(df['sepal width (cm)'].mean())

print(df['sepal width (cm)'].mean())

print(df['petal length (cm)'].mode())

print(df['sepal length (cm)'].mode())

print(df['sepal width (cm)'].mode())

print(df['sepal width (cm)'].mode())

Name: petal length (cm), dtype: float64

Name: sepal length (cm), dtype: float64

Name: sepal width (cm), dtype: float64

Name: sepal width (cm), dtype: float64

print(df['petal length (cm)'].median())

print(df['sepal length (cm)'].median())

print(df['sepal width (cm)'].median())

print(df['sepal width (cm)'].median())

print(df['petal length (cm)'].std())

print(df['sepal length (cm)'].std())

print(df['sepal width (cm)'].std())

print(df['sepal width (cm)'].std())

print(df['petal length (cm)'].min())

print(df['sepal length (cm)'].min())

print(df['sepal width (cm)'].min())

print(df['sepal width (cm)'].min())

print(df['petal length (cm)'].max())

print(df['sepal length (cm)'].max())

print(df['sepal width (cm)'].max())

print(df['sepal width (cm)'].max())

import [Link] as plt

[Link](df['petal length (cm)'], bins=10)

[Link]('Petal Length Distribution')

[Link]('Petal Length (cm)')

[Link](df['sepal length (cm)'], bins=10)

[Link]('Sepal Length Distribution')

[Link]('Sepal Length (cm)')

[Link](df['sepal width (cm)'], bins=10)

[Link]('Sepal Width Distribution')

[Link]('Sepal Width (cm)')

[Link](df['petal width (cm)'], bins=10)

[Link]('Petal Width Distribution')

[Link]('Petal Width (cm)')

[Link]('Box Plots Before Outlier Removal')

Text(0.5, 1.0, 'Box Plots Before Outlier Removal')

def remove_outliers_iqr(df, column):

lower_bound = Q1 - 1.5 * IQR

upper_bound = Q3 + 1.5 * IQR

df_filtered = df[(df[column] >= lower_bound) & (df[column] <= upper_bound)]

[Link]('Box Plots After Outlier Removal')

You might also like