0% found this document useful (0 votes)

64 views

Data Frame

A dataframe is a two-dimensional data structure that can be used to represent data in the form of rows and columns like a spreadsheet or SQL table. It is the most commonly used pandas object for storing and manipulating data. Once data is stored in a dataframe, various operations can be performed to analyze and understand the data. A dataframe has row and column indices and can contain heterogeneous data. Dataframes allow data to be accessed and manipulated using row labels or column names. Boolean indexing allows selecting data from dataframes using boolean vectors.

Uploaded by

Sameeksha Kosaria

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

64 views

Data Frame

Uploaded by

Sameeksha Kosaria

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

DATAFRAME

DATAFRAME-It is a two-dimensional object that is useful in

representing data in the form of rows and columns. It is similar to a
spreadsheet or an SQL table. This is the most commonly used pandas
object. Once we store the data into the Dataframe, we can perform
various operations that are useful in analyzing and understanding the
data.

DATAFRAME STRUCTURE

COLUMNS PLAYERNAME IPLTEAM BASEPRICEINCR

0 ROHIT MI 13

1 VIRAT RCB 17

2 HARDIK MI 14

INDEX DATA

PROPERTIES OF DATAFRAME

A Dataframe has axes (indices)-

Row index (axis=0)
Column index (axes=1)
It is similar to a spreadsheet , whose row index is called index and
column index is called column name.
A Dataframe contains Heterogeneous data.
A Dataframe Size is Mutable.
A Dataframe Data is Mutable.
A data frame can be created using any of the following-

1. Series
2. Lists
3. Dictionary
4. A numpy 2D array

How to create Dataframe From Series

Program-
Output-
import pandas as pd
0
s = pd.Series(['a','b','c','d']) 0 a
1 b Default Column Name As 0
df=pd.DataFrame(s)
2 c
print(df) 3 d
DataFrame from Dictionary of Series

Example-

DataFrame from List of Dictionaries

Example-
Iteration on Rows and Columns

If we want to access record or data from a data frame row wise or

column wise then iteration is used. Pandas provide 2 functions to
perform iterations-

1. iterrows ()
2. iteritems ()

iterrows()

It is used to access the data row wise. Example-

iteritems()

It is used to access the data column wise.

Example-
Select operation in data frame

To access the column data ,we can mention the column name as
subscript.
e.g. - df[empid] This can also be done by using df.empid.
To access multiple columns we can write as df[ [col1, col2,---] ]

Example -
>>df.empid or df[‘empid’]
0 101
1 102
2 103
3 104
4 105
5 106
Name: empid, dtype: int64

>>df[[‘empid’,’ename’]]
empid ename
101 Sachin
102 Vinod
103 Lakhbir
104 Anil
105 Devinder
106 UmaSelvi
To Add & Rename a column in data
frame

import pandas as pd

s = pd.Series([10,15,18,22])

df=pd.DataFrame(s)

df.columns=[‘List1’] To Rename the default column of Data

Frame as List1

df[‘List2’]=20 To create a new column List2 with all values

as 20

df[‘List3’]=df[‘List1’]+df[‘List2’] Output-

Add Column1 and Column2 and store in List1 List2 List3

0 10 20 30
New column List3 1 15 20 35
2 18 20 38
print(df) 3 22 20 42
To Delete a Column in data frame

We can delete the column from a data frame by using any of

the the following –
1. del
2. pop()
3. drop()

>>del df[‘List3’] We can simply delete a column by passing

column name in subscript with df
>>df
Output-

List1 List2
0 10 20
1 15 20
2 18 20
3 22 20

>>df.pop(‘List2’) we can simply delete a column by passing column

name in pop method.
>>df

List1
0 10
1 15
2 18
3 22
To Delete a Column Using drop()

import pandas as pd
s= pd.Series([10,20,30,40])
df=pd.DataFrame(s)
df.columns=[‘List1’]
df[‘List2’]=40
df1=df.drop(‘List2’,axis=1) (axis=1) means to delete Data
column wise
df2=df.drop(index=[2,3],axis=0) (axis=0) means to delete
data row wise with given index
print(df)
print(“ After deletion::”)
print(df1)
print (“ After row deletion::”)
print(df2)

Output-
List1 List2
0 10 40
1 20 40
2 30 40
3 40 40
After deletion::
List1
0 10
1 20
2 30
3 40
After row deletion::
List1
0 10
1 20
Accessing the data frame through loc()
and iloc() method or indexing using Labels

Pandas provide loc() and iloc() methods to access the subset from a
data frame using row/column.

Accessing the data frame through loc()

It is used to access a group of rows and columns.

Syntax-

Df.loc[StartRow : EndRow, StartColumn : EndColumn]

Note -If we pass : in row or column part then pandas provide the entire
rows or columns respectively.

To access a single row

To access multiple Rows Qtr1 to Qtr3

Example 2:-

To access single column

To access Multiple Column namely TCS and WIPRO

Example-3

To access first row

To access first 3 Rows

Accessing the data frame through iloc()

It is used to access a group of rows and columns based on numeric

index value.

Syntax-

Df.loc[StartRowindexs : EndRowindex, StartColumnindex : EndColumnindex]

Note -If we pass : in row or column part then pandas provide

the entire rows or columns respectively.

To access First two Rows

and Second column

To access all Rows and First

Two columns Record
head() andVisittail() Method
Python4csip.com for more update s

The method head() gives the first 5 rows and the method
tail() returns the last 5 rows.
import pandas as pd
empdata={ 'Doj':['12-01-2012','15-01-2012','05-09-2007',
'17-01-2012','05-09-2007','16-01-2012'],
'empid':[101,102,103,104,105,106],
'ename':['Sachin','Vinod','Lakhbir','Anil','Devinder','UmaSelvi']
}
df=pd.DataFrame(empdata)
print(df)
print(df.head())
print(df.tail())
Output-
Doj empid ename
0 12-01-2012 101 Sachin
1 15-01-2012 102 Vinod
2 05-09-2007 103 Lakhbir Data Frame
3 17-01-2012 104 Anil
4 05-09-2007 105 Devinder
5 16-01-2012 106 UmaSelvi
Doj empid ename
0 12-01-2012 101 Sachin
1 15-01-2012 102 Vinod head() displays first 5 rows
2 05-09-2007 103 Lakhbir
3 17-01-2012 104 Anil
4 05-09-2007 105 Devinder
Doj empid ename
1 15-01-2012 102 Vinod
2 05-09-2007 103 Lakhbir
3 17-01-2012 104 Anil tail() display last 5 rows
4 05-09-2007 105 Devinder
5 16-01-2012 106 UmaSelvi
CREATED BY: PRAKASH KUMAR DEWANGAN, PGT(COMPUTER SCIENCE), KV NO.1 RAIPUR (SHIFT-2)
To display first 2 rows we can use head(2) and to returns last2
rows we can use tail(2) and to return 3rd to 4th row we can write
df[2:5].
import pandas as pd
empdata={ 'Doj':['12-01-2012','15-01-2012','05-09-2007',
'17-01-2012','05-09-2007','16-01-2012'],
'empid':[101,102,103,104,105,106],
'ename':['Sachin','Vinod','Lakhbir','Anil','Devinder','UmaSelvi']
}
df=pd.DataFrame(empdata)
print(df)
print(df.head(2))
print(df.tail(2))
print(df[2:5])
Output-
Doj empid ename
0 12-01-2012 101 Sachin
1 15-01-2012 102 Vinod
2 05-09-2007 103 Lakhbir
3 17-01- 2012 104 Anil
4 05-09-2007 105 Devinder
5 16-01-2012 106 UmaSelvi

Doj empid ename

0 12-01-2012 101 Sachin head(2) displays first 2 rows
1 15-01-2012 102 Vinod

Doj empid ename

4 05-09-2007 105 Devinder tail(2) displays last 2 rows
5 16-01-2012 106 UmaSelvi
Doj empid ename
2 05-09-2007 103 Lakhbir
3 17-01- 2012 104 Anil df[2:5] display 2nd to 4th row
4 05-09-2007 105 Devinder

CREATED BY: PRAKASH KUMAR DEWANGAN, PGT(COMPUTER SCIENCE) , KV NO.1 RAIPUR (SHIFT-2)
Boolean Indexing in Data Frame

Boolean indexing helps us to select the data from the DataFrames

using a boolean vector. We create a DataFrame with a boolean index to
use the boolean indexing.

To Return Data frame where index is True

We can pass only integer value in iloc

CREATED BY: PRAKASH KUMAR DEWANGAN, PGT(COMPUTER SCIENCE) , KV NO.1 RAIPUR (SHIFT-2)

Safety and Security Manual For Safety Manager SC
0% (1)
Safety and Security Manual For Safety Manager SC
171 pages
CAPE Digital Media Syllabus With Specimen Papers 2020
50% (4)
CAPE Digital Media Syllabus With Specimen Papers 2020
116 pages
Cloudivs 3000S
No ratings yet
Cloudivs 3000S
6 pages
Data Handing Using Pandas-I
100% (2)
Data Handing Using Pandas-I
46 pages
Pandas Dataframe activity_removed_removed (1)_removed
No ratings yet
Pandas Dataframe activity_removed_removed (1)_removed
11 pages
Data Handling Using Pandas-I-ORG
No ratings yet
Data Handling Using Pandas-I-ORG
44 pages
Dataframe Notes
No ratings yet
Dataframe Notes
47 pages
Data Handlinng Using Pandas-I (1) - 18-31
No ratings yet
Data Handlinng Using Pandas-I (1) - 18-31
14 pages
Data Handlinng Using Pandas-I
No ratings yet
Data Handlinng Using Pandas-I
46 pages
1 Data Handlinng Using Pandas-I
No ratings yet
1 Data Handlinng Using Pandas-I
46 pages
Pandas Class 12 Ncertttt
No ratings yet
Pandas Class 12 Ncertttt
48 pages
Class XII Data Handlinng Using PandasI
No ratings yet
Class XII Data Handlinng Using PandasI
46 pages
DATAFRAME (1)
No ratings yet
DATAFRAME (1)
16 pages
Pandas
No ratings yet
Pandas
13 pages
UNIT 1 PYTHON PROGRAMMING-II
No ratings yet
UNIT 1 PYTHON PROGRAMMING-II
15 pages
IP 12th Chapter 3
No ratings yet
IP 12th Chapter 3
9 pages
exp3 python (1)
No ratings yet
exp3 python (1)
15 pages
Ainotes dataframe
No ratings yet
Ainotes dataframe
5 pages
Class X11 Dataframe Notes PDF
No ratings yet
Class X11 Dataframe Notes PDF
17 pages
EDS - Python Cheat Sheet
No ratings yet
EDS - Python Cheat Sheet
3 pages
09_Pandas slides
No ratings yet
09_Pandas slides
33 pages
Dataframes-I (Create & Selection)
No ratings yet
Dataframes-I (Create & Selection)
10 pages
12 IP Dataframe and Pyplot Notes
No ratings yet
12 IP Dataframe and Pyplot Notes
14 pages
Data Handling using pandas - I Q & ANS (1)
No ratings yet
Data Handling using pandas - I Q & ANS (1)
9 pages
Python Coding Interview Questions On DataFrame and Zip
No ratings yet
Python Coding Interview Questions On DataFrame and Zip
6 pages
Data Science - Unit-3-Part-2
No ratings yet
Data Science - Unit-3-Part-2
32 pages
12 IP Notes On Series
No ratings yet
12 IP Notes On Series
5 pages
14_Pandas
No ratings yet
14_Pandas
25 pages
Block 1-Data Handling Using Pandas DataFrame
No ratings yet
Block 1-Data Handling Using Pandas DataFrame
17 pages
12 Pandas
No ratings yet
12 Pandas
9 pages
Pandas DataFrame Notes
100% (1)
Pandas DataFrame Notes
6 pages
Unit 4 DSE
No ratings yet
Unit 4 DSE
9 pages
dav 2 unit
No ratings yet
dav 2 unit
55 pages
Pandas DataFrame1
No ratings yet
Pandas DataFrame1
22 pages
Class Notes Class: XII Date: 17-04-2021 Subject: Informatics Practices Topic: Chapter-1
No ratings yet
Class Notes Class: XII Date: 17-04-2021 Subject: Informatics Practices Topic: Chapter-1
5 pages
R Basic and Advanced
No ratings yet
R Basic and Advanced
9 pages
IP Practic MINE
No ratings yet
IP Practic MINE
30 pages
PPT for Assignment-3 (Final_Pandas_Lab)
No ratings yet
PPT for Assignment-3 (Final_Pandas_Lab)
40 pages
Pandas 2 Complete Notes Class XII
No ratings yet
Pandas 2 Complete Notes Class XII
18 pages
Data frames pandas, handout 1 (1)
No ratings yet
Data frames pandas, handout 1 (1)
16 pages
12 IP Unit 1 Python Pandas I (Part 3 Dataframes) Notes
No ratings yet
12 IP Unit 1 Python Pandas I (Part 3 Dataframes) Notes
24 pages
Movie Ticket Data Analysis System (Ip Class 12) (2024-25)
No ratings yet
Movie Ticket Data Analysis System (Ip Class 12) (2024-25)
26 pages
Pandas Dataframe
No ratings yet
Pandas Dataframe
48 pages
IP Imp Notes
No ratings yet
IP Imp Notes
5 pages
Chapter 2 Data Handling using pandas - I(DATA FRAME)
No ratings yet
Chapter 2 Data Handling using pandas - I(DATA FRAME)
15 pages
DataFrame Notes1
No ratings yet
DataFrame Notes1
32 pages
Pandas 1705297450
No ratings yet
Pandas 1705297450
21 pages
Pandas in Python
No ratings yet
Pandas in Python
59 pages
Python Lab
No ratings yet
Python Lab
8 pages
Pandas_Dataframe_All_Operations_1735471870
No ratings yet
Pandas_Dataframe_All_Operations_1735471870
4 pages
PYTHON UNIT-5 Part-C
No ratings yet
PYTHON UNIT-5 Part-C
4 pages
Tutorial
No ratings yet
Tutorial
7 pages
Oxy Metre
No ratings yet
Oxy Metre
17 pages
Chapter 2 Python Pandas - II
No ratings yet
Chapter 2 Python Pandas - II
19 pages
Pandas CheatSheet
No ratings yet
Pandas CheatSheet
18 pages
IP - Pandas 1 & 2 (Worksheet) Class 12
No ratings yet
IP - Pandas 1 & 2 (Worksheet) Class 12
16 pages
Pandas Dataframe1
No ratings yet
Pandas Dataframe1
43 pages
Python Pandas Demo PDF
100% (2)
Python Pandas Demo PDF
23 pages
fods lab
No ratings yet
fods lab
36 pages
12 Ip Dataframes Notes
No ratings yet
12 Ip Dataframes Notes
7 pages
IP-LAB-FILE-PYTHON
No ratings yet
IP-LAB-FILE-PYTHON
9 pages
R Syntax Examples 1
No ratings yet
R Syntax Examples 1
6 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Chhattisgarh List
No ratings yet
Chhattisgarh List
28 pages
Ieo Imp Que
No ratings yet
Ieo Imp Que
26 pages
LAW-Indian Partnership Act
No ratings yet
LAW-Indian Partnership Act
25 pages
Not - For - Profit Organisations (Npo) : Receipts Payments Income Expenditure Assets Liabilities
No ratings yet
Not - For - Profit Organisations (Npo) : Receipts Payments Income Expenditure Assets Liabilities
1 page
Math Magic 2
No ratings yet
Math Magic 2
5 pages
Differential Equations Part2
No ratings yet
Differential Equations Part2
14 pages
Sudarshan Guha JDLee
100% (2)
Sudarshan Guha JDLee
3 pages
Get Practical Python Data Visualization: A Fast Track Approach To Learning Data Visualization With Python Ashwin Pajankar PDF ebook with Full Chapters Now
100% (2)
Get Practical Python Data Visualization: A Fast Track Approach To Learning Data Visualization With Python Ashwin Pajankar PDF ebook with Full Chapters Now
40 pages
Recommended Guidelines For The Collection and Use of Geospatially Referenced Data
No ratings yet
Recommended Guidelines For The Collection and Use of Geospatially Referenced Data
114 pages
300 Level Course Outlines_adeniyi Peter i
No ratings yet
300 Level Course Outlines_adeniyi Peter i
1 page
Oberheim DPX-1 OS Firmware 2.1 & 2.2 Addendum
No ratings yet
Oberheim DPX-1 OS Firmware 2.1 & 2.2 Addendum
20 pages
NetCare-USER Guideline v2
No ratings yet
NetCare-USER Guideline v2
14 pages
Mil-Module-4qtr-Week 9-10
No ratings yet
Mil-Module-4qtr-Week 9-10
7 pages
Picture Book Homework
100% (1)
Picture Book Homework
6 pages
DD P Ddos 7 7 Admin Guide en Us
No ratings yet
DD P Ddos 7 7 Admin Guide en Us
396 pages
B311-221 10.0.1.1 (H187SP60C983) Firmware Release Notes
No ratings yet
B311-221 10.0.1.1 (H187SP60C983) Firmware Release Notes
10 pages
Loytec: Installation Instructions
No ratings yet
Loytec: Installation Instructions
4 pages
Business Intelligence in Pharmaceutical Industry
100% (1)
Business Intelligence in Pharmaceutical Industry
12 pages
Guide to Making Every App Intelligent With Embedded Analytics Microstrategy
No ratings yet
Guide to Making Every App Intelligent With Embedded Analytics Microstrategy
13 pages
5a Terraform Modules Sources
No ratings yet
5a Terraform Modules Sources
7 pages
Java Features From Java 8 To Java 17
No ratings yet
Java Features From Java 8 To Java 17
7 pages
तांत्रिक मुद्रा विज्ञान - Tantrik Mudra Vijnan PDF
100% (2)
तांत्रिक मुद्रा विज्ञान - Tantrik Mudra Vijnan PDF
19 pages
Assignment 2 Compiler Design: Name-Akash Deep Das Rollno:-SBU190275
No ratings yet
Assignment 2 Compiler Design: Name-Akash Deep Das Rollno:-SBU190275
6 pages
Download ebooks file Microeconometrics Using Stata Cross Sectional and Panel Regression Models 2nd Edition A Colin Cameron Pravin K Trivedi all chapters
100% (10)
Download ebooks file Microeconometrics Using Stata Cross Sectional and Panel Regression Models 2nd Edition A Colin Cameron Pravin K Trivedi all chapters
40 pages
ANEVH(F) introduction (1)
No ratings yet
ANEVH(F) introduction (1)
25 pages
Phishing Documentation
No ratings yet
Phishing Documentation
31 pages
S.Y Syllabus
No ratings yet
S.Y Syllabus
57 pages
Full Download Crypto Dictionary: 500 Tasty Tidbits For The Curious Cryptographer 1st Edition Jean-Philippe Aumasson PDF
100% (3)
Full Download Crypto Dictionary: 500 Tasty Tidbits For The Curious Cryptographer 1st Edition Jean-Philippe Aumasson PDF
62 pages
SQL - Structured Query Language
No ratings yet
SQL - Structured Query Language
24 pages
Audit in An It Environment
No ratings yet
Audit in An It Environment
5 pages
Cryptography: (Slides Edited by Erin Chambers)
No ratings yet
Cryptography: (Slides Edited by Erin Chambers)
31 pages